
Ask HN: Company is rapidly cutting AI tool spend how to prep team?

3•Snakes3727•45m ago
The company I work for is rapidly planning to scale down its AI tooling spend. Claude Code access is basically getting removed, and people are forbidden from using personal plans.

The reasoning is cost: apparently our monthly Claude bill has become astronomical for the org, nearly 3x our SaaS's cloud spend.

Apparently we are going to get limited access to Codex on severely reduced plans.

I have tried some local models such as Kimi; however, most are barely functional.

I am very concerned, as the amount of work done is expected to remain consistent. Even ignoring the fact that teams have built entire workflows around Claude, I am very worried and frustrated.

How can I help my team ease this transition? Are there local models that run well on local machines that only have 16GB of RAM?

Comments

itg•40m ago
The 16GB of RAM will really limit you. What about trying OpenRouter and using cheaper models such as Kimi instead of running them locally?
Snakes3727•32m ago
Given our field, we cannot really use anything not approved by management. Pretty much, if it doesn't leave our machine we can use it; it's just that I can't find anything good. We even have some new devs on the MacBook Neos, and I can't find anything for them either.

I was considering having something run locally within our building, but that would not be available in the near term, so I am trying to make the best of what I have.
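
As a rough check on the 16GB and 8GB limits discussed in this sub-thread, the memory needed for model weights alone is approximately the parameter count times the bits per weight, divided by 8. The sketch below applies that arithmetic. The model sizes, the function names, and the `reserved_gb` headroom for the OS, editor, and context cache are illustrative assumptions, not measurements from any particular model or machine.

```python
def weight_memory_gb(params_billions: float, bits_per_weight: float) -> float:
    """Approximate memory for model weights alone, in GB (10^9 bytes)."""
    bytes_total = params_billions * 1e9 * bits_per_weight / 8
    return bytes_total / 1e9


def fits_in_ram(params_billions: float, bits_per_weight: float,
                ram_gb: float, reserved_gb: float = 6.0) -> bool:
    """True if the weights fit in the RAM left after reserving headroom.

    reserved_gb is an assumed figure covering the OS, IDE, browser, and
    the model's context cache; adjust it for your own machines.
    """
    return weight_memory_gb(params_billions, bits_per_weight) <= ram_gb - reserved_gb


if __name__ == "__main__":
    # Hypothetical model sizes (billions of parameters) and quantization levels.
    for size in (3, 7, 14, 32):
        for bits in (16, 8, 4):
            need = weight_memory_gb(size, bits)
            print(f"{size}B @ {bits}-bit: ~{need:.1f} GB of weights | "
                  f"16GB machine: {fits_in_ram(size, bits, 16)} | "
                  f"8GB machine: {fits_in_ram(size, bits, 8)}")
```

Under these assumptions, only small models at 4-bit quantization leave usable headroom on a 16GB laptop, and an 8GB machine has room for very little. That is consistent with the suggestion to look at hosted or centrally served options where policy allows.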

baigy•40m ago
Specifically: to explore your open-source options under compute limitations, ask the community at r/LocalLLaMA on Reddit. That's where the current SOTA open-source text-to-text models live.
Snakes3727•28m ago
Yeah, I was looking there earlier. Thankfully we mostly have MacBooks, but I recently found out new devs are getting the smaller 8GB RAM MacBooks as well, which is going to be even more frustrating.

Since my team is mostly remote, running an LLM on a cluster in the office is not really viable short term.

baigy•21m ago
This is totally going to suck, but here's one option that was suggested to me a few minutes ago: https://www.reddit.com/r/LocalLLaMA/comments/1th1mqx/comment... For context, I was asking about running anything OpenClaw-friendly on my RTX 4060 with 8GB of VRAM. I know yours is a more involved use case, but there's still some optionality here.
xvxvx•31m ago
My own company hired a young goon of a man to spearhead their AI initiative. Lots of smiles and arrogance from him. Fast forward 2 months and reality has hit. Weekly meetings asking for feedback draw blank stares as employees explain that Claude can’t do shit to help their workload. This kid is starting to sweat. I bet he’ll be gone by the summer. Hilarious.
Snakes3727•26m ago
Unfortunately, at my company leads have no insight into employees' Claude Code caps, and no one ever complained until now. Apparently some people were running with insane caps on CC (25k+); if you asked for it, you were approved. That led to some people doing insane things on CC for no purpose.
baigy•18m ago
Just setting up better SOPs around using AI for coding is going to help them a ton. They can write off the sunk cost as a "learning phase", with now being the time to use the lessons learnt to formulate some forward-looking standard operating procedures. No need to suddenly go cold turkey on AI. My 2 cents.
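
One concrete piece of such an SOP could be giving leads visibility into per-person spend against a cap, which the thread says was missing. The sketch below is a minimal illustration only: the record format, the function name, and the default cap value are hypothetical and do not reflect any vendor's billing export or API.

```python
from collections import defaultdict


def monthly_spend_report(usage_records, caps, default_cap=100.0):
    """Sum spend per user and flag anyone over their monthly cap.

    usage_records: iterable of (user, cost_usd) tuples (hypothetical format).
    caps: dict mapping user -> monthly cap in USD; default_cap applies otherwise.
    Returns a dict mapping user -> (total_spend, cap, over_cap).
    """
    totals = defaultdict(float)
    for user, cost in usage_records:
        totals[user] += cost

    report = {}
    for user, total in totals.items():
        cap = caps.get(user, default_cap)
        report[user] = (total, cap, total > cap)
    return report


if __name__ == "__main__":
    # Made-up records purely for illustration.
    records = [("alice", 60.0), ("alice", 50.0), ("bob", 20.0)]
    for user, (total, cap, over) in monthly_spend_report(records, {"bob": 50.0}).items():
        status = "OVER CAP" if over else "ok"
        print(f"{user}: ${total:.2f} of ${cap:.2f} ({status})")
```

A report like this, reviewed weekly, would let a team lower caps gradually instead of removing access all at once.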
