Reasoning is cost apparently our monthly Claude bill has become astronomical for the org. Nearly 3x our saas's cloud spend.
Apparently we are going to get limited access to codex at severely reduced plans.
I have tried some local models such as Kimi, however most are barely functional.
I am very concerned as the expectation of amount of work done is to remain consistent. Ignoring the fact teams have made entire workflows around Claude I am very worried and frustrated.
How can I help my team ease this transition? Are their local models that run well on local machines that only have 16gb ram?
itg•40m ago
Snakes3727•32m ago
I was considering having something run locally within out building but the time when something like that would be avaliable is not near term so i am trying to make the best of what i can do.