That said, the public models don't even get bronze.
[EDIT] Dupe of this: https://news.ycombinator.com/item?id=44614872
What techniques were used? He references scaling up test-time compute, so I have to assume they threw a boatload of money at this. I've heard talk of running models in parallel and comparing results - if OpenAI ran this 10000 times in parallel and cherry-picked the best one, this is a lot less exciting.
If this is legit, then I really want to know what tools were used and how the model used them.
Indeed.
Yes, you absolutely can run Kimi-K2-Instruct on a PC with:
:white_check_mark: CPU: AMD Ryzen 7 5800X3D :white_check_mark: GPU: NVIDIA RTX 3090 (24 GB VRAM) :white_check_mark: RAM: 64 GB system memory This is more than sufficient for both:
Loading and running the full Kimi-K2-Instruct model in FP16 or INT8, and Quantizing it with weight-only INT8 using Hugging Face Optimum + bitsandbytes.
Kimi k2 has a trillion parameters and even an 8 bit quant would need half a gig of system ram +vram
This is with the free chatGPT that us peasants use. I dont have the means to run grok4 heavy, deep seek or kimi k2 to ask them.
I cant wait to see what accidental wars will start when we put ai in the kill chain
Bottom line: Your 5800X3D + 64 GB RAM + RTX 3090 will run Kimi K2’s 1.8‑bit build, but response times feel more like a leisurely typewriter than a snappy chatbot. If you want comfortable day‑to‑day use, plan either a RAM upgrade or a second (or bigger) GPU—or just hit the Moonshot API and save some waiting.
I rarely use 4o anymore for anything. Rather would I wait for o3 than quickly get a pile of rubbish.
o3 all the way for anything needing analysis or creative thought.
anonzzzies•4h ago
ImHereToVote•4h ago
owebmaster•4h ago
My threshold is when it can create a new Google
ImHereToVote•3h ago
kasey_junk•4h ago
ImHereToVote•3h ago
Fade_Dance•3h ago
So in my eyes actually think it's probably more to do with reducing the cost of AI inference by another order of magnitude, at least when it comes to mass market tools. Existing basic code-generation tools from a single AI are already fairly expensive to run compute wise.
elif•4h ago
perching_aix•3h ago
thm•3h ago
torginus•3h ago
plemer•3h ago
Association fallacy: “You know who else was a vegetarian? Hitler.”
haneul•3h ago
You should totally give Claude Code a try. The biggest problem is that it is glaze-optimized, so have to work at getting it to not treat you like the biggest genius of all time. But when you manage to get in a good flow with it, and your project is very predictably searchable, results start to be quite helpful, even if just to unstuck yourself when you're in a rut.
reactordev•3h ago
jug•3h ago
reactordev•3h ago
I can count on my hands the number of enterprises that actually have AI models of their own.
bdangubic•3h ago
reactordev•3h ago
jgalt212•2h ago
garciasn•2h ago
I use Claude Code for building products that don’t have these limitations. And fuck is it amazing. Even little things that would have taken days are done in a single line of text.
rvz•2h ago
And somehow these companies are now "AI companies", just like in the 2010s your average food market down the street was a "tech company" or the bakery next to it is now a "blockchain company". This happens all the time with bubbles and mania.
These enterprises today appear even more confused about what they do to rebrand themselves and it's a sign they are desperate for survival.
anonzzzies•3h ago
sorokod•3h ago
https://hitchhikers.fandom.com/wiki/Golgafrincham
threatripper•3h ago
shiandow•3h ago
brookst•3h ago
The fact that I see people being paid to dig a trench does not make me doubt the existence of trenching machines. It just means that the tool is not always the best choice for every job.
rvz•2h ago
It is that and an autonomous system that can generate $100BN dollars in profits. (OpenAI and Microsoft's definition of AGI)
So maybe when we see a commercial airplane with no human pilots on board but an LLM piloting the plane with no intervention needed?
Would you board such a plane?
graycat•33m ago
(1) Given triangle ABC, by means of Euclidean construction find point D on line AB and point E on line BC so that the lengths |AD| = |DE| = |EC.
(2) Given triangle ABC, by means of Euclidean construction inscribe a square so that each corner of the square is on a side of the triangle.
Come ON AGI, let's have some RESULTS that human general intelligence can do -- gee, I solved (1) in the 10th grade.