That said, the public models don't even get bronze.
[EDIT] Dupe of this: https://news.ycombinator.com/item?id=44614872
What techniques were used? He references scaling up test-time compute, so I have to assume they threw a boatload of money at this. I've heard talk of running models in parallel and comparing results - if OpenAI ran this 10000 times in parallel and cherry-picked the best one, this is a lot less exciting.
If this is legit, then I really want to know what tools were used and how the model used them.
Indeed.
Yes, you absolutely can run Kimi-K2-Instruct on a PC with:
:white_check_mark: CPU: AMD Ryzen 7 5800X3D :white_check_mark: GPU: NVIDIA RTX 3090 (24 GB VRAM) :white_check_mark: RAM: 64 GB system memory This is more than sufficient for both:
Loading and running the full Kimi-K2-Instruct model in FP16 or INT8, and Quantizing it with weight-only INT8 using Hugging Face Optimum + bitsandbytes.
Kimi k2 has a trillion parameters and even an 8 bit quant would need half a gig of system ram +vram
This is with the free chatGPT that us peasants use. I dont have the means to run grok4 heavy, deep seek or kimi k2 to ask them.
I cant wait to see what accidental wars will start when we put ai in the kill chain
Bottom line: Your 5800X3D + 64 GB RAM + RTX 3090 will run Kimi K2’s 1.8‑bit build, but response times feel more like a leisurely typewriter than a snappy chatbot. If you want comfortable day‑to‑day use, plan either a RAM upgrade or a second (or bigger) GPU—or just hit the Moonshot API and save some waiting.
I rarely use 4o anymore for anything. Rather would I wait for o3 than quickly get a pile of rubbish.
o3 all the way for anything needing analysis or creative thought.
anonzzzies•12h ago
ImHereToVote•11h ago
owebmaster•11h ago
My threshold is when it can create a new Google
ImHereToVote•11h ago
kasey_junk•11h ago
ImHereToVote•11h ago
Fade_Dance•11h ago
So in my eyes actually think it's probably more to do with reducing the cost of AI inference by another order of magnitude, at least when it comes to mass market tools. Existing basic code-generation tools from a single AI are already fairly expensive to run compute wise.
elif•11h ago
perching_aix•11h ago
AaronAPU•5h ago
thm•11h ago
torginus•11h ago
plemer•11h ago
Association fallacy: “You know who else was a vegetarian? Hitler.”
haneul•11h ago
You should totally give Claude Code a try. The biggest problem is that it is glaze-optimized, so have to work at getting it to not treat you like the biggest genius of all time. But when you manage to get in a good flow with it, and your project is very predictably searchable, results start to be quite helpful, even if just to unstuck yourself when you're in a rut.
reactordev•11h ago
torginus•2h ago
jug•11h ago
reactordev•11h ago
I can count on my hands the number of enterprises that actually have AI models of their own.
bdangubic•11h ago
reactordev•10h ago
jgalt212•9h ago
garciasn•10h ago
I use Claude Code for building products that don’t have these limitations. And fuck is it amazing. Even little things that would have taken days are done in a single line of text.
rvz•10h ago
And somehow these companies are now "AI companies", just like in the 2010s your average food market down the street was a "tech company" or the bakery next to it is now a "blockchain company". This happens all the time with bubbles and mania.
These enterprises today appear even more confused about what they do to rebrand themselves and it's a sign they are desperate for survival.
anonzzzies•11h ago
sorokod•11h ago
https://hitchhikers.fandom.com/wiki/Golgafrincham
threatripper•11h ago
shiandow•11h ago
brookst•11h ago
The fact that I see people being paid to dig a trench does not make me doubt the existence of trenching machines. It just means that the tool is not always the best choice for every job.
rvz•10h ago
It is that and an autonomous system that can generate $100BN dollars in profits. (OpenAI and Microsoft's definition of AGI)
So maybe when we see a commercial airplane with no human pilots on board but an LLM piloting the plane with no intervention needed?
Would you board such a plane?
graycat•8h ago
(1) Given triangle ABC, by means of Euclidean construction find point D on line AB and point E on line BC so that the lengths |AD| = |DE| = |EC.
(2) Given triangle ABC, by means of Euclidean construction inscribe a square so that each corner of the square is on a side of the triangle.
Come ON AGI, let's have some RESULTS that human general intelligence can do -- gee, I solved (1) in the 10th grade.
swat535•7h ago