frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Kimi Released Kimi K2.5, Open-Source Visual SOTA-Agentic Model

https://www.kimi.com/blog/kimi-k2-5.html
105•nekofneko•2h ago

Comments

billyellow•2h ago
Cool
mangolie•2h ago
they cooked
jumploops•2h ago
> For complex tasks, Kimi K2.5 can self-direct an agent swarm with up to 100 sub-agents, executing parallel workflows across up to 1,500 tool calls.

> K2.5 Agent Swarm improves performance on complex tasks through parallel, specialized execution [..] leads to an 80% reduction in end-to-end runtime

Not just RL on tool calling, but RL on agent orchestration, neat!

DeathArrow•1h ago
Those are some impressive benchmark results. I wonder how well it does in real life.

Maybe we can get away with something cheaper than Claude for coding.

oneneptune•1h ago
I'm curious about the "cheaper" claim -- I checked Kimi pricing, and it's a $200/mo subscription too?
NitpickLawyer•1h ago
On openrouter 2.5 is at 0.60/3$ per Mtok. That's haiku pricing.
mrklol•32m ago
They also have a $20 and $40 tier.
spaceman_2020•1h ago
Kimi was already one of the best writing models. Excited to try this one out
Tepix•1h ago
Huggingface Link: https://huggingface.co/moonshotai/Kimi-K2.5

1T parameters, 32b active parameters.

License: MIT with the following modification:

Our only modification part is that, if the Software (or any derivative works thereof) is used for any of your commercial products or services that have more than 100 million monthly active users, or more than 20 million US dollars (or equivalent in other currencies) in monthly revenue, you shall prominently display "Kimi K2.5" on the user interface of such product or service.

Imustaskforhelp•26m ago
Hey have they open sourced all Kimi k2.5 (thinking,instruct,agent,agent swarm [beta])?

Because I feel like they mentioned that agent swarm is available their api and that made me feel as if it wasn't open (weights)*? Please let me know if all are open source or not?

dheera•21m ago
> or more than 20 million US dollars (or equivalent in other currencies) in monthly revenue, you shall prominently display "Kimi K2.5" on the user interface of such product or service.

Why not just say "you shall pay us 1 million dollars"?

clayhacks•14m ago
I assume this allows them to sue for different amounts. And not discourage too many people from using it.
lrvick•1h ago
Actually open source, or yet another public model, which is the equivalent of a binary?

URL is down so cannot tell.

Tepix•1h ago
It's open weights, not open source.
Reubend•1h ago
I've read several people say that Kimi K2 has a better "emotional intelligence" than other models. I'll be interested to see whether K2.5 continues or even improves on that.
storystarling•32m ago
yes, though this is highly subjective - it 'feels' like that to me as well (comapred to Gemini 3, GPT 5.2, Opus 4.5).
pplonski86•55m ago
There are so many models, is there any website with list of all of them and comparison of performance on different tasks?
coffeeri•51m ago
There is https://artificialanalysis.ai
pplonski86•4m ago
Thank you! Exactly what I was looking for
Reubend•47m ago
The post actually has great benchmark tables inside of it. They might be outdated in a few months, but for now, it gives you a great summary. Seems like Gemini wins on image and video perf, Claude is the best at coding, ChatGPT is the best for general knowledge.

But ultimately, you need to try them yourself on the tasks you care about and just see. My personal experience is that right now, Gemini Pro performs the best at everything I throw at it. I think it's superior to Claude and all of the OSS models by a small margin, even for things like coding.

Imustaskforhelp•24m ago
I like Gemini Pro's UI over Claude so much but honestly I might start using Kimi K2.5 if its open source & just +/- Gemini Pro/Chatgpt/Claude because at that point I feel like the results are negligible and we are getting SOTA open source models again.
striking•53m ago
https://archive.is/P98JR
zmmmmm•52m ago
Curious what would be the most minimal reasonable hardware one would need to deploy this locally?
NitpickLawyer•24m ago
I parsed "reasonable" as in having reasonable speed to actually use this as intended (in agentic setups). In that case, it's a minimum of 70-100k for hardware (8x 6000 PRO + all the other pieces to make it work). The model comes with native INT4 quant, so ~600GB for the weights alone. An 8x 96GB setup would give you ~160GB for kv caching.

You can of course "run" this on cheaper hardware, but the speeds will not be suitable for actual use (i.e. minutes for a simple prompt, tens of minutes for high context sessions per turn).

rvz•10m ago
The chefs at Moonshot have cooked once again.
Jackson__•8m ago
As your local vision nut, their claims about "SOTA" vision are absolutely BS in my tests.

Sure it's SOTA at standard vision benchmarks. But on tasks that require proper image understanding, see for example BabyVision[0] it appears very much lacking compared to Gemini 3 Pro.

[0] https://arxiv.org/html/2601.06521v1

Heathrow scraps liquid container limit

https://www.bbc.com/news/articles/c1evvx89559o
186•robotsliketea•3d ago•248 comments

The state of Linux music players in 2026

https://crescentro.se/posts/linux-music-players-2026/
23•signa11•52m ago•14 comments

When two years of academic work vanished with a single click

https://www.nature.com/articles/d41586-025-04064-7
20•stmw•4d ago•10 comments

Kimi Released Kimi K2.5, Open-Source Visual SOTA-Agentic Model

https://www.kimi.com/blog/kimi-k2-5.html
105•nekofneko•2h ago•26 comments

A list of fun destinations for telnet

https://telnet.org/htm/places.htm
56•tokyobreakfast•4h ago•11 comments

The hidden engineering of runways

https://practical.engineering/blog/2026/1/20/the-hidden-engineering-of-runways
285•crescit_eundo•6d ago•70 comments

ChatGPT Containers can now run bash, pip/npm install packages and download files

https://simonwillison.net/2026/Jan/26/chatgpt-containers/
296•simonw•12h ago•225 comments

Russia using Interpol's wanted list to target critics abroad, leak reveals

https://www.bbc.com/news/articles/c20gg729y1yo
28•breve•1h ago•4 comments

Apple introduces new AirTag with longer range and improved findability

https://www.apple.com/newsroom/2026/01/apple-introduces-new-airtag-with-expanded-range-and-improv...
402•meetpateltech•18h ago•505 comments

AI code and software craft

https://alexwennerberg.com/blog/2026-01-25-slop.html
147•alexwennerberg•14h ago•81 comments

There is an AI code review bubble

https://www.greptile.com/blog/ai-code-review-bubble
245•dakshgupta•16h ago•163 comments

Windows 11's Patch Tuesday nightmare gets worse

https://www.windowscentral.com/microsoft/windows-11/windows-11s-botched-patch-tuesday-update-nigh...
265•01-_-•17h ago•192 comments

Dithering – Part 2: The Ordered Dithering

https://visualrambling.space/dithering-part-2/
182•ChrisArchitect•12h ago•20 comments

France passes bill to ban social media use by under-15s

https://www.rte.ie/news/europe/2026/0127/1555251-france-social-media-ban/
73•austinallegro•1h ago•56 comments

JuiceSSH – Give me my pro features back

https://nproject.io/blog/juicessh-give-me-back-my-pro-features/
288•jandeboevrie•14h ago•129 comments

RIP Low-Code 2014-2025

https://www.zackliscio.com/posts/rip-low-code-2014-2025/
207•zackliscio•16h ago•93 comments

People who know the formula for WD-40

https://www.wsj.com/business/the-secret-society-of-people-who-know-the-formula-for-wd-40-e9c0ff54
139•fortran77•11h ago•220 comments

Knapsack Offline Internet Solution (satellite datacasting)

https://www.netfreedompioneers.org/knapsack-content-station/
13•us321•3d ago•4 comments

Model Market Fit

https://www.nicolasbustamante.com/p/model-market-fit
47•nbstme•6d ago•8 comments

New York Times games are hard: A computational perspective

https://arxiv.org/abs/2509.10846
17•PaulHoule•4d ago•1 comments

I let ChatGPT analyze a decade of my Apple Watch data, then I called my doctor

https://www.msn.com/en-us/news/technology/i-let-chatgpt-analyze-a-decade-of-my-apple-watch-data-t...
74•zdw•9h ago•88 comments

France Aiming to Replace Zoom, Google Meet, Microsoft Teams, etc.

https://twitter.com/lellouchenico/status/2015775970330882319
693•bwb•15h ago•550 comments

Show HN: TetrisBench – Gemini Flash reaches 66% win rate on Tetris against Opus

https://tetrisbench.com/tetrisbench/
91•ykhli•13h ago•36 comments

The Adolescence of Technology

https://www.darioamodei.com/essay/the-adolescence-of-technology
182•jasondavies•15h ago•122 comments

Television is 100 years old today

https://diamondgeezer.blogspot.com/2026/01/tv100.html
585•qassiov•17h ago•205 comments

Porting 100k lines from TypeScript to Rust using Claude Code in a month

https://blog.vjeux.com/2026/analysis/porting-100k-lines-from-typescript-to-rust-using-claude-code...
191•ibobev•18h ago•126 comments

Over 36,500 killed in Iran's deadliest massacre, documents reveal

https://www.iranintl.com/en/202601255198
385•mhb•1d ago•182 comments

San Francisco Graffiti

https://walzr.com/sf-graffiti
177•walz•22h ago•190 comments

Cyclic Subgroup Sum

https://m-slee.netlify.app/posts/cyclic-subgroup-sum
3•richard_chase•5d ago•1 comments

Fedora Asahi Remix is now working on Apple M3

https://bsky.app/profile/did:plc:okydh7e54e2nok65kjxdklvd/post/3mdd55paffk2o
507•todsacerdoti•14h ago•183 comments