Model Architecture
* Type: Mixture-of-Experts (MoE) transformer.
* Total parameters: 1 trillion.
* Activated parameters: 32 billion.
* Experts: 384 total, with 8 activated per token.
* Attention heads: 64.
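For intuition about what "8 of 384 experts per token" means, here is a minimal top-k routing layer in PyTorch. Only the expert count (384) and top-k (8) come from the report; the toy hidden sizes, the linear router, and the expert MLPs are assumptions for illustration, not Kimi K2's actual implementation.

```python
import torch
import torch.nn as nn

class TopKMoE(nn.Module):
    """Toy sketch of top-k expert routing: 384 experts, 8 active per token."""
    def __init__(self, d_model=64, d_ff=128, n_experts=384, top_k=8):
        super().__init__()
        self.top_k = top_k
        self.router = nn.Linear(d_model, n_experts, bias=False)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff), nn.SiLU(), nn.Linear(d_ff, d_model))
            for _ in range(n_experts)
        )

    def forward(self, x):                              # x: (tokens, d_model)
        scores = self.router(x)                        # (tokens, n_experts)
        weights, idx = scores.topk(self.top_k, dim=-1) # pick 8 experts per token
        weights = weights.softmax(dim=-1)              # renormalize over the chosen 8
        out = torch.zeros_like(x)
        for slot in range(self.top_k):                 # only 8 of 384 experts run per token
            for e in idx[:, slot].unique().tolist():
                mask = idx[:, slot] == e
                out[mask] += weights[mask, slot].unsqueeze(-1) * self.experts[e](x[mask])
        return out

# Example: route a batch of 5 token embeddings through the sparse layer.
# y = TopKMoE()(torch.randn(5, 64))
```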
Pre-training
* Optimizer: MuonClip, a novel optimizer that integrates Muon with a QK-Clip mechanism to address training instability.
* Dataset: pre-trained on 15.5 trillion tokens.
* Training process: zero loss spikes; the initial 4,096-token context window was later extended to 128k tokens using the YaRN method.
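For the curious, a hedged sketch of the QK-Clip idea: after each Muon update, any attention head whose maximum pre-softmax logit exceeded a threshold gets its query/key projections scaled down so the logit is pulled back toward that threshold. The function name, the per-head weight handles, and the default threshold are assumptions based on the public description, not the actual training code.

```python
import torch

def qk_clip_(w_q_head: torch.Tensor, w_k_head: torch.Tensor,
             max_logit: float, tau: float = 100.0) -> None:
    """In-place rescaling of one head's W_q / W_k after an optimizer step.

    If the head's observed max attention logit exceeded tau, shrink both
    projections by sqrt(tau / max_logit) so the q.k logit falls back to ~tau.
    """
    if max_logit > tau:
        scale = (tau / max_logit) ** 0.5   # split the shrink evenly across W_q and W_k
        with torch.no_grad():
            w_q_head.mul_(scale)
            w_k_head.mul_(scale)
```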
Post-training
* The model underwent a multi-stage process featuring a large-scale agentic data synthesis pipeline and a joint reinforcement learning (RL) stage.
* The RL framework combines verifiable rewards with a self-critique rubric reward mechanism.
* The data synthesis pipeline generated tens of thousands of tool-use training examples.
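To make the reward combination concrete, here is a minimal sketch of blending a verifiable signal (e.g. unit tests or an exact-match check) with a self-critique rubric score. The helper functions and the weights are hypothetical; the actual framework may route different task types to different reward sources rather than averaging them as shown here.

```python
from typing import Callable

def combined_reward(response: str,
                    passes_checks: Callable[[str], bool],
                    rubric_score: Callable[[str], float],
                    w_verify: float = 0.7,
                    w_rubric: float = 0.3) -> float:
    """Blend an objective, checkable reward with a model-generated rubric score.

    passes_checks: hypothetical verifier (unit tests, exact-match answer, etc.)
    rubric_score:  hypothetical self-critique score in [0, 1]
    """
    verifiable = 1.0 if passes_checks(response) else 0.0
    return w_verify * verifiable + w_rubric * rubric_score(response)
```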
Performance Benchmarks (non-thinking mode)
* SWE-bench Verified: 65.8%
* SWE-bench Multilingual: 47.3%
* LiveCodeBench v6: 53.7%
* OJBench: 27.1%
* Tau2-Bench micro-average: 66.1
* ACEBench (en): 76.5
* AIME 2025: 49.5
* GPQA-Diamond: 75.1
* LMSYS Arena Leaderboard (July 17, 2025): ranked 1st among open-source models and 5th overall
We just covered it today on the latent.space paper club, if you want to listen along while reading the paper: https://youtu.be/VHwZa7lZhK8
Definitely see also Sebastian Raschka's writeup: https://t.co/oEt8XzNxik
* Background on Muon and MuonClip: https://www.youtube.com/watch?v=fcTNQLebHb0
dang • 1d ago
China's moonshot launches free AI model Kimi K2 that outperforms GPT4 - https://news.ycombinator.com/item?id=44575309 - July 2025 (3 comments)
Kimi K2 and when "DeepSeek Moments" become normal - https://news.ycombinator.com/item?id=44561565 - July 2025 (2 comments)
Kimi K2 is a state-of-the-art mixture-of-experts (MoE) language model - https://news.ycombinator.com/item?id=44533403 - July 2025 (178 comments)