Claude Code took 0.1s, Cursor CLI 19s
idk what you expect from a question about "how much data". it's tool-based search, so it's a lot.
One of my side projects is a full text index for pattern search, and I'm trying to understand how it might fit with that. You mention tool call overhead, but is that a significant part of the latency in the multi-turn scenario, or is it the coding agent being forced into a serial processing pattern?
for another take on latency attribution see https://x.com/silasalberti/status/1979310181424206143
you can try the playground here: https://playground.cognition.ai/
i wrote a longer explainer here https://x.com/swyx/status/1978874342743343254 but, to save you the click:
this was a perspective cut from the blogpost, but let me explain why subagents kill long context
Like, you could spend $500m building 100-million-token context models, and they would be 1) slow, 2) expensive to use, and 3) have huge context rot. O(n) in context length is the lower bound no matter what you do.
Cog's approach is something you learn on day 1 of CS50: divide and parallelize. Embeddings are too dumb, Agentic Search is too slow. So train limited-agency (max 4 turns), natively parallel tool-calling (avg parallelism of 7-8, custom toolset), fast (2800 tok/s) subagents that give the performance of Agentic Search within an acceptable "Flow Window" that feels immaterially slower than Embeddings. (Rough sketch of the fan-out pattern after the list below.)
The benefit of this is threefold:
- 8^4 (= 4,096) tool calls cover a very large code search space; subagent calls can be compounded if more are needed.
- predictable cost & end-to-end latency
- subagents output "clean" contexts, free of context failure modes like context poisoning and context rot
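To make the fan-out concrete, here's a minimal Python sketch of the shape I'm describing (my illustration, not Cognition's actual code): a few subagents run concurrently, each limited to a handful of rounds of parallel tool calls, and each hands back only a distilled summary. The ripgrep tool, the pre-split query batches, and names like fast_context are stand-ins for what the trained swe-grep models decide on their own.

    import asyncio

    MAX_TURNS = 4   # limited agency: each subagent gets at most 4 rounds of tool calls
    FANOUT = 8      # roughly the avg parallelism of 7-8 tool calls per round

    async def grep_tool(pattern: str) -> str:
        # placeholder search tool: list files matching a pattern with ripgrep
        proc = await asyncio.create_subprocess_exec(
            "rg", "-l", pattern, ".",
            stdout=asyncio.subprocess.PIPE,
            stderr=asyncio.subprocess.DEVNULL,
        )
        out, _ = await proc.communicate()
        return out.decode()

    async def run_subagent(queries: list[str]) -> str:
        # one limited-agency subagent: up to MAX_TURNS rounds, each firing up to
        # FANOUT tool calls in parallel, then returning only a short summary
        findings: list[str] = []
        frontier = list(queries)
        for _ in range(MAX_TURNS):
            batch, frontier = frontier[:FANOUT], frontier[FANOUT:]
            if not batch:
                break
            results = await asyncio.gather(*(grep_tool(q) for q in batch))
            findings.extend(r for r in results if r)
            # in the real system the model picks the next batch based on what it saw;
            # here the batches are just pre-split for illustration
        return "\n".join(findings)[:2000]  # distilled, "clean" output for the main agent

    async def fast_context(subtasks: list[list[str]]) -> str:
        # fan out: one subagent per subtask, all running concurrently
        summaries = await asyncio.gather(*(run_subagent(qs) for qs in subtasks))
        return "\n\n".join(summaries)

    # e.g. asyncio.run(fast_context([["parse_config"], ["retry", "backoff"], ["auth_token"]]))

The third bullet above is the key design point this sketch is gesturing at: only the summaries cross back into the main agent's context, so raw tool output never accumulates there.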
we originally called this Rapid Agentic Search, to contrast with RAG, but Fast Context rolls off the tongue better.
-- Second perspective --
The Fundamental Equation of Coding Agents is:
Coding Agent Performance = Ability to Read the Right Files * Ability to Generate the Right Diffs
Fast Context is Cognition's first solution for the Read. As codebases get larger and tasks get more complex, Reads get more important: on the average production codebase, the first query in Cascade is >60% just searching and reading files.
But if this were just about speed, it might not be that exciting. I think there are underappreciated effects on performance as well when you have very good context. In other words:
Context Engineering is Actually Very Important. Too important to leave to humans and hardcoded rules.
The swe-greps are the first dedicated context-engineering agent models.
Most LLM coding is so slow that you're permanently out of flow state and stuck in 'manager' state. I'm interested in a future where you've got enough fast, low-TTFT support that an engineer could maintain flow state and get superpower-level productivity at the same time, and this tool makes me think of that.
That is, it looks fast enough to be used as a sort of sidebar info tool, as in "what you're coding might need / refer to these other parts of the codebase", effectively increasing an engineer's working memory. Super cool. And obviously useful for an AI engineer as well. Thanks for the writeup!
So that's how that is going ;)
"We ran into an error processing your request. Please try again"
I also enjoyed the tech write-up. It's good to see REAL substantial engineering like this which is both highly impressive and highly productized.