frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

SWE-Grep and SWE-Grep-Mini: RL for Fast Multi-Turn Context Retrieval

https://cognition.ai/blog/swe-grep
78•meetpateltech•11h ago

Comments

marstall•10h ago
SWE-1 has been being booped up by WindSurf to me lately and I've been impressed - often (enough?) getting me the same answers as GPT5 etc., but almost instantly. Gotta say speed is nice.
swyx•9h ago
nice, what does booped up mean? is this gen z lingo?
marstall•8h ago
ha more like how i talk to my two year old. WindSurf's Cascade sidebar tool (which i use in RubyMine) has a stable of LLMs and it somewhat randomly switches the active one out from time to time. So I get a taste of what different ones are like, it's kind of cool.
tifa2up•10h ago
Searched for 'hi' and it took 166s to return a response using this model: https://pasteboard.co/oB4VqVC5FGkl.png

Claude Code took 0.1s, Cursor CLI 19s

mgambati•9h ago
If you ask a real question, then you might get real results.
silasalberti•9h ago
hey I'm from the SWE-grep team - feel free to ask me any questions :)
daralthus•8h ago
this would be useful outside of coding. could you release a benchmark so we can have more models tuned for this?
swyx•9h ago
(coauthor) main charts/evals here https://x.com/cognition/status/1978867021669413252

you can try the https://playground.cognition.ai/ here

i wrote a longer explainer here https://x.com/swyx/status/1978874342743343254 but saving you the click

this was a perspective cut from the blogpost, but let me explain why subagents kill long context

Like you can spend $500m building 100 million context models, and they would be 1) slow, 2) expensive to use, 3) have huge context rot. O(n) is the lower bound.

Cog's approach is something you learn in day 1 of CS50 - divide and parallelize. Embeddings are too dumb, Agentic Search is too slow. So train limited-agency (max 4 turns), natively parallel tool calling (avg parallelism of 7-8, custom toolset) fast (2800tok/s) subagents to give the performance of Agentic Search under an acceptable "Flow Window" that feels immaterially slower than Embeddings.

The benefit of this is threefold:

- 8 ^ 4 toolcalls cover a very large code search space. can compound subagent calls if more needed.

- predictable cost & end to end latency

- subagent outputs "clean" contexts, free of context failure modes like context poisoning and context rot

we originally called this Rapid Agentic Search, to contrast with RAG. but Fast Context rolls off the tongue better.

-- Second perspective --

The Fundamental Equation of Coding Agents is:

Coding Agent Performance = Ability to Read the Right Files * Ability to Generate the Right Diffs

Fast Context is Cognition's first solution for the Read. As codebases get larger and and tasks get more complex, Reads get more important. the average production codebase first query in Cascade is >60% just searching and reading files.

But if this were just about speed, it might not be that exciting. I think there are unappreciated effects in performance as well when you have very good context. In other words:

Context Engineering is Actually Very Important. Too important for humans and hardcoded rules.

The swe-greps are the first dedicated context engineer agent models.

vessenes•5h ago
Thanks for the summary. I noticed from the announcement you trained on parallel tool calling to save on serial round tripping. This is awesome.

Most LLM coding is so slow that you're permanently out of flow state, and in 'manager' state right now - I'm interested in a future where you've got enough fast low TTFT support that an engineer could maintain flow state and have sort of super power type productivity at the same time, and this tool makes me think of that.

That is, it looks fast enough to be used as a sort of sidebar info tool, as in "what you're coding might need / refer to these other parts of the codebase" -- effectively increasing an engineer's working memory. Super cool. And obviously useful for an AI engineer as well. Thanks for the writeup!

SafeDusk•2h ago
Any plans to offer this as a tool/MCP server for other coding agents or is it going to be Windsurf exclusive?
ntntnt•9h ago
lol dead thread, cognition begging to grab some traction in this space.
kburman•8h ago
I thought https://playground.cognition.ai/ was just returning some cached query results, but no, they’re actually spinning up real VMs and running live queries without any authentication or restrictions. That must be costing them a fortune.
groby_b•7h ago
Currently, all queries are returning "We're under load and processing too many requests. Please try again later."

So that's how that is going ;)

awsanswers•8h ago
LLM product managers: Show me what's in the context convenient to where I am prompting. Likely the user knowing and editing the precise context between requests will be a user task for a long time
breadislove•7h ago
guys please release the benchmark or the benchmark code. like this is just "trust me bro"
swyx•7h ago
well thats what the playground is for! playground.cognition.ai
breadislove•7h ago
yeah but if people would like to double check the results it would be nice to have the actual benchmark. especially given that your playground is broken...

"We ran into an error processing your request. Please try again"

seanobannon•4h ago
This link redirects to https://cognition.ai/blog/swe-grep now?

How I bypassed Amazon's Kindle web DRM

https://blog.pixelmelt.dev/kindle-web-drm/
659•pixelmelt•7h ago•206 comments

Free the Internet: The Tor Project's annual fundraiser

https://blog.torproject.org/2025-fundraiser-donations-matched/
6•pabs3•11m ago•0 comments

Claude Skills

https://www.anthropic.com/news/skills
525•meetpateltech•12h ago•294 comments

America’s semiconductor boom

https://www.youtube.com/watch?v=T-jt3qBzJ4A
106•zdw•5h ago•51 comments

Gemini 3.0 spotted in the wild through A/B testing

https://ricklamers.io/posts/gemini-3-spotted-in-the-wild/
303•ricklamers•11h ago•180 comments

Cloudflare Sandbox SDK

https://sandbox.cloudflare.com/
153•bentaber•7h ago•48 comments

A 4k-Room Text Adventure Written by One Human in QBasic No AI

https://the-ventureweaver.itch.io/tlote4111
70•ATiredGoat•4d ago•45 comments

Next steps for BPF support in the GNU toolchain

https://lwn.net/Articles/1039827/
5•signa11•1h ago•0 comments

Lead Limited Brain and Language Development in Neanderthals and Other Hominids?

https://today.ucsd.edu/story/did-lead-limit-brain-and-language-development-in-neanderthals-and-ot...
45•gmays•5h ago•14 comments

Your data model is your destiny

https://notes.mtb.xyz/p/your-data-model-is-your-destiny
200•hunglee2•2d ago•30 comments

DoorDash and Waymo launch autonomous delivery service in Phoenix

https://about.doordash.com/en-us/news/waymo
232•ChrisArchitect•14h ago•518 comments

Codex Is Live in Zed

https://zed.dev/blog/codex-is-live-in-zed
194•meetpateltech•12h ago•28 comments

Hyperflask – Full stack Flask and Htmx framework

https://hyperflask.dev/
302•emixam•15h ago•95 comments

Talent

https://www.felixstocker.com/blog/talent
128•BinaryIgor•10h ago•56 comments

Why I have to buy doughnuts with cash

https://www.ft.com/content/8766ef23-3938-4de2-8a37-602c798034aa
10•hhs•5d ago•18 comments

Understanding Spec-Driven-Development: Kiro, Spec-Kit, and Tessl

https://martinfowler.com/articles/exploring-gen-ai/sdd-3-tools.html
49•janpio•6h ago•6 comments

Syntax highlighting is a waste of an information channel (2020)

https://buttondown.com/hillelwayne/archive/syntax-highlighting-is-a-waste-of-an-information/
230•swyx•4d ago•92 comments

Post office in France rolls out croissant-scented stamp

https://www.ctvnews.ca/world/article/french-post-office-rolls-out-croissant-scented-stamp/
101•ohjeez•1w ago•37 comments

Elixir 1.19

https://elixir-lang.org/blog/2025/10/16/elixir-v1-19-0-released/
230•theanirudh•20h ago•51 comments

Microwave technique allows energy-efficient chemical reactions

https://phys.org/news/2025-10-microwave-technique-energy-efficient-chemical.html
36•rolph•6d ago•1 comments

Benjie's Humanoid Olympic Games

https://generalrobots.substack.com/p/benjies-humanoid-olympic-games
106•robobenjie•8h ago•78 comments

A liver transplant from start to finish

https://press.asimov.com/articles/liver
13•mailyk•4d ago•2 comments

Electricity can heal wounds three times as fast (2023)

https://www.chalmers.se/en/current/news/mc2-how-electricity-can-heal-wounds-three-times-as-fast/
145•mgh2•15h ago•90 comments

How to tame a user interface using a spreadsheet

https://blog.gingerbeardman.com/2025/10/11/how-to-tame-a-user-interface-using-a-spreadsheet/
101•msephton•6d ago•25 comments

A conspiracy to kill IE6 (2019)

https://blog.chriszacharias.com/a-conspiracy-to-kill-ie6
169•romanhn•9h ago•100 comments

Lace: A New Kind of Cellular Automata Where Links Matter

https://www.novaspivack.com/science/introducing-lace-a-new-kind-of-cellular-automata
122•airesearcher•14h ago•48 comments

Show HN: Inkeep (YC W23) – Agent Builder to create agents in code or visually

https://github.com/inkeep/agents
64•engomez•15h ago•47 comments

Hacker News – The Good Parts

https://smartmic.bearblog.dev/why-hacker-news/
120•smartmic•7h ago•133 comments

A stateful browser agent using self-healing DOM maps

https://100x.bot/a/a-stateful-browser-agent-using-self-healing-dom-maps
110•shardullavekar•16h ago•54 comments

VOC injection into a house reveals large surface reservoir sizes

https://www.pnas.org/doi/10.1073/pnas.2503399122
91•PaulHoule•5d ago•79 comments