frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

WarpGrep – RL Subagent for Fast Context (Like SWE-Grep)

https://morphllm.com/mcp
1•bhaktatejas922•1d ago

Comments

bhaktatejas922•1d ago
Hello HN,

We’re the team behind WarpGrep. It’s a FAST context retrieval subagent designed to fix coding agents spending ~60% of their time searching for context + the huge context rot problem.

We built this because we found that standard RAG or naive context stuffing leads to "context rot"—where irrelevant files poison the model’s reasoning on long-horizon tasks. Inspired by Cognition’s SWE-Grep, we wanted to build an accessible version that integrates via MCP (Model Context Protocol) or SDK.

How it works: Instead of a single prompt trying to do everything, WarpGrep treats context retrieval as a distinct, RL-trained system. We reward correct context retrieval and penalize irrelevant lines.

Constraints: It operates on a strict budget of 4 turns. Parallelism: It executes up to 8 parallel tool calls per turn (grep, list, read, etc.).

Inference: We worked with NVIDIA to optimize this on B200s. We are hitting ~900 tokens/sec (compared to SWE-Grep’s ~650 t/s). The heavy prefill optimization was critical here because grep operations are read-heavy.

The Results: In our internal benchmarks, offloading retrieval to this subagent speeds up tasks by 40% and reduces token usage by roughly the same amount. More importantly, it seems to reduce "context rot" by ~70% on longer tasks because the agent isn't distracted by irrelevant file headers. On SWE-Bench Pro we see 5-12% improvement on long horizon tasks and stable chats for 2-3x more user messages.

It works with every coding agent - Claude Code, Codex, and OpenCode. We’re curious to see how it handles your edge cases (especially huge repos).

There is a free tier, but if you want to push it hard, you can use the code BF16 for 40M tokens of credit to test the API limits. We do recommend adding a payment method to get around the rate limits but you won't be charged until December 14th. At which point it will still be almost 10x cheaper than Claude Haiku.

Happy to answer questions about the CUDA optimizations or the RL training process!

Backup your Apple voice memos as audio files

https://fabien.cool/en/export-apple-voice-memos/
1•fabienheureux•1m ago•0 comments

Lymphoma Treatment Reaches 100% Survival Rates in Large-Scale Study

https://themedialine.org/headlines/israeli-lymphoma-treatment-reaches-100-survival-rates-in-large...
1•nsoonhui•2m ago•0 comments

A trip through the Graphics Pipeline (2011)

https://fgiesen.wordpress.com/2011/07/09/a-trip-through-the-graphics-pipeline-2011-index/
1•kruuuder•8m ago•0 comments

Just 0.001% hold 3 times the wealth of poorest half of humanity, report finds

https://www.theguardian.com/inequality/2025/dec/10/just-0001-hold-three-times-the-wealth-of-poore...
3•robtherobber•9m ago•0 comments

A Modular, Human-Centric LED System That Reacts to Circadian Patterns

1•emmasuntech•10m ago•0 comments

React: Smart Interval

https://github.com/tkhdev/react-smart-interval
1•handfuloflight•12m ago•0 comments

Using Claude Code to Fine-Tune Open Source LLMs

https://huggingface.co/blog/hf-skills-training
1•victormustar•13m ago•0 comments

I Replaced LLM Tool Calling with Async REST APIs and a Cryptographic Handshake

https://medium.com/towards-artificial-intelligence/i-built-a-distributed-ai-search-engine-to-kill...
1•yaruchyk•14m ago•0 comments

Show HN: Monetising an API by simply emailing public keys

https://img.arible.co/
1•sim04ful•25m ago•0 comments

Sweet Alert++ – A Modern, Accessible Modal Library (SweetAlert2 Alternative)

https://raiank.github.io/sweetalertplusplus.github.io/
1•sueraccount•32m ago•0 comments

Sterilization and contraception increase lifespan across vertebrates

https://www.nature.com/articles/s41586-025-09836-9
1•thunderbong•33m ago•0 comments

How to Make a Crypto Coin

https://coinheadlines.com/news/features/how-to-make-a-crypto-coin/article-22639/
1•RitaDfouni•34m ago•1 comments

ChatGPT and search ads used for malware distribution

https://eclecticlight.co/2025/12/11/how-online-search-and-ai-can-install-malware/
2•louis-paul•35m ago•0 comments

Feedback on Integrated Security Platform

https://substack.com/home/post/p-181243655
2•alex-dozer•36m ago•1 comments

The Waffle Singularity

https://www.tetraslam.world/blog/the_waffle_singularity
2•Tetraslam•37m ago•0 comments

Show HN: Simo.io – Security-first wired open-source Smart Home System for pros

https://simo.io
2•pysupremacy•46m ago•0 comments

Why I Stopped Coding

https://www.youtube.com/watch?v=KBL_RkTx5eI
2•avivby•48m ago•0 comments

The Enterprise AI Revolution: From Chatbots to Autonomous Agentic Architectures

https://medium.com/@mohan.khilariwal/the-enterprise-ai-revolution-moving-beyond-chatbots-to-auton...
2•avivby•49m ago•0 comments

Good Leadership Hinges on "Organizational Intelligence" (2020)

https://hbr.org/2020/06/good-leadership-hinges-on-organizational-intelligence
3•avivby•49m ago•0 comments

Show HN: Turn Git commits into Linear-style release note

https://www.updated.dev/
2•hyun_kim•55m ago•1 comments

$5 whale listening hydrophone making workshop

https://exclav.es/2025/08/03/dinacon-2025-passive-acoustic-listening/
2•gsf_emergency_6•56m ago•0 comments

Parakeets make their home in German trees

https://angiesweb.com/rose-ringed-parakeets-in-germany/
4•doruk101•1h ago•2 comments

Self-hosted Gits battered in 0-day attacks with no fix imminent

https://www.theregister.com/2025/12/10/gogs_0day_under_active_exploitation/
4•Brajeshwar•1h ago•0 comments

Qwen's API platform for image/video generation

https://www.mulerouter.ai
2•dr_dshiv•1h ago•1 comments

Native Parallel Reasoner: Self-Evolving to Learn Parallel Reasoning

2•jacklanda•1h ago•0 comments

Show HN: BJH OS – A Web-Based OS That Works Without Back End or Frameworks

https://github.com/Haris16-code/BJH-OS
1•Haris18•1h ago•1 comments

Skydiver dangles from plane 15,000ft in dramatic new footage of parachute snag

https://www.theaustralian.com.au/business/aviation/skydiver-dangles-from-plane-at-15000ft-in-dram...
1•asdefghyk•1h ago•1 comments

Rails MCP Server: Context-Efficient Tool Architecture

https://mariochavez.io/desarrollo/2025/12/10/rails-mcp-server-context-efficient-refactoring/
1•amalinovic•1h ago•0 comments

Startupideasdb,com is where I got my dream AI Tech Startup Idea. You can Google

5•peterbricks•1h ago•0 comments

ODAM Memory for Cursor – Long-Term Project Memory for Your AI Coding Assistant

https://github.com/aipsyhelp/Cursor_ODAM
1•AndrewMPT•1h ago•1 comments