Hi everyone. I’ve been experimenting with local models for autonomous coding, specifically qwen2.5-coder:32b. Using standard tools like Aider directly against Ollama, the model kept suffering from severe context drift, hitting a hard ceiling of a 20% pass rate on multi-step tasks.
Instead of throwing more parameters at it (which caused OOM errors for me), I built a deterministic RAG wrapper in Java that intercepts the model's output and forces it to consult a local index before executing code. By fencing the LLM's stochastic output behind deterministic retrieval and enforcing strict structural patterns on its responses, the pass rate jumped to 100% on the same test sets, and execution was roughly 4.6x faster.
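To make the interception idea concrete, here is a minimal sketch (not the actual wrapper; names like `LOCAL_INDEX` and `unresolvedSymbols` are illustrative) of the core gate: before any model-proposed code runs, every symbol it references is checked against a local index, and an unresolved symbol triggers a correction round instead of execution.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Hypothetical sketch of the deterministic gate described above.
// The real index layer is richer; this just shows the control flow.
public class DeterministicWrapper {
    // Toy "local index": symbol name -> file that defines it.
    static final Map<String, String> LOCAL_INDEX = Map.of(
        "parseConfig", "src/Config.java",
        "runTask", "src/Runner.java"
    );

    // Returns the symbols the model referenced that the index cannot
    // resolve; a non-empty result blocks execution and triggers a
    // self-correction round instead.
    static List<String> unresolvedSymbols(List<String> referenced) {
        List<String> missing = new ArrayList<>();
        for (String sym : referenced) {
            if (!LOCAL_INDEX.containsKey(sym)) {
                missing.add(sym);
            }
        }
        return missing;
    }

    public static void main(String[] args) {
        // Model output referencing one real and one hallucinated symbol.
        List<String> fromModel = List.of("parseConfig", "frobnicate");
        List<String> missing = unresolvedSymbols(fromModel);
        if (missing.isEmpty()) {
            System.out.println("EXECUTE");
        } else {
            System.out.println("RETRY: " + missing);
        }
    }
}
```

Because the check is a deterministic lookup rather than another model call, the retry loop cannot drift: either every referenced symbol resolves and execution proceeds, or the model gets a bounded, factual error to correct.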
I wrote down the full methodology, the benchmark details (using the Aider Polyglot suite), and some architectural notes on how the 'Index Layer' handles the self-correction loop in the article linked above. (Note: The site has an English/Spanish toggle at the top).
I'd love to hear your thoughts or if anyone has tackled local context drift in a similar way.
Beating Aider's 20% pass rate on local Qwen 32B using deterministic RAG
ebercruzdev•2h ago