frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Run 100+ RAG experiments in parallel, even on a single GPU

1•kbigdelysh•1h ago
Hey HN,

We built RapidFire AI, an open-source framework that lets you compare dozens (or hundreds) of RAG and context engineering configurations in parallel, without needing a GPU cluster.

Tuning a RAG pipeline means experimenting with chunk sizes, embedding models, retrieval strategies, reranking thresholds, prompt schemes, generator models, and more. With traditional tools, you run these sequentially, wait for each to finish on the full dataset, and then compare. That's painfully slow and wastes tokens/compute on configs you'd have killed after seeing the first 10% of results.

RapidFire AI shards your eval dataset and schedules all configs one shard at a time, cycling through them with efficient swapping. You get running metric estimates with confidence intervals in real time, based on online aggregation from the database systems literature. Spot a bad config early? Stop it. See a promising one? Clone it and tweak knobs on the fly, no restart needed.

On a beefy machine you can comfortably run 100+ configs in a single experiment. Want to see it in action without installing anything? We have a Google Colab tutorial that runs 4 RAG retrieval configs in parallel on a free Colab GPU, zero local setup, under 2 minutes to get started. It builds a financial Q&A pipeline on the FiQA dataset, grid-searches over chunk sizes and reranker settings, and shows live metrics with confidence intervals as the configs run. If you're only calling OpenAI or other closed APIs, you don't even need a GPU at all.

Colab: https://colab.research.google.com/github/RapidFireAI/rapidfi...

We'd love feedback on what knobs/integrations matter most to you. Happy to answer questions here.

Comments

kbigdelysh•1h ago
Author here, happy to answer any questions about the architecture, how the scheduling works, or anything else. Fire away!

Brazilian Age-Verification Law: I Posit It Does Not Apply to Open-Source OSes

https://www.planalto.gov.br/ccivil_03/_ato2023-2026/2025/Lei/L15211.htm
1•replooda•40s ago•0 comments

Programmable Property-Based Testing

https://arxiv.org/abs/2602.18545
1•PaulHoule•1m ago•0 comments

Yahoo Introduces MyScout, the First Personalized Homepage for AI Answers

https://www.yahooinc.com/press/yahoo-introduces-myscout-the-first-personalized-homepage-for-ai-an...
1•drtz•2m ago•0 comments

I paired NotebookLM with Claude Code, and it feels like a dream team

https://www.xda-developers.com/paired-notebooklm-with-claude-code/
1•speckx•2m ago•0 comments

Replit raises $400M at $9B valuation

https://techfundingnews.com/replit-raises-400m-9b-valuation-ai-app-building/
2•exizt88•2m ago•0 comments

Tcl's Nxtpaper 4.0 screen: A review

https://manualdousuario.net/en/tcl-nxtpaper-4/
1•rpgbr•3m ago•0 comments

Sam Altman says OpenAI will tweak its Pentagon deal after surveillance backlash

https://www.businessinsider.com/openai-amending-contract-with-pentagon-amid-backlash-mass-surveil...
1•doener•5m ago•2 comments

YouTube just approved 30-second unskippable ads for TV

https://www.androidcentral.com/apps-software/youtube/youtube-on-tv-30-seconds-unskippable-ads
1•LorenDB•5m ago•0 comments

Goldman executive says private markets clients glad about Iran war 'distraction'

https://www.ft.com/content/9232dbce-0255-4949-8c4c-ea58d86a4166
1•alephnerd•5m ago•0 comments

Most AI chatbots will help users plan violent attacks, study finds

https://www.engadget.com/ai/most-ai-chatbots-will-help-users-plan-violent-attacks-study-finds-163...
1•mikece•6m ago•0 comments

ChatGPT Took The Pentagon's Killer Robot Deal: Boycott Now

https://quitgpt.org/pentagon?link_id=2&can_id=3b2cebf422aaa35898d6d8ce17355809&source=email-week-...
1•doener•6m ago•0 comments

The Web Is a Guitar Amp Now (Literally)

https://www.silverorange.com/blog/the-web-is-guitar-amp
2•speckx•6m ago•0 comments

The Bay Area Considers the Unthinkable: Life Without BART

https://www.nytimes.com/2026/03/10/us/bart-bay-area-san-francisco-transit.html
1•radley•7m ago•0 comments

ChatGPT Uninstalls Skyrocket

https://twitter.com/SensorTower/status/2029250034772963513
1•doener•7m ago•0 comments

Show HN: AgentSign – Zero trust for AI agents (OWASP-aligned)

https://agentsign.dev
1•AskCarX•8m ago•0 comments

Testers Still Needed?

1•AtulThakor333•8m ago•0 comments

Vectorless RAG Using Neo4j and Agentic Routing

https://github.com/TejasS1233/vectorless_RAG
1•Tejas1233•8m ago•0 comments

Ask HN: Does AI make your product better?

1•brodouevencode•8m ago•0 comments

Tilly Norwood music video is so bad; AI won't be putting actors out of work

https://www.latimes.com/entertainment-arts/story/2026-03-11/tilly-music-video-bad-ai-actors-out-o...
1•jaredwiener•8m ago•1 comments

AI Paranoia: A Conspiracy of Incentives

https://www.jernesto.com/articles/ai_paranoia
2•ponzusouce•9m ago•0 comments

Space Jellyfish Predictor

https://jellyfish.johnkrausphotos.com/
2•LorenDB•10m ago•0 comments

Show HN: Vanilla JavaScript refinery simulator built to explain job to my kids

https://fuelingcuriosity.com/game.html
6•fuelingcurious•10m ago•1 comments

Redgifs Downloader

https://redgifsdownloader.cc/
1•amazingrobin•13m ago•1 comments

Fungal Electronics

https://arxiv.org/abs/2111.11231
2•byt3h3ad•14m ago•1 comments

Fault Tolerance Benchmark: Clockwork TorchPass, TorchFT and Checkpoint Restart

https://clockwork.io/blog/keeping-distributed-training-running-through-failures/
2•danzheng•15m ago•1 comments

Using local LLM models vs. APIs

https://crimede-coder.com/blogposts/2026/LocalvsAPI
1•apwheele•15m ago•0 comments

Klaus Programmieren – "Official" German Coding Assistant

https://klausprogrammieren.com/
3•luplex•16m ago•2 comments

The Controllability Trap: A Governance Framework for Military AI Agents

https://arxiv.org/abs/2603.03515
1•Anon84•19m ago•0 comments

The Most Disruptive Company in the World

https://time.com/article/2026/03/11/anthropic-claude-disruptive-company-pentagon/
1•jdkee•19m ago•0 comments

Ask HN: Merge/Nango open source alternative?

1•hhthrowaway1230•19m ago•0 comments