frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Poet-X Enables Billion-Parameter LLM Training on Single H100 GPU

https://www.simplenews.ai/news/poet-x-enables-billion-parameter-llm-training-on-single-h100-gpu-ktw3
1•goldkey•2m ago•0 comments

Show HN: A simple habit tracker focused on clarity and consistency

https://play.google.com/store/apps/details?id=com.shadowstudios.habitonlinetracker&hl=en_US
1•Belsonsan•2m ago•0 comments

Show HN: Kaeso: an OAuth hub for AI agents

1•devinoldenburg•3m ago•0 comments

Show HN: WebBridge turns any website into MCP tools by recording browser traffic

https://github.com/jalabulajunx/WebBridge
1•nonstopnonsense•4m ago•0 comments

You were born without an account

https://simplex.chat/why/
1•Cider9986•5m ago•0 comments

Crowning the protein that makes jellyfish glow green as a model

https://phys.org/news/2026-02-crowning-protein-jellyfish-green-scientists.html
1•PaulHoule•6m ago•0 comments

Show HN: MultiPowerAI – Trust and accountability infrastructure for AI agents

https://multipowerai-trust.vercel.app
1•rogergrubb•9m ago•0 comments

Java beats Go, Python and Node.js in MCP server benchmarks

https://www.tmdevlab.com/mcp-server-performance-benchmark.html
1•lprimak•9m ago•0 comments

Show HN: Single-header C++ libraries for LLM APIs – zero deps beyond libcurl

1•Shmungus•10m ago•0 comments

Show HN: OpenGuard

https://openguard.sh
1•everlier•11m ago•0 comments

I built a tool to transcribe podcasts after struggling to learn languages

1•marstyl•12m ago•0 comments

Why Language Models Hallucinate (2025)

https://arxiv.org/abs/2509.04664
1•doener•12m ago•0 comments

Ask HN: What Are Your Biggest Career Regrets?

2•karakoram•12m ago•1 comments

There Was a Time before Mathematica (2013)

https://writings.stephenwolfram.com/2013/06/there-was-a-time-before-mathematica/
2•masfuerte•14m ago•1 comments

Russian Ransomware Administrator Pleads Guilty to Wire Fraud Conspiracy

https://www.justice.gov/usao-md/pr/russian-ransomware-administrator-pleads-guilty-wire-fraud-cons...
1•737min•16m ago•0 comments

Show HN: Rust-First L3 Limit Order Book Backtesting Engine with Python Bindings

https://github.com/chasemetoyer/Backtesting-Engine
1•chasemetoyer•16m ago•0 comments

Show HN: Ovumcy – self-hosted menstrual cycle tracker

https://github.com/terraincognita07/ovumcy
1•terrain07•16m ago•0 comments

Show HN: Sheila, an AI agent that replaced our accounting flow

https://soapbox.pub/blog/announcing-sheila/
4•knewter•25m ago•2 comments

Qualcomm CEO: 'Resistance Is Futile' as 6G Mobile Revolution Approaches

https://fortune.com/2026/03/03/qualcomm-ceo-resistance-is-futile-6g-mobile-revolution-approaches/
2•m463•25m ago•1 comments

Show HN: NeoNetrek – modernizing the internet's first team game (1988)

https://neonetrek.com
1•yuriksan•27m ago•0 comments

Show HN: Natural language queries for Prometheus Kafka metrics (StreamLens)

https://github.com/muralibasani/streamlens
1•muralibasani•28m ago•0 comments

Satellite firm pauses imagery after revealing Iran's attacks on US bases

https://arstechnica.com/space/2026/03/satellite-firm-pauses-imagery-after-revealing-irans-attacks...
1•consumer451•30m ago•0 comments

China Suspected in Breach of FBI Surveillance Network

https://www.wsj.com/politics/national-security/china-suspected-in-breach-of-fbi-surveillance-netw...
4•JumpCrisscross•30m ago•1 comments

Show HN: I created list of directories (1000) to create free backlinks

https://kitful.ai/directories
1•eashish93•32m ago•0 comments

Fishing crews in the Atlantic keep accidentally dredging up chemical weapons

https://arstechnica.com/health/2026/03/fishing-crews-in-the-atlantic-keep-accidentally-dredging-u...
3•jnord•34m ago•0 comments

45% of AI-Generated Code Has Security Vulnerabilities

https://www.linkedin.com/pulse/45-ai-generated-code-has-security-vulnerabilities-vijay-shankar-gu...
1•ohjeez•35m ago•0 comments

The National Videogame Museum Has Acquired the Mythical Nintendo PlayStation

https://www.engadget.com/gaming/the-national-videogame-museum-has-acquired-the-mythical-nintendo-...
2•breve•37m ago•0 comments

C# Strings Silently Kill Your SQL Server Indexes in Dapper

https://consultwithgriff.com/dapper-nvarchar-implicit-conversion-performance-trap
13•PretzelFisch•38m ago•3 comments

Show HN: I open-sourced my Steam game, 100% written in Lua, engine is also open

https://github.com/willtobyte/reprobate
2•delduca•38m ago•0 comments

The White House: Touchdown

https://twitter.com/WhiteHouse/status/2030051395294941427
2•TheAlchemist•39m ago•3 comments
Open in hackernews

Show HN: RapidFire AI – parallel RAG experimentation with live run intervention

https://github.com/RapidFireAI/rapidfireai
2•kbigdelysh•6h ago
We built RapidFire AI because iterating on RAG pipelines is painfully sequential: run a config, wait, inspect results, tweak one knob, repeat. When you have 15 things to tune (chunk size, retrieval k, reranker, prompt template, context window strategy...) that cycle compounds fast.

RapidFire uses shard-based interleaved scheduling to run many configurations concurrently on a single machine — even a CPU-only box if you're using a closed API like OpenAI. Instead of config A finishing before config B starts, all configs process data shards in rotation, so you see live side-by-side metric deltas within the first few minutes.

The part we're most excited about: Interactive Control (IC Ops).

Most RAG observability tools tell you what happened after a run finishes. IC Ops closes the loop — you can act on what you're observing mid-run:

  - Stop a config that's clearly underperforming (save the API spend)
  - Resume it later if you change your mind
  - Clone a promising run and modify its prompt template or retrieval 
    strategy on the fly, with or without warm-starting from the parent's state
This changes the experimentation workflow from "observe → write notes → re-queue a new job" to "observe → fix → continue" in a single session.

What you can experiment over in one run: - Chunking strategy and overlap - Embedding model - Retrieval k and hybrid search weighting - Reranking model / threshold - Prompt template variants (few-shot, CoT, context compression) - Generation model (swap GPT-4o vs Claude 3.5 vs local model mid-experiment)

Eval metrics aggregate online (no need to wait for full run), displayed in a live-updating in-notebook table. Full MLflow integration for longer-term experiment governance.

GitHub: https://github.com/RapidFireAI/rapidfireai

Docs: https://oss-docs.rapidfire.ai

pip install rapidfireai

Comments

kbigdelysh•6h ago
Happy to answer questions about the IC Ops design, the scheduling algorithm, or how the online aggregation works. We use a fair-share shard scheduler inspired by how databases handle multi-tenant query execution.