Show HN: OpenFable – Open-source RAG engine using tree-structured indexes

https://github.com/alainbrown/openfable
1•alainbrown•1h ago
Hi HN, I built OpenFable, an open-source retrieval engine that implements the FABLE algorithm (https://arxiv.org/abs/2601.18116) for RAG pipelines. I'm using it in another project and thought that others might benefit.

Most RAG systems chunk documents into flat segments and retrieve by vector similarity. This works for simple lookups but breaks when answers span multiple sections, when relevant content is buried in a subsection, or when you need to control how many tokens you're sending to an LLM.
                                                                                                   
OpenFable takes a different approach: when you ingest a document, it uses an LLM to identify discourse boundaries (not fixed-size windows), then builds a hierarchical tree (root, sections, subsections, leaf chunks) with embeddings at every level. Retrieval combines two paths:
                                                                                                   
1. LLM-guided path: the LLM reasons over summaries to decide which documents and subtrees are relevant
2. Vector path: similarity search with structure-aware score propagation through the tree
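
To make the vector path a bit more concrete, here is a minimal sketch of what "structure-aware score propagation" could look like. The `Node` class, the `propagate` helper, the `decay` parameter, and the blend rule (own similarity plus a decayed share of the parent's score) are all assumptions for illustration; this is not the FABLE formula or OpenFable's actual code, and the similarity values are made up.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """One node in a document tree: root, section, subsection, or leaf chunk."""
    node_id: str
    similarity: float  # query-to-embedding similarity for this node (made-up values below)
    children: list["Node"] = field(default_factory=list)


def propagate(node: Node, inherited: float = 0.0, decay: float = 0.5) -> dict[str, float]:
    """Score every leaf as its own similarity plus a decayed share of its ancestors' scores.

    The blend rule is an illustrative assumption, not the rule from the paper.
    """
    score = node.similarity + decay * inherited
    if not node.children:
        return {node.node_id: score}
    leaf_scores: dict[str, float] = {}
    for child in node.children:
        leaf_scores.update(propagate(child, score, decay))
    return leaf_scores


tree = Node("root", 0.2, [
    Node("sec-1", 0.8, [Node("leaf-1a", 0.4), Node("leaf-1b", 0.6)]),
    Node("sec-2", 0.1, [Node("leaf-2a", 0.5)]),
])

scores = propagate(tree)
ranked = sorted(scores, key=scores.get, reverse=True)
print(ranked)
```

In this toy tree, `leaf-1a` has a lower raw similarity than `leaf-2a` but ends up ranked above it, because it sits under a section that matches the query well. That is the kind of effect a flat index cannot express.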
                                                                                                     
Results from both paths are fused, deduplicated, and trimmed to fit a token budget you specify. You get the most relevant chunks, in document order, within budget.
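
The fuse, deduplicate, and trim step can be sketched roughly as follows. This is a simplified illustration under stated assumptions, not OpenFable's implementation: the `Chunk` fields, the `fuse_and_trim` name, max-score fusion, and greedy budget filling are all hypothetical choices, and the sketch assumes every chunk comes from a single document.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Chunk:
    chunk_id: str
    position: int  # order of the chunk within its document
    tokens: int    # token count of the chunk text
    score: float   # relevance score assigned by one retrieval path


def fuse_and_trim(llm_hits: list[Chunk], vector_hits: list[Chunk], budget: int) -> list[Chunk]:
    # Deduplicate by chunk_id, keeping the higher score when both paths return a chunk.
    best: dict[str, Chunk] = {}
    for chunk in [*llm_hits, *vector_hits]:
        current = best.get(chunk.chunk_id)
        if current is None or chunk.score > current.score:
            best[chunk.chunk_id] = chunk

    # Greedily keep the highest-scoring chunks that still fit the token budget.
    selected: list[Chunk] = []
    used = 0
    for chunk in sorted(best.values(), key=lambda c: c.score, reverse=True):
        if used + chunk.tokens <= budget:
            selected.append(chunk)
            used += chunk.tokens

    # Return the survivors in document order rather than score order.
    return sorted(selected, key=lambda c: c.position)


llm_hits = [Chunk("c3", 3, 40, 0.9), Chunk("c1", 1, 30, 0.7)]
vector_hits = [Chunk("c1", 1, 30, 0.8), Chunk("c2", 2, 50, 0.6), Chunk("c4", 4, 20, 0.5)]

result = fuse_and_trim(llm_hits, vector_hits, budget=100)
print([c.chunk_id for c in result])
```

With a budget of 100 tokens, `c2` is skipped because it would overshoot, while the smaller `c4` still fits. A real system might prefer a different fusion rule (for example, reciprocal rank fusion) or stop at the first chunk that does not fit.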
                                                                                                     
From the FABLE paper: the algorithm matches full-context inference (517K tokens) using only 31K tokens, a 94% reduction, while reaching 92% completeness vs. 91% for Gemini-2.5-Pro with the full document.
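
As a quick sanity check on the token figures quoted above, the reduction works out as follows. The only inputs are the two rounded numbers from the post; the variable names are mine.

```python
full_context_tokens = 517_000  # "517K tokens" as quoted above
retrieved_tokens = 31_000      # "31K tokens" as quoted above

reduction = 1 - retrieved_tokens / full_context_tokens
compression = full_context_tokens / retrieved_tokens

print(f"reduction: {reduction:.1%}")       # reduction: 94.0%
print(f"compression: {compression:.1f}x")  # compression: 16.7x
```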
                                                                                                   
OpenFable is retrieval-only: it returns ranked chunks, not generated answers. Bring your own LLM for generation.
                                                                                                     
It runs as a Docker stack (FastAPI + PostgreSQL/pgvector) and exposes both a REST API and an MCP server, so LLM agents like Claude Desktop or Cursor can use it directly.
                                                                                                     
Trade-offs I want to be upfront about:
- Ingestion is expensive: every document requires multiple LLM calls for chunking and tree construction
- Retrieval isn't sub-second: the LLM-guided path adds round-trips
- No built-in auth; designed to sit behind a reverse proxy
- v0.1.0: works end to end, but async ingestion, document deletion, and metadata filtering are still on the roadmap
                                                                                                     
Stack: Python 3.12, FastAPI, SQLAlchemy, pgvector, LiteLLM, FastMCP. Apache 2.0.

Happy to answer questions about the algorithm, implementation choices, or benchmarks.

Cogito: Beautiful AI Markdown Editor for Mac

https://cogito.md
1•0xferruccio•38s ago•0 comments

A rigorous .md specification for AI Daemons

https://ai-daemons.com/spec/
1•mrbbk•44s ago•0 comments

Dario's Weird Race to the Top

https://davidbau.com/archives/2026/04/08/darios_weird_race_to_the_top.html
1•speckx•1m ago•0 comments

Espressif's New ESP32-S31: Dual-Core RISC-V with WiFi 6 and Gbit Ethernet

https://hackaday.com/2026/04/08/espressifs-new-esp32-s31-dual-core-risc-v-with-wifi-6-and-gbit-et...
1•alecco•1m ago•0 comments

Show HN: BAREmail ʕ·ᴥ·ʔ – minimalist Gmail client for bad WiFi

https://github.com/matt-virgo/baremail
1•Virgo_matt•1m ago•0 comments

This method to reverse cellular ageing is about to be tested in humans

https://www.nature.com/articles/d41586-026-01024-7
1•bookofjoe•2m ago•0 comments

The depths of Neptune and Uranus may be "superionic"

https://carnegiescience.edu/depths-neptune-and-uranus-may-be-superionic
1•gmays•4m ago•0 comments

When Agents Have Wallets

https://zeroclick.ai/blog/when-agents-have-wallets
1•mooreds•4m ago•0 comments

He-united-states-is-rewriting-itself

https://richardhogan1.substack.com/
1•Richard_Hogan1•4m ago•0 comments

Decentralized AI in 50 Lines of Python

https://iamtrask.github.io/2026/04/07/decentralized-ai-in-50-lines/
2•williamtrask•4m ago•1 comment

Book Review: Tomorrow, and Tomorrow, and Tomorrow

https://www.stephendiehl.com/posts/tomorrow_tomorrow_tomorrow/
1•ibobev•4m ago•0 comments

Every Democrat Who Enabled Trump's Crypto Corruption

https://www.stephendiehl.com/posts/crypto_democrats/
2•ibobev•5m ago•0 comments

Show HN: IDWIW – a YouTube viewer to avoid algorithm traps

1•kei178•6m ago•0 comments

"The Talk" by Scott Aaronson and Zach Weinersmith

https://www.smbc-comics.com/comic/the-talk-3
2•cwillu•6m ago•0 comments

The Thirty Years' War Is Starting Again

https://www.stephendiehl.com/posts/christian_nationalism/
2•ibobev•6m ago•0 comments

Decoding the MySQL Binary Log: Table_map_event – Table Metadata for RBR

https://readyset.io/blog/replication-internals-decoding-the-mysql-binary-log-part-7-table_map_eve...
2•marceloaltmann•7m ago•0 comments

How to Choose the Best AI for Accountants

https://accountsdraft.com/resources/best-ai-for-accountants-uk-practitioners-guide
1•Rob_Benson-May•7m ago•0 comments

Show HN: Hoeren – Local-only meeting transcription and voice dictation

1•dimaberlin•8m ago•0 comments

Chinese electric truck maker Windrose makes first US delivery

https://www.reuters.com/business/autos-transportation/chinese-electric-truck-maker-windrose-makes...
1•java-man•11m ago•1 comment

Show HN: Go-Bt: Minimalist Behavior Trees for Go

https://github.com/rvitorper/go-bt
1•rvitorper•12m ago•0 comments

Neon Vector Animation

https://davidhampgonsalves.com/neon-animated-eye/
1•dh-g•13m ago•0 comments

BlueHammer – Windows 0day LPE

https://github.com/Nightmare-Eclipse/BlueHammer
1•asimops•15m ago•0 comments

The AI coding agent is a new kind of contributor, and Git isn't made for it

https://agent-undo.com/blog/the-shift/
3•peaktwilight•16m ago•0 comments

Adam Jacob – Communication Breakdown

https://www.adamhjk.com/blog/communication-breakdown/
1•rmoff•17m ago•0 comments

Jitter – Prove human authorship in Google Docs without sharing content

https://scalisos.com
2•theborat•18m ago•0 comments

Show HN: Real-time deepfake in the browser, no GPU needed

https://www.callstrike.ai/deepfake-security-training
1•gyanchawdhary•19m ago•0 comments

Private AI Document Analysis (even in the browser)

https://www.pdflayout.ai/
1•floathub•19m ago•0 comments

WordTrail – Daily Word Puzzle

https://play.google.com/store/apps/details?id=com.wordtrail.app&hl=en_US
1•talha_dev•19m ago•0 comments

Stellar Broadcast colony ship roguelike with on-device neural net planet naming

https://stellarbroadcast.org
1•FaustCircuits•20m ago•0 comments

How much Brits worry about the automation of their work

https://yougov.com/en-gb/trackers/how-much-brits-worry-about-the-automation-of-their-work
1•alphabetatango•21m ago•0 comments