frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: How would you architect a RAG system for 10M+ documents today?

5•Ftrea•3h ago
I'm tasked with building a private AI assistant for a corpus of 10 million text documents (living in PostgreSQL). The goal is semantic search and chat, with a requirement for regular incremental updates.

I'm trying to decide between:

Bleeding edge: Implementing something like LightRAG or GraphRAG.

Proven stack: Standard Hybrid Search (Weaviate/Elastic + Reranking) orchestrated by tools like Dify.

For those who have built RAG at this scale:

What is your preferred stack for 2025?

Is the complexity of Graph/LightRAG worth it over standard chunking/retrieval for this volume?

How do you handle maintenance and updates efficiently?

Looking for architectural advice and war stories.

Comments

parentheses•1h ago
If it's < 100M, with vectors of 1024 size, you could fit all of that in ~100G of memory. So, maybe storing it in memory is an easy way to go about it. This ignores a lot of "database problems". If the docs are changing constantly, or uou have other scalability concerns, you may be better off using a "proper" vector db. There have been HN postings which indicate vector db choice matters. Do your research there.

Ask HN: How are Markov chains so different from tiny LLMs?

134•JPLeRouzic•3d ago•88 comments

Restaurant Shift Scheduling via Linear Optimization and Staff Constraints

2•emmahexa•3h ago•0 comments

Ask HN: How would you architect a RAG system for 10M+ documents today?

5•Ftrea•3h ago•1 comments

Ask HN: What is the best way to see what files are being read in Windows?

4•jacobwilliamroy•12h ago•4 comments

Ask HN: What is the current state of the art in BIG (>5TB) cloud backups?

19•jacobwilliamroy•1d ago•17 comments

Facebook has made it impossible to delete Pages – dark patterns everywhere

43•ramharts•2d ago•15 comments

The new Grok on X is aligned to favor Elon Musk over anyone else when asked

12•kranke155•5h ago•0 comments

Ask HN: Struggling founders, pls share your startup struggle

15•vieews•1d ago•12 comments

Tell HN: Cursor exposes side projects to your employer

29•throwawaybbbbbb•1d ago•22 comments

Official gRPC Benchmark

2•bijan7•1d ago•0 comments

Ask HN: Cloud providers are losing in favor of bare-metal?

35•clostao•4d ago•26 comments

Ask HN: Vitalik says that QC might break ECC before 2028. This is crazy, right?

10•jMyles•1d ago•5 comments

Ask HN: Have you ever seen a perfect codebase?

6•mcdow•1d ago•12 comments

Ask HN: What operating systems, apps, etc. had your favorite UI designs?

4•pixelworm•1d ago•6 comments

Why doesn't someone just send the Epstein files to WikiLeaks?

14•aniken•1d ago•10 comments

Ask HN: Does anyone else feel like a 'manager' now, with AI?

11•keepamovin•1d ago•12 comments

An exposed .git folder let us dox a phishing campaign

64•spirovskib•4d ago•21 comments

What is the most beautiful / highest quality code you've seen (or written)?

33•gooob•6d ago•17 comments

Built a Pomodoro timer for ADHD brains: always visible progress bar

17•raoarjun4•4d ago•4 comments

Ask HN: How does one stay motivated to grind through LeetCode?

93•blutoot•1w ago•111 comments

You've reached the end!