frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

From zero to a RAG system: successes and failures

https://en.andros.dev/blog/aa31d744/from-zero-to-a-rag-system-successes-and-failures/
52•andros•2d ago

Comments

Horatius77•2d ago
Great writeup but ... pretty sure ChromaDB is open source and not "Google's database"?
threatofrain•1h ago
I'm afraid this hits the credibility of the article for me, that's a pretty weird mistake to make. It's like paying for a Model 3 while thinking it comes from Ford.
andros•43m ago
Thank you for your feedback!
nalinidash•1h ago
ChromaDB is open source with Apache-2.0 license.

https://github.com/chroma-core/chroma

z02d•1h ago
Maybe a bit off-topic: For my PhD, I wanted to leverage LLMs and AI to speed up the literature review process*. Due to time constraints, this never really lifted off for me. At the time I checked (about 6 months ago), several tools were already available (NotebookLM, Anara, Connected Papers, ZotAI, Litmaps, Consensus, Research Rabbit) supporting Literature Review. They have all pros and cons (and different scopes), but my biggest requirement would be to do this on my Zotero bibliographic collection (available offline as PDF/ePub).

ZotAI can use LMStudio (for embeddings and LLM models), but at that time, ZotAI was super slow and buggy.

Instead of going through the valley of sorrows (as threatofrain shared in the blog post - thanks for that), is there a more or less out-of-the-box solution (paid or free) for the demand (RAG for local literature review support)?

*If I am honest, it was rather a procrastination exercise, but this is for sure relatable for readers of HN :-D

bee_rider•45m ago
I tried to do RAG on my laptop just by setting it all up myself, but the actual LLM gave poor results (I have a small thin-and-light fwiw, so I could only run weak models). The vector search itself, actually, ended up being a little more useful.
mettamage•1h ago
51 visitors in real-time.

I love those site features!

In a submission of a few days ago there was something similar.

I love it when a website gives a hint to the old web :)

aledevv•52m ago
I made something similar in my project. My more difficult task has been choice the right approach to chunking long documents. I used both structural and semantic chunking approach. The semantic one helped to better store vectors in vectorial DB. I used QDrant and openAi embedding model.
JKCalhoun•32m ago
And some have been saying that RAGs are obsolete—that the context window of a modern LLM is adequate (preferable?). The example I recently read was that the contexts are large enough for the entire "The Lord of the Rings" books.

That may be, but then there's an entire law library, the entirety of Wikipedia (and the example in this article of 451 GB). Surely those are at least an order of magnitude larger than Tolkien's prose and might still benefit from a RAG.

mentos•27m ago
I assume it’s not possible to get the same results by fine tuning a model with the documents instead?
notglossy•3m ago
You will still get hallucinations. With RAG you use the vectors to aid in finding things that are relevant, and then you typically also have the raw text data stored as well. This allows you to theoretically have LLM outputs grounded in the truth of the documents. Depending on implementation, you can also make the LLM cite the sources (filename, chunk, etc).

Personal Encyclopedias

https://whoami.wiki/blog/personal-encyclopedias
332•jrmyphlmn•16h ago•68 comments

Swift 6.3

https://www.swift.org/blog/swift-6.3-released/
121•ingve•4h ago•54 comments

From zero to a RAG system: successes and failures

https://en.andros.dev/blog/aa31d744/from-zero-to-a-rag-system-successes-and-failures/
52•andros•2d ago•11 comments

Running Tesla Model 3's computer on my desk using parts from crashed cars

https://bugs.xdavidhu.me/tesla/2026/03/23/running-tesla-model-3s-computer-on-my-desk-using-parts-...
672•driesdep•15h ago•220 comments

Obsolete Sounds

https://citiesandmemory.com/obsolete-sounds/
64•benbreen•8h ago•9 comments

LibreOffice and the Art of Overreacting

https://blog.documentfoundation.org/blog/2026/03/25/libreoffice-and-the-art-of-overreacting/
57•bundie•2h ago•25 comments

ARC-AGI-3

https://arcprize.org/arc-agi/3
422•lairv•18h ago•268 comments

Shell Tricks That Make Life Easier (and Save Your Sanity)

https://blog.hofstede.it/shell-tricks-that-actually-make-life-easier-and-save-your-sanity/
165•zdw•11h ago•73 comments

Niche Museums

https://www.niche-museums.com/
20•bookofjoe•2d ago•11 comments

The truth that haunts the Ramones: 'They sold more T-shirts than records'

https://english.elpais.com/culture/2026-03-17/the-uncomfortable-truth-that-will-always-haunt-the-...
162•c420•4d ago•112 comments

Earthquake scientists reveal how overplowing weakens soil at experimental farm

https://www.washington.edu/news/2026/03/19/earthquake-scientists-reveal-how-overplowing-weakens-s...
172•Brajeshwar•22h ago•90 comments

My DIY FPGA board can run Quake II

https://blog.mikhe.ch/quake2-on-fpga/part4.html
173•sznio•3d ago•51 comments

More precise elevation data for GraphHopper routing engine

https://www.graphhopper.com/blog/2026/03/23/more-precise-elevation-data-for-graphhopper/
59•karussell•2d ago•3 comments

The EU still wants to scan your private messages and photos

https://fightchatcontrol.eu/?foo=bar
1263•MrBruh•15h ago•340 comments

90% of Claude-linked output going to GitHub repos w <2 stars

https://www.claudescode.dev/?window=since_launch
315•louiereederson•18h ago•196 comments

The Cassandra of 'The Machine'

https://www.thenewatlantis.com/publications/the-cassandra-of-the-machine
17•Hooke•8h ago•3 comments

Ashby (YC W19) Is Hiring Engineers Who Make Product Decisions

https://www.ashbyhq.com/careers?ashby_jid=c3c7125d-7883-4dff-a2bf-f5a55de4a364&utm_source=hn
1•abhikp•5h ago

Supreme Court Sides with Cox in Copyright Fight over Pirated Music

https://www.nytimes.com/2026/03/25/us/politics/supreme-court-cox-music-copyright.html
362•oj2828•21h ago•282 comments

Show HN: Robust LLM Extractor for Websites in TypeScript

https://github.com/lightfeed/extractor
50•andrew_zhong•8h ago•35 comments

What came after the 486?

https://dfarq.homeip.net/what-came-after-486/
58•jnord•3d ago•55 comments

Show HN: Optio – Orchestrate AI coding agents in K8s to go from ticket to PR

https://github.com/jonwiggins/optio
60•jawiggins•19h ago•34 comments

Quantization from the Ground Up

https://ngrok.com/blog/quantization
280•samwho•20h ago•50 comments

Thoughts on slowing the fuck down

https://mariozechner.at/posts/2026-03-25-thoughts-on-slowing-the-fuck-down/
908•jdkoeck•22h ago•399 comments

The Last Contract: William T. Vollmann's Battle to Publish an Epic (2025)

https://www.metropolitanreview.org/p/the-last-contract
17•benbreen•8h ago•0 comments

Two studies in compiler optimisations

https://www.hmpcabral.com/2026/03/20/two-studies-in-compiler-optimisations/
89•hmpc•3d ago•14 comments

Optimization lessons from a Minecraft structure locator

https://purplesyringa.moe/blog/optimization-lessons-from-a-minecraft-structure-locator/
7•ftk_•5d ago•0 comments

Maxell MXCP-P100 – wireless cassette player

https://maxell-usa.com/product/cassetteplayer/
37•ChrisArchitect•2d ago•22 comments

Government agencies buy commercial data about Americans in bulk

https://www.npr.org/2026/03/25/nx-s1-5752369/ice-surveillance-data-brokers-congress-anthropic
92•nuke-web3•6h ago•38 comments

False claims in a widely-cited paper

https://statmodeling.stat.columbia.edu/2026/03/24/false-claims-in-a-published-no-corrections-no-c...
302•qsi•11h ago•125 comments

Show HN: A plain-text cognitive architecture for Claude Code

https://lab.puga.com.br/cog/
114•marciopuga•12h ago•34 comments