So you wanna build a local RAG?

https://blog.yakkomajuri.com/blog/local-rag
30•pedriquepacheco•1h ago

Comments

mips_avatar•18m ago
One thing I didn’t see here that might be hurting your performance is a lack of semantic chunking. It sounds like you’re embedding entire docs, which breaks down if the docs contain multiple concepts. A better approach for recall is to use a chunking tool to get semantic chunks (I like spaCy, though you have to configure it a bit). Once you have your chunks, you need to add context to each one describing how it relates to the rest of your doc before you embed it. I have found Anthropic’s approach to contextual retrieval to be very performant in my RAG systems (https://www.anthropic.com/engineering/contextual-retrieval); you can just use gpt-oss-20b as the model for generating the context.

Unless I’ve misunderstood your post and you are already doing some form of this in your pipeline, you should see a dramatic improvement in performance once you implement this.
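
A minimal sketch of the chunk-then-add-context idea from the comment above. It is not the commenter's pipeline: a naive regex sentence splitter stands in for spaCy, and `describe_context` is a hypothetical stub where the LLM call (for example gpt-oss-20b) would go. All function names and the `max_chars` parameter are assumptions made for illustration.

```python
import re


def split_into_chunks(doc: str, max_chars: int = 200) -> list[str]:
    """Greedily pack whole sentences into chunks of at most max_chars.

    Assumption: a stand-in for a real semantic chunker such as spaCy.
    A single sentence longer than max_chars becomes its own chunk.
    """
    sentences = re.split(r"(?<=[.!?])\s+", doc.strip())
    chunks, current = [], ""
    for sentence in sentences:
        if current and len(current) + 1 + len(sentence) > max_chars:
            chunks.append(current)
            current = sentence
        else:
            current = f"{current} {sentence}".strip()
    if current:
        chunks.append(current)
    return chunks


def describe_context(doc_title: str, chunk: str) -> str:
    """Hypothetical placeholder for an LLM call that situates a chunk
    within its source document. Here it only names the document."""
    return f"From the document '{doc_title}'."


def contextualize(doc_title: str, doc: str, max_chars: int = 200) -> list[str]:
    """Return the strings to embed: context line first, then the chunk."""
    return [
        f"{describe_context(doc_title, chunk)}\n{chunk}"
        for chunk in split_into_chunks(doc, max_chars)
    ]
```

Each string returned by `contextualize` is what would be passed to the embedding model in place of the bare chunk, so a chunk that never names its subject still carries a hint about where it came from.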

simonw•10m ago
My advice for building something like this: don't get hung up on a need for vector databases and embedding.

Full text search or even grep/rg are a lot faster and cheaper to work with - no need to maintain a vector database index - and turn out to work really well if you put them in some kind of agentic tool loop.

The big benefit of semantic search was that it could handle fuzzy searching - returning results that mention dogs if someone searches for canines, for example.

Give a good LLM a search tool and it can come up with searches like "dog OR canine" on its own - and refine those queries over multiple rounds of searches.

Plus it means you don't have to solve the chunking problem!
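
A rough sketch of the kind of search tool and refinement loop described above, under stated assumptions: `search` is a toy whole-word matcher that understands only `OR`, and the list of queries passed to `tool_loop` stands in for the successive queries an LLM would propose. None of these names come from the comment.

```python
import re


def search(query: str, docs: dict[str, str]) -> list[str]:
    """Return names of docs that contain any OR-separated term as a whole word.

    Assumption: a toy replacement for full-text search or grep/rg.
    """
    terms = [t.strip().lower() for t in query.split(" OR ") if t.strip()]
    hits = []
    for name, text in docs.items():
        words = set(re.findall(r"\w+", text.lower()))
        if any(term in words for term in terms):
            hits.append(name)
    return hits


def tool_loop(queries, docs, max_rounds=3):
    """Try successive queries until one returns hits.

    `queries` is a hypothetical stand-in for the refinements an LLM
    would generate after seeing each round's (empty) results.
    """
    for query in list(queries)[:max_rounds]:
        hits = search(query, docs)
        if hits:
            return query, hits
    return None, []
```

With documents that mention "dog" and "canine" separately, a first query of `canines` finds nothing, while the refined `dog OR canine` matches both, which is the fuzzy-matching gap the comment says a model can close by itself.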

nilirl•10m ago
Why is it implicit that semantic search will outperform lexical search?

Back in 2023, when I compared semantic search to lexical search (tantivy; BM25), I found the search results to be only marginally different.

Even if semantic search has slightly more recall, does the problem of context warrant this multi-component, homebrew search engine approach?

By what important measure does it outperform a lexical search engine? Is the engineering time worth it?
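
For reference, a small sketch of BM25 scoring, the lexical ranking function mentioned above. This uses a common Okapi BM25 formulation with a smoothed IDF; the defaults `k1=1.2` and `b=0.75` and the tokenizer are assumptions, and tantivy's exact variant and parameters may differ.

```python
import math
import re
from collections import Counter


def tokenize(text: str) -> list[str]:
    """Lowercase and split on non-word characters."""
    return re.findall(r"\w+", text.lower())


def bm25_scores(query: str, docs: list[str], k1: float = 1.2, b: float = 0.75) -> list[float]:
    """Score each document in a non-empty list against the query with BM25."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    avg_len = sum(len(t) for t in tokenized) / n_docs
    # Number of documents that contain each term at least once.
    doc_freq = Counter(term for tokens in tokenized for term in set(tokens))

    scores = []
    for tokens in tokenized:
        tf = Counter(tokens)
        score = 0.0
        for term in tokenize(query):
            if term not in tf:
                continue
            # Rare terms get a higher weight than common ones.
            idf = math.log(1 + (n_docs - doc_freq[term] + 0.5) / (doc_freq[term] + 0.5))
            # Term frequency saturates, and long documents are penalized.
            norm = tf[term] + k1 * (1 - b + b * len(tokens) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores
```

A query for "canine" scores zero against a document that only says "dog", which is the recall gap semantic search is meant to cover; whether that gap justifies the extra components is the question the comment raises.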

Can Dutch universities do without Microsoft?

https://dub.uu.nl/en/news/can-dutch-universities-do-without-microsoft
115•robtherobber•2h ago•84 comments

Bringing Sexy Back. Internet surveillance has killed eroticism

https://lux-magazine.com/article/privacy-eroticism/
60•eustoria•55m ago•11 comments

So you wanna build a local RAG?

https://blog.yakkomajuri.com/blog/local-rag
31•pedriquepacheco•1h ago•3 comments

C++ Web Server on my custom hobby OS

https://oshub.org/projects/retros-32/posts/getting-a-webserver-running
22•joexbayer•45m ago•3 comments

Don't tug on that, you never know what it might be attached to

https://blog.plover.com/2016/07/01/#tmpdir
44•todsacerdoti•1h ago•9 comments

True P2P Email on Top of Yggdrasil Network

https://github.com/JB-SelfCompany/Tyr
33•basemi•1h ago•6 comments

Meta hiding $27B in debt using advanced geometry

https://stohl.substack.com/p/exclusive-credit-report-shows-meta
160•FreeQueso•1h ago•74 comments

Atuin’s New Runbook Execution Engine

https://blog.atuin.sh/introducing-the-new-runbook-execution-engine/
63•emschwartz•3d ago•8 comments

JSON Schema Demystified: Dialects, Vocabularies and Metaschemas

https://www.iankduncan.com/engineering/2025-11-24-json-schema-demystified/
6•navigate8310•26m ago•0 comments

Show HN: Glasses to detect smart-glasses that have cameras

https://github.com/NullPxl/banrays
417•nullpxl•12h ago•150 comments

Show HN: An LLM-Powered Tool to Catch PCB Schematic Mistakes

https://netlist.io/
10•wafflesfreak•30m ago•3 comments

AI Adoption Rates Starting to Flatten Out

https://www.apolloacademy.com/ai-adoption-rates-starting-to-flatten-out/
84•toomuchtodo•1h ago•35 comments

Petition to formally recognize open source work as civic service in Germany

https://www.openpetition.de/petition/online/anerkennung-von-open-source-arbeit-als-ehrenamt-in-de...
356•PhilippGille•3h ago•92 comments

Tech Titans Amass Multimillion-Dollar War Chests to Fight AI Regulation

https://www.wsj.com/tech/ai/tech-titans-amass-multimillion-dollar-war-chests-to-fight-ai-regulati...
144•thm•8h ago•143 comments

Moss: a Rust Linux-compatible kernel in 26,000 lines of code

https://github.com/hexagonal-sun/moss
307•hexagonal-sun•6d ago•76 comments

Pocketbase – open-source realtime back end in 1 file

https://pocketbase.io/
551•modinfo•14h ago•149 comments

Stellantis Is Spamming Owners' Screens with Pop-Up Ads for New Car Discounts

https://www.thedrive.com/news/stellantis-is-spamming-owners-screens-with-pop-up-ads-for-new-car-d...
50•cf100clunk•1h ago•17 comments

Apple and Intel Rumored to Partner on Mac Chips

https://www.macrumors.com/2025/11/28/intel-rumored-to-supply-new-mac-chip/
41•bigyabai•1h ago•8 comments

Lobsters Interview

https://susam.net/my-lobsters-interview.html
4•blenderob•1h ago•1 comment

The Signal Is the Noise

https://www.magazine.dirt.fyi/p/the-signal-is-the-noise
11•surprisetalk•1h ago•4 comments

A Tale of Four Fuzzers

https://tigerbeetle.com/blog/2025-11-28-tale-of-four-fuzzers/
45•jorangreef•5h ago•13 comments

Generating 3D Meshes from Text

https://cprimozic.net/notes/posts/generating-3d-meshes-from-text/
10•todsacerdoti•2h ago•1 comment

A Remarkable Assertion from A16Z

https://nealstephenson.substack.com/p/a-remarkable-assertion-from-a16z
244•boplicity•5h ago•97 comments

Tell HN: Want a better HN? Visit /newest

184•alecco•2h ago•57 comments

Swedish publishers file police report against Meta's Zuckerberg for fraud

https://www.sverigesradio.se/artikel/swedish-publishers-file-police-report-against-metas-zuckerbe...
71•Frieren•2h ago•20 comments

Playtiles: The Pocket-Sized Gaming Platform

https://get.playtil.es
13•surprisetalk•1h ago•4 comments

Writing Builds Resilience in Everyday Challenges by Changing Your Brain

https://scienceclock.com/writing-builds-resilience-in-everyday-challenges-by-changing-your-brain/
18•PikelEmi•4h ago•2 comments

A Repository with 44 Years of Unix Evolution

https://www.spinellis.gr/pubs/conf/2015-MSR-Unix-History/html/Spi15c.html
74•lioeters•8h ago•19 comments

The Math of Why You Can't Focus at Work

https://justoffbyone.com/posts/math-of-why-you-cant-focus-at-work/
59•0x79de•8h ago•18 comments

Show HN: Spikelog – A simple metrics service for scripts, cron jobs, and MVPs

https://spikelog.com
25•dsmurrell•1d ago•12 comments