frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: How are you doing RAG locally?

59•tmaly•15h ago
I am curious how people are doing RAG locally with minimal dependencies for internal code or complex documents?

Are you using a vector database, some type of semantic search, a knowledge graph, a hypergraph?

Comments

eajr•14h ago
Local LibreChat which bundles a vector db for docs.
whattheheckheck•13h ago
Anythingllm is promising
rahimnathwani•13h ago
If your data aren't too large, you can use faiss-cpu and pickle

https://pypi.org/project/faiss-cpu/

notyourwork•1h ago
For the uneducated, how large is too large? Curious.
itake•5m ago
FAISS runs in RAM. If your dataset can't fit into ram, FAISS is not the right tool.
motakuk•13h ago
LightRAG, Archestra as a UI with LightRAG mcp
ramesh31•10h ago
SQLite with FTS5
nineteen999•9h ago
A little BM25 can get you quite a way with an LLM.
jeffchuber•1h ago
try out chroma or better yet as opus to!
electroglyph•1h ago
simple lil setup with qdrant
CuriouslyC•1h ago
Don't use a vector database for code, embeddings are slow and bad for code. Code likes bm25+trigram, that gets better results while keeping search responses snappy.
lee1012•1h ago
static embedding models im finding quite fast lee101/gobed https://github.com/lee101/gobed is 1ms on gpu :) would need to be trained for code though the bigger code llm embeddings can be high quality too so its just yea about where is ideal on the pareto fronteir really , often yea though your right it tends to be bm25 or rg even for code but yea more complex solutions are kind of possible too if its really important the search is high quality
itake•6m ago
With AI needing more access to documentation, WDYT about using RAG for documentation retrieval?
lee1012•1h ago
lee101/gobed https://github.com/lee101/gobed static embedding models so they are embedded in milliseconds and on gpu search with a cagra style on gpu index with a few things for speed like int8 quantization on the embeddings and fused embedding and search in the same kernel as the embedding really is just a trained map of embeddings per token/averaging
undergrowth•47m ago
undergrowth.io
undergrowth•47m ago
Undergrowth.io
pdyc•36m ago
sqlite's bm25
init0•33m ago
I built a lib for myself https://pypi.org/project/piragi/
jeanloolz•22m ago
Sqlite-vec

The URL shortener that makes your links look as suspicious as possible

https://creepylink.com/
138•dreadsword•2h ago•30 comments

Claude Cowork exfiltrates files

https://www.promptarmor.com/resources/claude-cowork-exfiltrates-files
575•takira•9h ago•238 comments

Furiosa: 3.5x efficiency over H100s

https://furiosa.ai/blog/introducing-rngd-server-efficient-ai-inference-at-data-center-scale
125•written-beyond•5h ago•64 comments

Show HN: Sparrow-1 – Audio-native model for human-level turn-taking without ASR

https://www.tavus.io/post/sparrow-1-human-level-conversational-timing-in-real-time-voice
28•code_brian•12h ago•2 comments

Scaling long-running autonomous coding

https://cursor.com/blog/scaling-agents
165•samwillis•7h ago•79 comments

Ask HN: Share your personal website

504•susam•12h ago•1490 comments

Ask HN: What did you find out or explore today?

39•blahaj•12h ago•27 comments

Tech Writers Are About to Become Obsolete

https://kibbler.dev/blog/turn-your-codebase-into-a-knowledge-base
5•kewun•28m ago•10 comments

Project SkyWatch (a.k.a. Wescam at Home)

https://ianservin.com/2026/01/13/project-skywatch-aka-wescam-at-home/
10•jjwiseman•13h ago•2 comments

Ask HN: How are you doing RAG locally?

61•tmaly•15h ago•20 comments

Bubblewrap: A nimble way to prevent agents from accessing your .env files

https://patrickmccanna.net/a-better-way-to-limit-claude-code-and-other-coding-agents-access-to-se...
54•0o_MrPatrick_o0•4h ago•45 comments

The State of OpenSSL for pyca/cryptography

https://cryptography.io/en/latest/statements/state-of-openssl/
112•SGran•8h ago•19 comments

Ask HN: Weird archive.today behavior?

59•rabinovich•7h ago•17 comments

New Safari developer tools provide insight into CSS Grid Lanes

https://webkit.org/blog/17746/new-safari-developer-tools-provide-insight-into-css-grid-lanes/
14•feross•5h ago•1 comments

Ask HN: What is the best way to provide continuous context to models?

32•nemath•4h ago•14 comments

Why some clothes shrink in the wash and how to unshrink them

https://www.swinburne.edu.au/news/2025/08/why-some-clothes-shrink-in-the-wash-and-how-to-unshrink...
482•OptionOfT•4d ago•252 comments

Show HN: Ever wanted to look at yourself in Braille?

https://github.com/NishantJoshi00/dith
19•cat-whisperer•5d ago•9 comments

Show HN: WebTiles – create a tiny 250x250 website with neighbors around you

https://webtiles.kicya.net/
152•dimden•5d ago•23 comments

Show HN: Webctl – Browser automation for agents based on CLI instead of MCP

https://github.com/cosinusalpha/webctl
79•cosinusalpha•15h ago•25 comments

SparkFun Officially Dropping AdaFruit due to CoC Violation

https://www.sparkfun.com/official-response
426•yaleman•15h ago•430 comments

Sun Position Calculator

https://drajmarsh.bitbucket.io/earthsun.html
87•sanbor•8h ago•19 comments

Find a pub that needs you

https://www.ismypubfucked.com/
246•thinkingemote•14h ago•195 comments

ChromaDB Explorer

https://www.chroma-explorer.com/
48•arsentjev•7h ago•3 comments

Generate QR Codes with Pure SQL in PostgreSQL

https://tanelpoder.com/posts/generate-qr-code-with-pure-sql-in-postgres/
67•tanelpoder•4d ago•5 comments

Crafting Interpreters

https://craftinginterpreters.com/
56•tosh•7h ago•8 comments

How can I build a simple pulse generator to demonstrate transmission lines

https://electronics.stackexchange.com/questions/764155/how-can-i-build-a-simple-pulse-generator-t...
30•alphabetter•5d ago•6 comments

Roam 50GB is now Roam 100GB

https://starlink.com/support/article/58c9c8b7-474e-246f-7e3c-06db3221d34d
268•bahmboo•14h ago•313 comments

Is Rust faster than C?

https://steveklabnik.com/writing/is-rust-faster-than-c/
250•vincentchau•4d ago•274 comments

Ford F-150 Lightning outsold the Cybertruck and was then canceled for poor sales

https://electrek.co/2026/01/13/ford-f150-lightning-outsold-tesla-cybertruck-canceled-not-selling-...
537•MBCook•12h ago•710 comments

I Designed a Custom Protocol for My App

https://blog.roj.dev/how-i-designed-a-custom-protocol-for-my-app
4•_roj•2d ago•2 comments