frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Stop fine-tuning LLMs for docs, use RAG

https://intlayer.org/blog/rag-powered-documentation-assistant
2•MarineCG40•1h ago

Comments

MarineCG40•1h ago
I keep seeing people fine-tune LLMs for use cases where they probably don’t need to. In most doc/product scenarios, you don’t need another fine-tuned model—you just need retrieval-augmented generation (RAG). Why I think RAG wins in most cases: Fine-tuning is expensive, slow, and brittle. Most use cases don’t require “teaching” the model, just giving it the right context. With RAG, you keep things fresh: update your docs → update your embeddings → done. I built a small proof-of-concept to test this: a documentation assistant where docs are chunked + embedded, user queries are matched with cosine similarity, and GPT answers with the relevant context injected. Every query is logged, which turns out to be valuable—surfacing missing docs, common user struggles, and even feature requests. Demo: https://intlayer.org/doc/chat Write-up + code: https://intlayer.org/blog/rag-powered-documentation-assistan... My question: Do you see fine-tuning + RAG coexisting for these types of tasks? Or is RAG simply the obvious solution for 80% of real-world doc/product use cases?

I launched a Mac utility; now there are 5 clones on the App Store using my story

1•tTarnMhrkm•45s ago•0 comments

Ask HN: Why don't we have a shared "libchrome" the way we have glibc or DirectX?

1•omagdy7•3m ago•0 comments

Show HN: Scientific Calculator for Android

https://play.google.com/store/apps/details?id=scientific.codegres.calculator&hl=en_US
1•Codegres•6m ago•0 comments

VoidBreaker Was Made by One Person and Might Be 2025's Best FPS

https://kotaku.com/voidbreaker-fps-review-steam-pc-gamepass-titanfall-roguelike-roguelite-2000621067
1•PaulHoule•9m ago•0 comments

Show HN: WhatsApp SMS IVR Email SchedulerText

https://timetext.in/
1•Codegres•11m ago•0 comments

Detaching GraalVM from the Java Ecosystem Train

https://blogs.oracle.com/java/post/detaching-graalvm-from-the-java-ecosystem-train
2•philonoist•13m ago•0 comments

Show HN: MSPaint for Android

https://play.google.com/store/apps/details?id=com.sketch.paint&hl=en_US
1•Codegres•15m ago•0 comments

Learn Your Way: transform content into interactive lessons by Google

https://learnyourway.withgoogle.com/
2•mustaphah•15m ago•0 comments

US drops Colombia as drug war partner, puts it on rogue nation list

https://www.business-standard.com/world-news/us-drops-colombia-as-drug-war-partner-puts-it-on-rog...
3•geox•15m ago•0 comments

Show HN: Luna, an in-memory SQL server for object storage data

https://github.com/flowerinthenight/luna
1•f14t•16m ago•0 comments

Extracting text from a pdf broke ChatGPT

https://www.surgehq.ai//blog/the-pdf-that-broke-chatgpt
1•landonxi•17m ago•0 comments

Claude Can (Sometimes) Prove It

https://www.galois.com/articles/claude-can-sometimes-prove-it
3•Bogdanp•18m ago•0 comments

Fairchild PPS-25: 4-bit CPU for 25-digit precision

https://www.cpushack.com/2025/02/01/fairchild-pps-25-4-bit-cpu-for-25-digit-precision/
2•pinewurst•20m ago•0 comments

The General Automation GA-16 16-bit CPU

https://www.cpushack.com/2025/08/16/the-general-automation-ga-16-16-bit-cpu/
1•pinewurst•21m ago•0 comments

Chronon: A data platform for serving for AI/ML applications

https://github.com/airbnb/chronon
2•tanelpoder•21m ago•0 comments

A simple guide to finding the right blogging platform

https://kangminsuk.com/blog/choose-a-blogging-platform/
3•billybuckwheat•22m ago•0 comments

Monochrome 2: custom fanless 7.5 L Strix Halo system

https://smallformfactor.net/forum/threads/monochrome-2-my-custom-fanless-7-5-l-strix-halo-system-...
1•JBiserkov•22m ago•0 comments

Mostek 5065

https://en.wikipedia.org/wiki/Mostek_5065
1•pinewurst•23m ago•0 comments

Towers of Silence

https://99percentinvisible.org/episode/towers-of-silence/
2•thunderbong•24m ago•0 comments

Why Is My Kitchen So Clean?

https://lopespm.com/notes/2025/09/16/why_is_my_kitchen_so_clean.html
3•lopespm•24m ago•0 comments

GitHub/spec-kit: Toolkit to help you get started with Spec-Driven Development

https://github.com/github/spec-kit
2•ibobev•25m ago•0 comments

Consumer Reports asks Microsoft to keep supporting Windows 10

https://www.theverge.com/news/779079/consumer-reports-windows-10-extended-support-microsoft
2•cebert•29m ago•0 comments

LLM misalignment may stem from role inference, not corrupted weights

https://echoesofvastness.substack.com/p/cross-domain-misalignment-generalization
1•PinResearch•31m ago•1 comments

The next 20: Powering the future of entertainment together at Made on YouTube

https://blog.youtube/news-and-events/made-on-youtube-2025/
1•amrrs•31m ago•0 comments

Low-Code and No-Code Platforms Are Changing Web Development

1•siteitnow•31m ago•0 comments

Texts from Suspect in Charlie Kirk Shooting Offer Insight into a Motive

https://www.nytimes.com/2025/09/16/us/politics/kirk-shooting-suspect-motive-messages.html
1•Redoubts•33m ago•1 comments

Machine Learning vs. Human Learning: They're Not Alike [video]

https://www.youtube.com/watch?v=04NjWFl3X74
2•LordNibbler•42m ago•0 comments

Southern Television broadcast interruption (1977)

https://en.wikipedia.org/wiki/Southern_Television_broadcast_interruption
2•sys_64738•45m ago•1 comments

SK On's breakthrough all-solid-state EV batteries will arrive ahead of schedule

https://electrek.co/2025/09/16/sk-ons-all-solid-state-ev-batteries-will-arrive-ahead-of-schedule/
2•breve•46m ago•0 comments

From Alphabet to Visa, US giants drive euro-denominated bond surge

https://www.reuters.com/business/finance/alphabet-visa-us-giants-drive-euro-denominated-bond-surg...
10•nabla9•51m ago•1 comments