news newest ask show jobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Chip Huyen's 'AI Engineering' Deep Dive

https://www.youtube.com/watch?v=KuPA1l9Rb_E

1•barisbll•6h ago

Comments

barisbll•6h ago

I recently did a deep dive into Chip Huyen's "AI Engineering" and one argument stood out as particularly crucial for teams building with LLMs today. It's the clear, principled distinction between Retrieval-Augmented Generation (RAG) and finetuning.

Many engineering teams instinctively reach for finetuning as a way to "teach" a model their private data. The thinking is that if you train it on your documents, it will "know" them. However, this is often a misuse of the technique. Finetuning is most effective at altering the form and behavior of a model—making it communicate in a certain style, adhere to a specific JSON schema, or follow a complex chain of instructions. It is an expensive and imprecise tool for knowledge injection.

The book argues that RAG is the superior tool for providing facts. By retrieving relevant information from an external knowledge base at inference time and adding it to the prompt context, you get several advantages:

Factual Grounding: The model is less likely to hallucinate because its context is bounded by the retrieved documents.

Traceability: You know exactly which source documents were used to generate an answer.

Up-to-date Knowledge: The knowledge base can be updated continuously without the cost of retraining/finetuning the model itself.

The core takeaway is that teams should default to RAG for knowledge-based tasks and reserve the more complex and expensive process of finetuning for tasks that require altering the model's fundamental behavior. This seems like a critical architectural decision that could save significant resources. Curious to hear how others are approaching this trade-off.

Show HN: ColecoVision Emulator in Zig

https://github.com/braheezy/zoleco

2•braheezy•3m ago•0 comments

YouTube No Translation

https://addons.mozilla.org/en-GB/firefox/addon/youtube-no-translation/

1•doener•3m ago•1 comments

GOP Bill Adds Surprise Tax That Could Cripple Wind and Solar Power

https://www.nytimes.com/2025/06/29/climate/gop-bill-adds-surprise-tax-that-could-cripple-wind-and-solar-power.html

1•pesenti•12m ago•0 comments

Community Is Not That

https://rosie.land/posts/community-is-not-that/

1•rosiesherry•12m ago•2 comments

Show HN: Ciara – Securely deploy any application on any server

https://ciara-deploy.dev/index.html

2•andresribeiro•13m ago•0 comments

Sea slugs consume algae, incorporate photosynthetic parts to produce nutrients

https://news.harvard.edu/gazette/story/2025/06/stealing-a-superpower/

1•gmays•14m ago•0 comments

The Forgotten Half of Scientific Thinking

https://www.pnas.org/doi/10.1073/pnas.1404649111

1•gtsnexp•14m ago•0 comments

Rome: The Wild Within the Walls

https://www.biographic.com/the-wild-within-the-walls/

1•bookofjoe•14m ago•0 comments

Writing Code to Be Read at a Glance

https://jelv.is/blog/Writing-Code-To-Be-Read-at-a-Glance/

1•tikhonj•15m ago•0 comments

SlugOS has a working PMM, VMM, and Heap allocator

https://imgur.com/a/MG06P0A

2•OSDeveloper•16m ago•1 comments

Nanosaur: Jetson robot 3D printable, open-source, ROS 2 and Isaac ROS based

https://nanosaur.ai/

1•882542F3884314B•22m ago•0 comments

Reimplementing Dynamic Arrays

https://github.com/florianmarkusse/FLOS/blob/master/articles/dynamic-array/article.md

2•flox901•23m ago•0 comments

Ask HN: Languages Designed for WASM?

2•90s_dev•24m ago•3 comments

Foxing

https://en.wikipedia.org/wiki/Foxing

2•perihelions•29m ago•0 comments

Mailcoach

https://www.mailcoach.app

1•effortlessback•30m ago•0 comments

TenZorro – AI-based platform with various digital services

https://tenzorro.com/en

1•paulo20223•34m ago•0 comments

Boeing 787 software may have caused AI crash: Aviation expert

https://sundayguardianlive.com/investigation/boeing-787-software-may-have-caused-ai-crash-aviation-expert

2•fcpguru•35m ago•1 comments

Mathematician Terence Tao's three hour long conversation with Lex Fridman [video]

https://www.youtube.com/watch?v=HUkBz-cdB-k

1•ViktorRay•35m ago•0 comments

Ask HN: According to HN and r/reactjs, NextJS = bad. Yet many sites use it. Why?

1•consumer451•36m ago•0 comments

UV-C light kills nearly everything–except this unusual organism

https://www.science.org/content/article/uv-c-light-kills-nearly-everything-except-unusual-organism

1•domofutu•41m ago•0 comments

ESP32-LLM: Running a Little Language Model on the ESP32

https://github.com/AIWintermuteAI/esp32-llm

1•lnyan•43m ago•0 comments

Ruthless prioritization while the dog pees on the floor

https://longform.asmartbear.com/prioritization/

1•kiyanwang•44m ago•0 comments

Y Combinator (Math) Explained

https://anish.ink/posts/ycombinator.html

2•kaiwenwang•46m ago•0 comments

Ask HN: What Happened to James Halliday ( Substack)?

1•bit1993•47m ago•0 comments

Silicon Valley Insider Exposes Cult Like AI Companies [video]

https://www.youtube.com/watch?v=8enXRDlWguU

1•ttctciyf•47m ago•1 comments

Uncovering and addressing the secret water footprint of AI models

https://arxiv.org/abs/2304.03271

1•1vuio0pswjnm7•47m ago•0 comments

Semantic Kernel for JavaScript

https://kerneljs.com/

1•afshinmeh•48m ago•1 comments

Show HN: Kanban-Board agent management for software dev agents, open source

https://github.com/cairn-dev/cairn

1•brich4419•52m ago•0 comments

Buridan's Ass

https://en.wikipedia.org/wiki/Buridan%27s_ass

1•danielschreber•53m ago•0 comments

Apple's Swift Working to Support Android App Development

https://www.macrumors.com/2025/06/27/swift-to-support-android-app-development/

1•have-a-break•54m ago•0 comments