frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Building reliable agentic AI systems

https://martinfowler.com/articles/reliable-llm-bayer.html
53•sarangk90•2h ago

Comments

marsven_422•1h ago
You cannot
Littice•53m ago
The part about context discipline feels underrated. Larger context windows don’t remove the need to decide what the model shouldn’t see.
sarangk90•41m ago
Totally!
padolsey•45m ago
These vast multi-agentic systems with roles like 'Researcher', 'Writer' (with a review loop), 'Reflection agent', seem to ~feel~ mostly right but lack evals as to the merit of agent decomposition. So it forms a satisfying enough flowchart but I see no evidence these authors actually tried other approaches or agent roles. And let's be honest: an agent is just a system prompt and output contracts, and these rich architectures seem to be pontificating beyond their worth. It all feels a bit vibe-y.
ai_slop_hater•31m ago
> Sarang Kulkarni is a Principal Consultant at Thoughtworks

> teaches an O’Reilly course on building production-ready RAG applications

isn't this basically saying that you are a scammer? or am I paranoid?

ai_slop_hater•26m ago
Why is comment from padolsey dead? Seriously, something fishy is going on on this website.
altmanaltman•25m ago
> The author used AI assistance during the writing of this article. AI tools were used for brainstorming ideas, creating outlines, and reviewing drafts to polish language and improve clarity.

The first sentence makes it seem like they just used to improve sentence structure etc but the second line makes it seem like they used it for 90% of the work. Which one is true?

ares623•15m ago
Your question answered it I think. The first sentence aims to mislead. The second sentence covers their ass.

I'd love to see the number of man hours that led to that sentence, and how proud they were to have come up with it.

smallnix•19m ago
What was the main driver for a dynamic workflow with loops vs a rigid forward running only workflow. The non-deterministic nature of these loops with LLM decision points doesn't mesh well with the transparency requirement imho
bob1029•16m ago
The most important part is the database that the agent can see and how clean the data is. I pitched a custom enterprise agent to a client thinking it would be maybe 50/50 time on data vs agent tuning, but it's more like 99/1.

The alignment process goes very quickly once you have all the fish in exactly one barrel. I think pulling data dynamically from the source systems is where this turns into a game of whack-a-mole.

The problem with dynamic fetch is that you don't get any kind of persistent or compounding gains. There are queries that you simply cannot run because you'd chew through your GitHub, et. al., API quotas. It takes over 48h to fully hydrate the database for GitHub items on my current project. But, once that process is complete I can query across things like issue comments and do crosscutting joins with the state of other vendor systems in milliseconds.

I am finding the MSSQL dialect to be quite agreeable to the OAI models. With absolutely no prompting they will bootstrap off information schema and extended description properties every single time. If you design the schema for your audience, the amount of "Jesus prompting" you will require is much better controlled.

Developers don't understand CORS (2019)

https://fosterelli.co/developers-dont-understand-cors
108•toilet•5h ago•45 comments

Building reliable agentic AI systems

https://martinfowler.com/articles/reliable-llm-bayer.html
53•sarangk90•2h ago•10 comments

Zigzag Decoding with AVX-512

https://zeux.io/2026/06/17/zigzag-decoding-avx512/
28•luu•3d ago•0 comments

Renting a sewing machine from the library

https://www.bbc.com/future/article/20260618-the-weird-and-wonderful-libraries-of-finland
191•sohkamyung•7h ago•96 comments

Loupe – A iOS app that raises awareness about what native apps can see

https://github.com/mysk-research/loupe
195•Cider9986•18h ago•52 comments

Epoll vs. io_uring in Linux

https://sibexi.co/posts/epoll-vs-io_uring/
118•Sibexico•7h ago•33 comments

The 100k Whys of AI

https://lcamtuf.substack.com/p/the-100000-whys-of-ai
25•surprisetalk•58m ago•4 comments

Slow breathing modulates brain function and risk behavior

https://www.cell.com/neuron/fulltext/S0896-6273(26)00339-9
144•croes•8h ago•29 comments

Show HN: TownSquare, a tiny presence layer for websites

https://townsquare.cauenapier.com/
138•cauenapier•18h ago•69 comments

Your brain was never designed for this much bad news

https://www.sciencedaily.com/releases/2026/06/260614012006.htm
90•colinprince•2h ago•60 comments

15-minute at-home Lyme disease tick test

https://www.bostonglobe.com/2026/06/17/business/lyme-disease-tick-test/
85•bookofjoe•2d ago•29 comments

Guide to the TD4 4-bit DIY CPU

https://www.philipzucker.com/td4-4bit-cpu/
23•andrewstuart•2d ago•1 comments

SMPTE Makes Its Standards Freely Accessible

https://www.smpte.org/blog/smpte-makes-its-standards-freely-accessible-openingstandards-library-t...
245•zdw•13h ago•72 comments

When I reject AI code even if it works

https://vinibrasil.com/when-i-reject-ai-code-even-if-it-works/
138•vnbrs•5h ago•78 comments

UHF X11: X11 Built for VisionOS and Apple Vision Pro

https://www.lispm.net/apps/uhf-x11/
191•zdw•13h ago•33 comments

DOS Game "F-15 Strike Eagle II" reversing project needs DOS test pilots

https://neuviemeporte.github.io/f15-se2/2026/06/20/needyou.html
233•LowLevelMahn•15h ago•62 comments

Unauthorized alert sent to cell phones across Brazil

https://www.cnn.com/2026/06/20/americas/brazil-hackers-unauthorized-alert-latam
120•zdw•10h ago•84 comments

Armstrong Effect

https://en.wikipedia.org/wiki/Armstrong_effect
22•userbinator•2h ago•1 comments

Whole cross-sectional human ultrasound tomography

https://www.nature.com/articles/s41551-026-01660-4
60•lnyan•2d ago•10 comments

Semiconductor Lifeline Keeps Fighter Jets in the Air

https://spectrum.ieee.org/phoenix-semiconductors-legacychips-oems
68•rbanffy•4d ago•18 comments

Project Fetch: Phase Two

https://www.anthropic.com/research/project-fetch-phase-two
54•stopachka•6h ago•19 comments

The Lost Story of Alan Turing's "Delilah" Project

https://spectrum.ieee.org/alan-turings-delilah
6•asdefghyk•1h ago•1 comments

NOLA 'Nacular: One man's crusade to preserve New Orleans's vernacular signage

https://countryroadsmagazine.com/art-and-culture/people-places/nola-nacular/
33•NaOH•4d ago•2 comments

Linux eliminates the strncpy API after six years of work, 360 patches

https://www.phoronix.com/news/Linux-7.2-Drops-strncpy
167•simonpure•9h ago•132 comments

Alice is impatient

https://brooker.co.za/blog/2026/06/19/waiting.html
79•birdculture•10h ago•24 comments

Temporary Cloudflare accounts for AI agents

https://blog.cloudflare.com/temporary-accounts/
200•farhadhf•19h ago•104 comments

Show HN: StartupWiki – A Free Alternative to Crunchbase

https://startupwiki.tech/
187•shpran•14h ago•58 comments

Proportion-Integral-Derivative Controllers

https://en.wikipedia.org/wiki/PID_controller
16•dhorthy•1d ago•5 comments

The rise of South Korea’s weapons business

https://www.politico.com/news/magazine/2026/06/20/south-korea-weapons-dealer-trump-00959559
137•JumpCrisscross•18h ago•48 comments

Inference cost at scale with napkin math

https://injuly.in/blog/napkin-inference-cost/index.html
74•gmays•4d ago•15 comments