frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: SAIA – SCUMM for AI Agents

https://github.com/serendip-ml/llm-saia/blob/main/docs/INTRO.md
1•serendip-ml•1h ago

Comments

serendip-ml•1h ago
Hi HN, Karpathy's recent post [1] described Claws as "a new layer on top of LLM agents, taking orchestration, scheduling, context, tool calls to a next level." That's the right framing - but orchestration alone isn't enough. SAIA is the rails layer that makes that orchestration predictable.

[1] https://simonwillison.net/2026/Feb/21/claws/

Instead of prompting an LLM and hoping it does what you meant, the idea is to write in 12 verbs (ASK, VERIFY, CRITIQUE, REFINE, etc.) with typed outputs - each verb returns a dataclass, enforced by JSON schema at the API level. The name comes from SCUMM - the scripting language LucasArts used for Monkey Island. Constrained vocabulary, structured outputs, debuggable behavior.

The bigger goal: agents that actually improve over time. What I've learned building these is that without training, agents plateau quickly. They can remember facts, but they don't get better at their job. So feedback from execution flows into fine-tuning, and the model gets better at the specific task. Not "memory," but real learning.

For that to work, I needed to build multiple layers: - *llm-saia*: the protocol layer (this post) - rails between Python and LLM - *llm-infer*: inference server (vLLM, LoRA support) - *llm-kelt*: feedback collection → fine-tuning pipeline - *llm-gent*: agent runtime with traits, tools, persistence - *appinfra*: production Python infrastructure that holds it all together

Everything is open source. Happy to discuss design tradeoffs - the 12-verb constraint is intentionally limiting.

This is v0 - the vocabulary will evolve. If there's prior work I should know about, drop a link.

Open problems worth solving: - *Determinism*: same input → same output. Current idea: fine-tune models to follow verb contracts reliably. - *Verification*: how do you prove a verb did what it claimed? Tracing helps, but formal guarantees need real PL exper - *Composition*: when verbs chain, errors compound. Better error propagation and recovery needed.

Show HN: Rev-dep – 20x faster knip.dev alternative build in Go

https://github.com/jayu/rev-dep
1•jayu_dev•28s ago•0 comments

Show HN: Smplogs – Local-first AWS Cloudwatch log analyzer via WASM

https://www.smplogs.com
1•smplogs•33s ago•0 comments

Fries with that? Ordering from AI linked to selecting more indulgent foods

https://www.psu.edu/news/health-and-human-development/story/fries-ordering-ai-linked-selecting-mo...
2•geox•1m ago•0 comments

Tunnelling Torrents 'Properly' over a VPN with Port Forwarding

https://blog.muffn.io/posts/tunneling-torrents-over-vpn-with-port-forwarding/
2•eustoria•1m ago•0 comments

Python React to Elixir Phoenix Migration Breakdown

https://mrpopov.com/posts/python-react-to-elixir-phoenix-migration-breakdown/
2•tosh•2m ago•0 comments

I made my agents joke with each other [video]

https://www.youtube.com/watch?v=xWW2p_CWST4
2•bahaAbunojaim•3m ago•0 comments

OpenJS Foundation: A safe and modern home for JavaScript technologies

https://openjsf.org/
3•eustoria•3m ago•0 comments

Draining wetlands produces substantial emissions in the Canadian Prairies

https://theconversation.com/draining-wetlands-produces-substantial-emissions-in-the-canadian-prai...
2•PaulHoule•6m ago•0 comments

Feather – Draw in 3D

https://www.feather.art/
2•eustoria•6m ago•0 comments

Show HN: Turning 2D floor plans into 3D-ready JSON with Detectron2

https://www.loom.com/share/b741bc127c814db395832b01bd086e96
2•acaciabengo•7m ago•0 comments

Explain to Issue Reporter

https://github.com/fsprojects/fantomas/issues/1704
2•debugnik•8m ago•0 comments

Brave Search API now features Place Search, a new endpoint for map applications

https://brave.com/blog/place-search-api/
6•w0ts0n•8m ago•0 comments

Launch HN: Cardboard (YC W26) – Agentic video editor

https://www.usecardboard.com/
2•sxmawl•9m ago•0 comments

We Built a Video Rendering Engine by Lying to the Browser About What Time It Is

https://blog.replit.com/browsers-dont-want-to-be-cameras
1•cbrewster•9m ago•0 comments

OsmAnd's Faster Offline Navigation

https://osmand.net/blog/fast-routing/
2•todsacerdoti•9m ago•1 comments

AirSnitch: Demystifying and Breaking Client Isolation in Wi-Fi Networks

https://www.ndss-symposium.org/ndss-paper/airsnitch-demystifying-and-breaking-client-isolation-in...
2•todsacerdoti•10m ago•0 comments

People Leaving US

https://trendslates.substack.com/p/teen-ai-usage-amazon-alleged-price
2•faderisimo•12m ago•0 comments

My accepted research work on 'Failure-Aware Security Framework'

https://papers.ssrn.com/sol3/papers.cfm?abstract_id=6122687
2•sandhyavinjam•13m ago•0 comments

Bring Your Own Agent (BYOA)

https://blog.val.town/byoa
2•stevekrouse•13m ago•0 comments

Attacking Russia's Center of Gravity: A Clausewitzian Answer

https://www.militarystrategymagazine.com/article/attacking-russias-center-of-gravity-a-clausewitz...
2•bryanrasmussen•15m ago•0 comments

Human Made: The Pledge

https://lee-phillips.org/humanmade/
1•leephillips•16m ago•0 comments

GitHub Actions is left vulnerable to supply chain attacks: Datadog Report

https://www.datadoghq.com/state-of-devsecops/
3•varunsharma07•17m ago•0 comments

How Google Killed the Rent-a-Domain Era

https://growtika.com/blog/publisher-affiliate-collapse
2•Growtika•17m ago•0 comments

Show HN: Karta – Google Search, for discovering talent

https://www.karta.works
2•kidustiliksew•17m ago•0 comments

Smallest transformer that can add two 10-digit numbers

https://github.com/anadim/AdderBoard
2•ks2048•17m ago•0 comments

A Visual Guide to DNA Sequencing

https://www.asimov.press/p/dna-sequencing
2•mailyk•18m ago•0 comments

He saw an abandoned trailer. Then, uncovered a surveillance network

https://calmatters.org/justice/2026/02/alpr-border-patrol-caltrans/
11•Element_•20m ago•3 comments

Show HN: I built a local AI-powered Ouija board with a fine-tuned 3B model

https://github.com/SurceBeats/Planchette
3•SurceBeats•21m ago•2 comments

Using AI without losing skills

https://manafov.co/posts/using-ai-without-losing-skills
2•airbridgeflyer•22m ago•0 comments

Hyper: a reactive server side rendered web framework for Clojure

https://github.com/dynamic-alpha/hyper
2•rschmukler•23m ago•1 comments