frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Localaik – Run OpenAI and Gemini APIs Locally for CI and Tests

https://github.com/harshaneel/localaik
1•gokhalh•47m ago

Comments

jeremyfelps•17m ago
Missing piece in the agent-testing chain. Testing agentic workflows that span dozens of LLM calls, real API costs blow up fast, and mock libraries always lag actual API behavior by months. A local shim that speaks both Gemini and OpenAI on the same port is the right shape for CI.

Curious about the function-calling determinism. Even at temp 0, small-model JSON schema adherence drifts, especially on rare function shapes. Are you stabilizing it somehow for CI assertions, or just 'rerun on flake'?

The PDF-to-images path is interesting. Multimodal agent tests are where I keep hitting walls. Out of scope for v1, or queued?

GHA service container pattern is the right call. Stealing this idea.

gokhalh•2m ago
Thanks, that's exactly the gap I was trying to fill.

On function-calling determinism: you're right that small-model JSON adherence drifts even at temp 0. localaik does not fully solve this. Two paths that help:

1. Constrained decoding. llama.cpp supports GBNF grammars for shape-strict generation. Passing a grammar via the upstream `grammar` field gives you JSON that parses and matches the schema by construction. It does not guarantee field VALUES (the model can still pick "wrong" entity names), but it eliminates the parsing-flake class of failures. Adding first-class support for this in the translation layer is on the roadmap.

2. Assertion strategy. Test the SHAPE of the tool call (was the right tool invoked? did it receive an arg named X?), not exact arg values. This is good practice for any LLM CI regardless of localaik. The "rerun on flake" anti-pattern is what I'm trying to avoid by being deterministic in the proxy layer, not the model layer.

GHA service container: glad it's useful. The 5-min cold-start budget (--health-retries 30) is the one tuning knob most folks miss. Steal away, that's why it's MIT.

The New Workspace: A First-Principle Exploration of Dictation, Agents and Humans

https://www.inferterra.com/the-new-workspace-a-first-principles-exploration-of-dictation-agents-a...
1•matt_teresi•24s ago•1 comments

Confuse some SSH bots and make botters block you

https://ai.realhackers.org/confuse-some-ssh-bots.html
1•Bender•26s ago•0 comments

Raindrop Workshop: Your local OSS agent debugger

https://github.com/raindrop-ai/workshop
1•jamest•28s ago•0 comments

Show HN: Ait – Claude, Codex, and Aider as a team, on your laptop

https://github.com/m24927605/ait
1•m24927605•2m ago•1 comments

Google Flow Beta for Android

https://play.google.com/store/apps/details?id=com.google.android.apps.labs.whisk&hl=en_US
2•julianpye•4m ago•0 comments

A 1955 Los Alamos computer experiment changed our understanding of chaos

https://www.lanl.gov/media/publications/1663/science-of-unpredictability
1•LAsteNERD•5m ago•1 comments

Google IO 26 Keynote

https://www.youtube.com/watch?v=wYSncx9zLIU
3•Dinux•8m ago•0 comments

Polymarket debuts prediction markets tied to private companies

https://www.reuters.com/legal/government/polymarket-debuts-prediction-markets-tied-private-compan...
1•thm•9m ago•0 comments

Political Money Is Flowing to Influencers. But from Whom?

https://www.nytimes.com/2026/05/16/business/media/influencers-political-financing-disclosure.html
1•thm•10m ago•0 comments

Drone Swarms: Uncomfortably Plausible

https://silencingmachine.org/posts/0004-its-not-the-terminator-its-worse/
1•jaybill•10m ago•0 comments

Ask HN: Reliability Issues with AWS

2•yonisto•10m ago•0 comments

We made our sandbox filesystem 47× faster by deleting it

https://microsandbox.dev/blog/block-backed-rootfs
3•makeboss•10m ago•0 comments

Vendergood: Reconstructing a 1905 constructed language for AI agent cognition

https://github.com/mekickdemons-creator/libro-vendergood
1•Mekickdemons•11m ago•1 comments

Together, Edera and Minimus Claim They Can Protect Your Software from AI Hackers

https://cloudnativenow.com/features/together-edera-and-minimus-claim-they-can-protect-your-softwa...
1•CrankyBear•11m ago•0 comments

What's Easy Now? What's Hard Now?

https://brooker.co.za/blog/2026/05/18/whats-easy-whats-hard.html
2•KraftyOne•11m ago•0 comments

Bill C-22 Surveils Ordinary Canadians While Leaving Cartel Networks Untouched

https://www.thebureau.news/p/bill-c-22-surveils-ordinary-canadians
1•laurex•11m ago•0 comments

xs

https://cryptm.org/xs/
1•tosh•12m ago•0 comments

Anyone else been seeing any Google branding changes today?

1•dragonsenseiguy•14m ago•0 comments

Show HN: Agent threads – Share Claude Code and Codex sessions as public links

https://agent-thread.com
1•pixxxel•14m ago•0 comments

Dynamic Filters: 25x Faster Queries by Passing Info Between Operators

https://datafusion.apache.org/blog/2025/09/10/dynamic-filters/
1•killme2008•14m ago•0 comments

Trump Mobile Phone Is Finally Released–With a Major Blunder

https://www.thedailybeast.com/trump-mobile-phone-is-finally-releasedwith-a-major-blunder/
2•cf100clunk•14m ago•1 comments

Show HN: Pack.sh – Self-host single-file apps

https://pack.sh/
1•gkiely•15m ago•1 comments

Minnesota passes the nation's first ban on 'nudification' apps

https://19thnews.org/2026/04/minnesota-nudification-ban-ai-deepfake/
4•laurex•15m ago•0 comments

Click your location and this map will generate an efficient 5-bar pub crawl

https://mapsmania.github.io/geocoder/pubcrawlr.html
1•mikelgan•17m ago•2 comments

Human newborns form musical predictions based on rhythmic, not melodic structure

https://journals.plos.org/plosbiology/article?id=10.1371%2Fjournal.pbio.3003600&utm_source=clivet...
1•bookofjoe•17m ago•0 comments

You've Graduated. Now What?

https://www.adpresearch.com/research/2026-youve-graduated-now-what
1•lwhsiao•17m ago•0 comments

Chicago vs. New York Pizza Is the Wrong Argument

https://www.hillelwayne.com/post/pizza/
2•ethanhawksley•18m ago•1 comments

Growing tribe of jobless techies is stuck in Silicon Valley's new reality

https://www.latimes.com/business/story/2026-05-19/ai-layoffs-jobless-tech-workers-silicon-valley
3•1vuio0pswjnm7•18m ago•0 comments

Fixing the Most Dangerous Dam in the World

https://practical.engineering/blog/2026/5/19/fixing-the-most-dangerous-dam-in-the-world
2•michaefe•19m ago•0 comments

Maintenance is more than bugs

https://tylerrussell.dev/2026/05/19/maintenance-is-more-than-bugs/
2•terussell85•20m ago•0 comments