Chaos Testing for LLM Agents

https://github.com/arielshad/balagan-agent

2•ArielSh•1w ago

Comments

ArielSh•1w ago

An open-source experiment applying chaos-engineering ideas to LLM agents.

Agent development feels fast, but reliability is mostly assumed. Agents depend on prompts, tools, APIs, and implicit coordination. When something breaks, behavior degrades in subtle ways and we usually find out too late.

Balagan Agent intentionally injects controlled “chaos” into agent workflows to surface failure modes early:

- Tool failures, latency, partial responses
- Prompt drift and unexpected decisions
- Hidden assumptions in sequencing and coordination

The goal is not load testing, but understanding how fragile an agent really is and where guardrails are needed.
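To make the idea concrete, here is a minimal sketch of injecting tool failures, latency, and partial responses around a single tool call. It is not Balagan Agent's actual API: `with_chaos`, `InjectedToolError`, and all parameter names are hypothetical, and the tool is assumed to be a plain function from a string to a string.

```python
import random
import time
from typing import Callable, Optional


class InjectedToolError(RuntimeError):
    """Hypothetical error type standing in for a real tool failure."""


def with_chaos(
    tool: Callable[[str], str],
    *,
    failure_rate: float = 0.0,
    truncate_rate: float = 0.0,
    max_delay_s: float = 0.0,
    rng: Optional[random.Random] = None,
) -> Callable[[str], str]:
    """Wrap a tool so that some calls fail, stall, or return partial output."""
    rng = rng or random.Random()

    def chaotic_tool(query: str) -> str:
        # Latency: sleep for a random duration up to max_delay_s.
        if max_delay_s > 0:
            time.sleep(rng.uniform(0, max_delay_s))
        # Tool failure: raise instead of calling the real tool.
        if rng.random() < failure_rate:
            raise InjectedToolError(f"injected failure for {query!r}")
        result = tool(query)
        # Partial response: return only the first half of the output.
        if rng.random() < truncate_rate:
            return result[: len(result) // 2]
        return result

    return chaotic_tool
```

Passing a seeded `random.Random` makes a chaotic run reproducible, which helps when a failure mode needs to be replayed while adding a guardrail.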

This started as a side project to explore whether chaos-style testing makes sense for agents, similar in spirit to what Chaos Monkey did for distributed systems.

chrisjj•1w ago

> Problem

> * Agents fail silently in production

> * Tool calls time out, return garbage, or hallucinate

> * Context gets corrupted, budgets get exhausted

> * Nobody knows until users complain

Nobody? You can't blame a sick parrot for its keeper's failure to monitor it.

ArielSh•1w ago

How would you phrase it?

chrisjj•1w ago

* Agent code neglects to adequately detect and report the error conditions arising from the spectacularly unreliable "AI" on which it relies.
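One way to read that phrasing in code, as a sketch only: wrap each tool call so that exceptions and invalid outputs are logged and returned as explicit failures rather than passed along silently. `checked_call`, `ToolResult`, and the logger name are hypothetical and do not come from the linked project.

```python
import logging
from dataclasses import dataclass
from typing import Callable, Optional

logger = logging.getLogger("agent.tools")  # hypothetical logger name


@dataclass
class ToolResult:
    ok: bool
    value: Optional[str] = None
    error: Optional[str] = None


def checked_call(
    tool: Callable[[str], str],
    query: str,
    validate: Callable[[str], bool],
) -> ToolResult:
    """Call a tool, then detect and report failures instead of hiding them."""
    try:
        value = tool(query)
    except Exception as exc:
        # Timeouts and other tool errors are logged and surfaced to the caller.
        logger.error("tool raised for %r: %s", query, exc)
        return ToolResult(ok=False, error=f"exception: {exc}")
    if not validate(value):
        # Garbage output is treated as a failure, not as a usable answer.
        logger.error("tool returned invalid output for %r: %r", query, value)
        return ToolResult(ok=False, error="invalid output")
    return ToolResult(ok=True, value=value)
```

The caller then has to branch on `ok`, so a failed tool call becomes a visible decision point in the agent loop.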