frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Show HN: AgentCheck – Snapshot and Replay AI Agents Like Real Software

https://github.com/hvardhan878/agentcheck
1•hvardhan878•14h ago
Hey HN,

I built AgentCheck, an open-source testing tool for LLM agents. It lets you:

Snapshot full agent runs (prompt, LLM calls, tool outputs, final answer)

Replay the trace locally — no API calls, no token costs

Diff agent behavior over time

Assert outputs to catch regressions

Why? Because today, most AI agents are tested by spot-checking outputs or rerunning flaky evals — which breaks CI, costs money, and misses edge cases. AgentCheck works more like Jest or VCR.py, but for LLM workflows. It records and replays traces so you can test agents like real software.

It’s CLI-first, dev-friendly, and designed to plug into LangChain/OpenAI workflows.

Still early I’d love feedback, contributors, and use cases from folks building agentic systems. The code’s here: https://github.com/hvardhan878/agentcheck

Thanks!

Nothing's Untestable

https://antithesis.com/blog/2025/bugbash_2025/mitchell_hashimoto/
1•zdw•5m ago•0 comments

Show HN: I made a social media platform

https://onelined.tech/
1•sahil423•16m ago•0 comments

How to write Rust in the kernel part 1

https://lwn.net/Articles/1024202/
1•pkilgore•22m ago•0 comments

CEOs Start Saying the Quiet Part Out Loud: AI Will Wipe Out Jobs

https://www.wsj.com/tech/ai/ai-white-collar-job-loss-b9856259
1•planetjones•23m ago•1 comments

Debian on Apple M1/M2: status and call for testers

https://lists.debian.org/msgid-search/86037b55-e1b8-49e6-a0c9-f961b4ddc1a1@disroot.org
2•pabs3•24m ago•0 comments

GitHub Copilot coding agent now has a Playwright web browser

https://github.blog/changelog/2025-07-02-copilot-coding-agent-now-has-its-own-web-browser/
1•felineflock•26m ago•0 comments

Show HN: Piskvor Prime: a five-in-a-row iOS game with a reactive AI opponent

https://vojtahavlicek.github.io/vojtanyc/posts/piskvor_prime/
1•vh311•26m ago•0 comments

Show HN: Wyntk.ai – anti horseless carriage email

https://www.wyntk.ai/
1•gregorvand•26m ago•0 comments

Give Footnotes a Spec

https://nathansnelgrove.com/2025/07/give-footnotes-a-spec
1•OuterVale•31m ago•0 comments

Braess Paradox [video]

https://www.youtube.com/watch?v=-QTkPfq7w1A
1•travisgriggs•31m ago•1 comments

TPC-DS Benchmark: Trino 476, Spark 4.0.0, and Hive 4 on MR3 2.1

https://mr3docs.datamonad.com/blog/2025-07-02-performance-evaluation-2.1/
1•epdlxjmonad•33m ago•1 comments

Show HN: GenZ AI – Your Voice, but Fluent in Gen Z

https://twitter.com/MisbahSy/status/1940609386927521900
1•misbahsy•33m ago•0 comments

Ask HN: Building for Joy vs. Building for Scale

1•chbkall•35m ago•0 comments

OpenAI to Sponsor Driver Alex Palou at Mid-Ohio IndyCar Race

https://www.sportsbusinessjournal.com/Articles/2025/07/02/openai-gets-first-livery-position-with-ganassi-at-mid-ohio-as-ai-leader-looks-to-racing-for-insights/
1•tekdude•37m ago•0 comments

Learning F# with Falco: Response Localization

https://rewiring.bearblog.dev/learning-f-with-falco-response-localization/
1•Mossy9•39m ago•0 comments

Why the superyachts are getting bigger and bigger

https://www.bbc.com/news/articles/cvgnwx0lwwdo
1•andsoitis•41m ago•1 comments

Aphrodisiac

https://www.rxjourney.net/the-ultimate-aphrodisiac
1•chidieberechigo•43m ago•0 comments

Natasha Lyonne reveals David Lynch was a supporter of AI

https://faroutmagazine.co.uk/natasha-lyonne-reveals-david-lynch-supporter-ai/
2•CharlesW•48m ago•0 comments

Accelerate Legacy Application Modernization 4 times faster

https://www.techolution.com/products/appmod-ai-for-enterprises/
1•tech28•49m ago•0 comments

Third Interstellar Object Discovered

https://minorplanetcenter.net/mpec/K25/K25N12.html
2•gammarator•50m ago•0 comments

David Romero's Digital Models of Frank Lloyd Wright's Unrealized Buildings

https://www.thisiscolossal.com/2025/06/david-romero-frank-lloyd-wright/
2•CharlesW•50m ago•0 comments

You People Keep Contradicting Yourselves

https://www.taylor.gl/blog/27
1•taylorlunt•54m ago•0 comments

Windows 11 Start menu uses a 15 MB JSON for categories

https://www.windowslatest.com/2025/07/03/windows-11-start-menu-uses-a-15mb-json-not-ai-to-organize-apps-under-categories/
3•lcnmrn•58m ago•2 comments

2025 AsiaLLVM Developers' Meeting Talks

https://www.youtube.com/playlist?list=PL_R5A0lGi1ADKfJbzpA0rMDCb5T3QGe5k
1•matt_d•1h ago•1 comments

Open Co-Scientist Agents: Recreating Google's AI Co-Scientist in LangGraph

https://github.com/conradry/open-coscientist-agents
1•conradry•1h ago•0 comments

The Mechanic Johnny Cash and Elvis Would've Wanted (Toolbox Tour) [video]

https://www.youtube.com/watch?v=xrHtzSIh2GQ
1•meandave•1h ago•0 comments

What happens to your brain when you watch videos online at faster speeds

https://theconversation.com/what-happens-to-your-brain-when-you-watch-videos-online-at-faster-speeds-than-normal-259930
1•Duanemclemore•1h ago•2 comments

Is that a Lululemon Scuba hoodie or Costco dupe? No one has to know

https://www.washingtonpost.com/style/fashion/2025/01/25/costco-dupe-lululemon-scuba-hoodie-danskin/
2•walterbell•1h ago•0 comments

Has Xbox Considered Laying One Person Off Instead of Thousands

https://aftermath.site/xbox-layoffs-microsoft-phil-spencer
6•Narishma•1h ago•1 comments

Mr. Abrego's Account of Torture at CECOT in El Salvador

https://www.muellershewrote.com/p/mr-abregos-account-of-torture-at
8•tastyface•1h ago•1 comments