frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Show HN: An AI agent that debugs your LLM app and submits pull requests

https://github.com/Kaizen-agent/kaizen-agent
1•yuto_1192•4h ago
Hi HN,

I built Kaizen Agent, an open-source CLI tool that acts like an AI QA engineer for your LLM applications and agents.

It helps catch broken behavior, apply fixes automatically, and open a pull request — all in one workflow.

What it does: - Runs test inputs with expected outputs

- If a test fails, it analyzes the failure

- Applies prompt/code fixes

- Re-runs tests until they pass

- Submits a GitHub PR with the fix

Why I built it: I got tired of manually debugging and iterating while developing multi-step LLM agents. Writing test cases, checking outputs, tweaking prompts, rerunning — it was all too repetitive. This tool automates that loop.

Try it: GitHub: https://github.com/Kaizen-agent/kaizen-agent

Would love feedback — especially from anyone building agents, LLM tools, or testing frameworks. Curious how others are thinking about evaluation, brittleness, and automation in this space.

But what about my garden leave? (2023)

https://www.ft.com/content/4dbe4c46-647f-4019-b0c7-b8c8a752501c
1•walterbell•12m ago•0 comments

Sholay: Bollywood epic roars back to big screen after 50 years with new ending

https://www.bbc.com/news/articles/cvg8m9z5vv8o
1•sonabinu•14m ago•0 comments

Why Does Every Commercial for A.I. Think You're a Moron?

https://www.nytimes.com/2025/06/25/magazine/ai-commercials-ads-loneliness.html
4•lxm•16m ago•1 comments

p5.strands: Writing Shaders in JavaScript

https://www.davepagurek.com/blog/writing-shaders-in-js/
2•wonger_•20m ago•0 comments

Google DeepMind team up to solve the Navier-Stokes million-dollar problem

https://english.elpais.com/science-tech/2025-06-24/spanish-mathematician-javier-gomez-serrano-and-google-deepmind-team-up-to-solve-the-navier-stokes-million-dollar-problem.html
4•bilsbie•29m ago•0 comments

A real-time index for your codebase: Secure, personal, scalable

https://www.augmentcode.com/blog/a-real-time-index-for-your-codebase-secure-personal-scalable
1•handfuloflight•30m ago•0 comments

Counter Service: How we rewrote it in Rust

https://engineering.grab.com/counter-service-how-we-rewrote-it-in-rust
2•nnx•32m ago•0 comments

Amarok Audio Player replaces Phonon API with GStreamer

https://www.neowin.net/news/amarok-33-beta-2-replaces-phonon-api-with-gstreamer/
1•bundie•33m ago•0 comments

Free online picture splitter and Instagram grid maker

https://aiimagesplitter.com
1•zgm13827•34m ago•0 comments

Ask HN: Seeking Publisher for a Book on AI, Creativity and Human Agency

2•haebom•39m ago•1 comments

Show HN: AI Phone Interviewer – get a call in 30 seconds

1•OlehSavchuk•41m ago•1 comments

Disney+ Application Development Kit (ADK)

https://medium.com/disney-streaming/introducing-the-disney-application-development-kit-adk-ad85ca139073
3•imwally•42m ago•1 comments

AI company wins a copyright infringement lawsuit brought by authors

https://www.npr.org/2025/06/25/nx-s1-5445242/federal-rules-in-ai-companys-favor-in-landmark-copyright-infringement-lawsuit-authors-bartz-graeber-wallace-johnson-anthropic
2•dleslie•50m ago•1 comments

HarmonyOS Next Element Positioning

1•flfljh•55m ago•0 comments

Flutter Performance Tuning on HarmonyOS

1•flfljh•56m ago•0 comments

Hug CSS, how I approach CSS architecture

https://gomakethings.com/hug-css-how-i-approach-css-architecture/
3•Bogdanp•1h ago•0 comments

Refactoring Codebases Through Library Design

https://code-refactor.github.io/
1•PaulHoule•1h ago•0 comments

SSH Tron: Multiplayer Tron in your terminal

http://sshtron.zachlatta.com
1•nnx•1h ago•0 comments

Ask HN: What are alternatives to Glitch for hosting a simple Node/Express app?

1•sebastian_z•1h ago•0 comments

macOS Tahoe Beta Forces Sharing FileVault Key

https://mjtsai.com/blog/2025/06/24/macos-tahoe-beta-forces-sharing-filevault-key/
10•miles•1h ago•1 comments

Global climate was more dynamic and extreme than researchers had imagined

https://www.washingtonpost.com/climate-environment/2024/09/19/earth-temperature-global-warming-planet/
3•bilsbie•1h ago•0 comments

Radar AI Training

https://mjtsai.com/blog/2025/06/25/radar-ai-training/
1•bangonkeyboard•1h ago•0 comments

Pedagogy Unchained

https://learning-with-orin.beehiiv.com/p/pedagogy-unchained
2•BryanHoulton•1h ago•0 comments

Windows 10: News about ESU program – free options for consumers

https://borncity.com/win/2025/06/25/windows-10-news-about-esu-program-free-options-for-consumers/
1•miles•1h ago•0 comments

Ask HN: Can LLMs do batch classification?

1•iknownthing•1h ago•1 comments

3dSen PC v1.0

https://geod.itch.io/3dsenpc/devlog/969781/-3dsen-pc-v10-is-here-a-dream-10-years-in-the-making
1•prossercj•1h ago•0 comments

Stack grows down, but local variables grow up? Let me explain

https://www.gizvault.com/archives/stack-growth-differs-from-locals-growth
3•ricecat•1h ago•0 comments

Democratic Leaders Tried to Crush Zohran Mamdani, Should Have Been Taking Notes

https://www.nytimes.com/2025/06/25/opinion/zohran-mamdani-democratic-party.html
5•handfuloflight•1h ago•0 comments

Show HN: SVG Lined Tile Generator

https://adpreese.github.io/svg-lined-tiles/
1•adpreese•1h ago•0 comments

Show HN: AI Body Type Calculator with Personalized Health Plans (Updated)

https://mybodytype.net/
1•howardV•1h ago•0 comments