frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Show HN: I created an AI-powered Python testing suite that writes its own tests

1•MarcoDewey•2h ago
I've been working on a project that I'm excited to share with the Hacker News community. It's an AI-powered Python testing suite that uses a hybrid AI approach to automatically generate comprehensive unit tests, perform fuzz testing, and even conduct mutation testing to assess the quality of your existing test suites.

*The Problem*

As a developer, I've always found writing and maintaining a robust test suite to be one of the most time-consuming and challenging aspects of software development. It's often difficult to think of all the possible edge cases and to ensure that your tests are actually effective at catching bugs.

*The Solution*

To address this, I've created an MCP server that leverages both Google's Gemini AI and BAML (Boundary ML) to provide a suite of intelligent testing tools. The server is built on the FastMCP framework and can be easily integrated into your existing workflow.

*Technical Deep Dive*

Here's a breakdown of the key features and how they work:

* *Hybrid AI Approach:* The project uses a hybrid AI approach that combines the strengths of both BAML and Gemini. BAML is used for structured test generation, ensuring that the output is always in a consistent and parseable format. Gemini is used for its powerful language understanding capabilities, which allows it to generate creative and challenging test cases.

* *Intelligent Unit Test Generation:* The unit test generator uses AI to create a comprehensive suite of tests for your Python code. It automatically identifies edge cases, error conditions, and other potential sources of bugs. The generated tests are written using the `unittest` framework and include proper assertions and error handling.

* *AI-Powered Fuzz Testing:* The fuzz tester uses AI to generate a diverse range of inputs to test the robustness of your functions. It can generate everything from simple edge cases to malformed data and large inputs, helping you to identify potential crashes and other unexpected behavior.

* *Advanced Coverage Testing:* The coverage tester uses a combination of AST analysis and AI-powered test generation to achieve maximum code coverage. It identifies all possible branches, loops, and exception paths in your code and then generates tests to cover each of them.

* *Intelligent Mutation Testing:* The mutation tester uses a custom AST-based mutation engine to assess the quality of your existing test suite. It generates a series of small, syntactic changes to your code (mutations) and then checks to see if your tests are able to detect them. This helps you to identify gaps in your test coverage and to improve the overall effectiveness of your tests.

*Call to Action*

I'm still actively developing the project, and I would love to get your feedback. You can find the source code on GitHub: https://github.com/jazzberry-ai/python-testing-mcp

I'm particularly interested in hearing your thoughts on the following:

* Are there any other testing tools that you would like to see added to the suite? * Have you found any interesting bugs or edge cases using the tool? * Do you have any suggestions for improving the prompts or the AI models?

Thanks for reading, and I look forward to hearing from you!

A Hiker Was Missing for Nearly a Year. Then an AI System Spotted His Helmet

https://www.wired.com/story/missing-hiker-ai-drone-recovery/
1•FinnLobsien•1m ago•0 comments

Gut-Feelings vs. Metrics

https://thetechenabler.substack.com/p/gut-feelings-vs-metrics
1•ingve•1m ago•0 comments

A top designer was banned from Dribbble. Now he's building his own competitor

https://techcrunch.com/2025/08/04/a-top-designer-was-banned-from-dribbble-now-hes-building-his-own-competitor/
1•kehiy•15m ago•1 comments

The Curse of the A-Word

https://temporal.io/blog/sergey-the-curse-of-the-a-word
1•matesz•15m ago•0 comments

Nerve Calm Canada

https://www.facebook.com/Nerve.Calm.Canada.Official
1•Polikaiua•15m ago•0 comments

Show HN: Embeddable -build interactive experiences you can drop into any website

https://embeddable.co/
3•dsternlicht•18m ago•0 comments

Horizon Beta (ChatGPT 5?)

https://openrouter.ai/openrouter/horizon-beta
1•franze•25m ago•0 comments

Numai is an open source JavaScript spreadsheet powered by AI

https://github.com/clauderouxster/NUMAI
2•clauderoux•26m ago•1 comments

When to Hire a Computer Performance Engineering Team

https://www.brendangregg.com/blog/2025-08-04/when-to-hire-a-computer-performance-engineering-team-2025-part1.html
1•rochoa•27m ago•0 comments

Bread, Circuses and Education

https://anandsanwal.me/bread-and-circuses-in-modern-education/
1•car•29m ago•0 comments

A

3•narenthiran•33m ago•1 comments

Show HN: I built a platform to showcase digital projects and connect creators

https://www.showmysites.com/
1•toutoulliou•36m ago•0 comments

Chinese blessing scam more prominent in Australia as operators re-emerge

https://www.abc.net.au/news/2025-08-05/blessing-scam-targeting-chinese-women-australia-investigation/105591666
2•testrun•36m ago•0 comments

You Have Too Many Metrics

https://staysaasy.com/management/2025/08/02/metrics.html
1•thisismytest•36m ago•0 comments

Adopting Claude Code: Riding the Software Economics Singularity

https://preset.io/blog/adopting-claude-code-riding-the-software-economics-singularity/
3•Pranav2612000•44m ago•0 comments

Show HN: We built this to save creators time, looking for early users (free)

https://mc-web-feedme.framer.website/feedme
1•cbpark•44m ago•0 comments

Show HN: Dataset Explorer – Free tool to search any public datasets

https://www.hunch.dev/data-explorer
4•matankleym•44m ago•1 comments

Type Safety Back and Forth

https://www.parsonsmatt.org/2017/10/11/type_safety_back_and_forth.html
1•thunderbong•46m ago•0 comments

Inside North Korea's effort to infiltrate U.S. companies

https://www.cnn.com/interactive/2025/08/05/world/north-korea-it-worker-scheme-vis-intl-hnk/index.html
2•lurkshark•47m ago•0 comments

P-fast trie: lexically ordered hash map

https://dotat.at/@/2025-08-04-p-fast-trie.html
1•ingve•59m ago•0 comments

Show HN: Datastripes – No-Code Data Dashboard Builder

https://datastripes.com/
1•vinserello•1h ago•0 comments

Jira is not the problem. Your process is the problem. [Re-post]

https://www.leadinginproduct.com/p/jira-is-not-the-problem
1•benkan•1h ago•0 comments

Mounting Large Files to Containers Efficiently

https://anemos.sh/blog/mounting-large-files/
2•notanaverageman•1h ago•0 comments

SEO Is Dead. Say Hello to Geo

https://nymag.com/intelligencer/article/seo-is-dead-say-hello-to-geo.html
2•mobilio•1h ago•0 comments

What's New in ViteLand: July 2025 Recap

https://voidzero.dev/posts/whats-new-jul-2025
1•manniL•1h ago•0 comments

Algorithms for Optimization (second edition preview) [pdf]

https://algorithmsbook.com/optimization/files/optimization.pdf
1•i_don_t_know•1h ago•1 comments

GitHub Nuked My Account at Midnight During Alpha Release

https://www.stack-junkie.com/blog/GitHub-Nuked-My-Account-at-Midnight-During-Alpha-Release-Why-I-Rage-Quit-to-GitLab
4•OuterVale•1h ago•0 comments

Money Plus Sunset

https://en.wikipedia.org/wiki/Microsoft_Money
2•tosh•1h ago•0 comments

Show HN: Semi Intelligence – Instant. Tiny. Lightweight

https://github.com/SuperGamer474/Semi_Intelligence
2•SuperGamer474•1h ago•1 comments

Lisp interpreter with GC in <750 lines of Odin (and <500 lines of C)

https://github.com/krig/LISP
3•Bogdanp•1h ago•0 comments