Passing tests and coverage are often weaker signals than teams assume. Code can be exercised, tests can pass, and yet the actual acceptance criteria may still be only weakly protected.
More about the problem: https://www.dmytrohuz.com/p/i-built-ac-trace-to-question-the
ac-trace maps acceptance criteria to code and tests, then mutates the mapped code to check whether the tests really catch the breakage.
Current scope is intentionally narrow: Python + pytest. It’s an early experiment, and I’d really value feedback from people working on testing, QA/dev productivity, and AI-assisted development workflows.