I was thinking about building something that would guide me during development and during PR reviews: something that gives me signals based on facts, risks, and evidence, not just one LLM reviewing the code it generated. The initial idea was to add a deterministic review layer, combine it with LLM reasoning, and use that to find gaps in the code and point me to the most important places, so I don't need to read line by line.
I ended up building a tool called vdiff, and it is working very well for me, and I'm constantly improving it. It is a CLI that analyzes your git diffs and gives you a structured report: what changed, what's risky, and what's missing. It uses tree-sitter for AST diffs and an LLM on top, so you get actual evidence for each finding, not just vibes.
Some of the output signals:
- Tells you if it's safe to merge, with a risk score
- Lists what's wrong, how confident it is, and shows the evidence
- Dependency graph for blast radius analysis
- Review memory (tracks resolved/reopened findings across sessions)
- You can point it at a spec or PRD, and it checks if the changes actually match
- Structural metrics (acyclicity, depth, equality, graph)
It runs locally; I didn't want the tool shipping your code off to some third-party server, so the tool itself never sends your code anywhere. It's BYOK (bring your own LLM key), so the only network traffic is directly between you and your chosen provider.
If you want to give it a try:
npm i -g @4bk/vdiff # install globally
pip install graphifyy # required to generate the knowledge graph
cd your-project # go to a git repo
vdiff init # set up provider, API key, build knowledge graph
vdiff -v # analyze staged changes
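If you want it to run automatically, here's a minimal git pre-commit hook sketch. This is purely illustrative: it assumes vdiff exits non-zero when it flags blocking issues, which you should verify against the actual exit-code behavior before relying on it as a gate.

```shell
# Hypothetical pre-commit hook: run vdiff on staged changes before every commit.
# Assumption (not confirmed): vdiff returns a non-zero exit code on risky findings.
cat > .git/hooks/pre-commit <<'EOF'
#!/bin/sh
vdiff -v || {
  echo "vdiff flagged risky changes; commit aborted (use git commit --no-verify to skip)"
  exit 1
}
EOF
chmod +x .git/hooks/pre-commit
```

`--no-verify` is git's standard escape hatch, so you can still commit past the hook when you disagree with the report.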
Would love to hear if this is helpful for you as well, and what kind of signals you'd want to see. I usually run it before each commit on a feature branch, and then on CI to verify the feature branch against main. Any feedback is very welcome, and if it is crap, well, then just say it.
Cheers