frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Autofix Bot – Hybrid static analysis and AI code review agent

18•sanketsaurav•18h ago
Hi there, HN! We’re Jai and Sanket from DeepSource (YC W20), and today we’re launching Autofix Bot, a hybrid static analysis + AI agent purpose-built for in-the-loop use with AI coding agents.

AI coding agents have made code generation nearly free, and they’ve shifted the bottleneck to code review. Static-only analysis with a fixed set of checkers isn’t enough. LLM-only review has several limitations: non-deterministic across runs, low recall on security issues, expensive at scale, and a tendency to get ‘distracted’.

We spent the last 6 years building a deterministic, static-analysis-only code review product. Earlier this year, we started thinking about this problem from the ground up and realized that static analysis solves key blind spots of LLM-only reviews. Over the past six months, we built a new ‘hybrid’ agent loop that uses static analysis and frontier AI agents together to outperform both static-only and LLM-only tools in finding and fixing code quality and security issues. Today, we’re opening it up publicly.

Here’s how the hybrid architecture works:

- Static pass: 5,000+ deterministic checkers (code quality, security, performance) establish a high-precision baseline. A sub-agent suppresses context-specific false positives.

- AI review: The agent reviews code with static findings as anchors. Has access to AST, data-flow graphs, control-flow, import graphs as tools, not just grep and usual shell commands.

- Remediation: Sub-agents generate fixes. Static harness validates all edits before emitting a clean git patch.

Static solves key LLM problems: non-determinism across runs, low recall on security issues (LLMs get distracted by style), and cost (static narrowing reduces prompt size and tool calls).

On the OpenSSF CVE Benchmark [1] (200+ real JS/TS vulnerabilities), we hit 81.2% accuracy and 80.0% F1; vs Cursor Bugbot (74.5% accuracy, 77.42% F1), Claude Code (71.5% accuracy, 62.99% F1), CodeRabbit (59.4% accuracy, 36.19% F1), and Semgrep CE (56.9% accuracy, 38.26% F1). On secrets detection, 92.8% F1; vs Gitleaks (75.6%), detect-secrets (64.1%), and TruffleHog (41.2%). We use our open-source classification model for this. [2]

Full methodology and how we evaluated each tool: https://autofix.bot/benchmarks

You can use Autofix Bot interactively on any repository using our TUI, as a plugin in Claude Code, or with our MCP on any compatible AI client (like OpenAI Codex).[3] We’re specifically building for AI coding agent-first workflows, so you can ask your agent to run Autofix Bot on every checkpoint autonomously.

Give us a shot today: https://autofix.bot. We’d love to hear any feedback!

---

[1] https://github.com/ossf-cve-benchmark/ossf-cve-benchmark

[2] https://huggingface.co/deepsource/Narada-3.2-3B-v1

[3] https://autofix.bot/manual/#terminal-ui

Comments

nickphx•1h ago
"shifted bottleneck to code review"... understatement of decade.
_pdp_•54m ago
What is the difference between this and let's say Claude Code using something like semgrep as a tool?

Also I don't think this tool should be in the developer flow as in my experience it is unlikely to run it on the regular. It should be something that is done as part of the QA process before PR acceptance.

I hope this helps and good luck.

dolftax•32m ago
On the OpenSSF CVE Benchmark[1], Semgrep CE hits 56.97% accuracy vs our 81.21%, and nearly 3x higher recall (75.61% vs 26.83%).

On when to run it, fair point. Autofix Bot is currently meant for local use (TUI, Claude Code plugin, MCP). We're integrating this pipeline into DeepSource[2], which will have inline comments in pull requests, that fits the QA/pre-merge flow you're describing.

That said, if you're using AI agents to write code, running it at checkpoints locally keeps feedback tight.

Thanks for the feedback!

[1] https://github.com/ossf-cve-benchmark/ossf-cve-benchmark

[2] https://deepsource.com/

ramoz•10m ago
These tools are competing with the frontier lab’s own agent architecture and the likes of CI platforms (copilot pr review). Developers will not install more than 1-2 coding interfaces. These type of tools will need to invent something novel outside of formulaic RAG and in turn hope to get acquired by a lab.
tarun_anand•21m ago
Congratulations!! Anchoring is important. What about other parts of the code review like coding guidelines, perf issues etc?
dolftax•10m ago
We flag performance issues today alongside security and code quality. We're working on respecting AGENTS.md, detecting code complexity (AI generated code tends toward verbose, tangled logic), and letting teams define custom coding guidelines.

Show HN: Tripwire: A new anti evil maid defense

https://github.com/fr33-sh/Tripwire
29•DoctorFreeman•1d ago•18 comments

Show HN: Autofix Bot – Hybrid static analysis and AI code review agent

18•sanketsaurav•18h ago•6 comments

Show HN: Sim – Apache-2.0 n8n alternative

https://github.com/simstudioai/sim
216•waleedlatif1•22h ago•50 comments

Show HN: Open-source, offline voice typing and live captions for Android

https://github.com/notune/android_transcribe_app
2•leumon•1h ago•0 comments

Show HN: Epstein's emails reconstructed in a message-style UI (OCR and LLMs)

https://github.com/Toon-nooT/epsteins-phone-reconstructed
24•toon-noot•2h ago•1 comments

Show HN: 360css CSS library inspired by the xbox360 dashboard

https://tarmo1.github.io/360css/
3•Tarmo362•3h ago•0 comments

Show HN: Jottings; Anti-social microblog for your thoughts

https://jottings.me/
14•vishalvshekkar•7h ago•10 comments

Show HN: Local Privacy Firewall-blocks PII and secrets before ChatGPT sees them

https://github.com/privacyshield-ai/privacy-firewall
103•arnabkarsarkar•2d ago•52 comments

Show HN: Workmux – Parallel development in tmux with Git worktrees

https://github.com/raine/workmux
4•rane•4h ago•0 comments

Show HN: A minimum viable Markov gibberish generator in 32 lines of Python

https://github.com/susam/mvs
3•susam•5h ago•0 comments

Show HN: Wirebrowser – A JavaScript debugger with breakpoint-driven heap search

https://github.com/fcavallarin/wirebrowser
64•fcavallarin•2d ago•15 comments

Show HN: An endless scrolling word search game

https://endless-wordsearch.com
23•marcusdev•1d ago•13 comments

Show HN: GPULlama3.java Llama Compilied to PTX/OpenCL Now Integrated in Quarkus

22•mikepapadim•23h ago•5 comments

Show HN: Gotui – a modern Go terminal dashboard library

https://github.com/metaspartan/gotui
37•carsenk•18h ago•13 comments

Show HN: A 2-row, 16-key keyboard designed for smartphones

https://k-keyboard.com/Why-QWERTY-mini
79•QWERTYmini•1d ago•67 comments

Show HN: Gemini Pro 3 imagines the HN front page 10 years from now

https://dosaygo-studio.github.io/hn-front-page-2035/news
3314•keepamovin•3d ago•957 comments

Show HN: Automated license plate reader coverage in the USA

https://alpranalysis.com
235•sodality2•1d ago•143 comments

Show HN: AlgoDrill – Interactive drills to stop forgetting LeetCode patterns

https://algodrill.io
177•henwfan•3d ago•105 comments

Show HN: I want to democratise Bloomberg Terminal

https://www.aulico.com/workspaces/new
2•lalalerodas•11h ago•1 comments

Show HN: I built a system for active note-taking in regular meetings like 1-1s

https://withdocket.com
173•davnicwil•3d ago•130 comments

Show HN: Search the lyrics of 500 HÖR Berlin techno sets

https://hor.greg.technology/
2•gregsadetsky•13h ago•2 comments

Show HN: The world's least deterministic programming language

https://github.com/andr3wV/VibeScript
4•andr3wV•13h ago•0 comments

Show HN: Forecaster Arena – Testing LLMs on real events with prediction markets

https://forecasterarena.com/
3•setrf•14h ago•0 comments

Show HN: A Real-Time 4D Fractal Explorer in the Browser Using WebGPU

https://bryanjj.github.io/nebula/
2•bryan0•17h ago•1 comments

Show HN: A lightweight Git history explorer written in Go

https://github.com/thiagokokada/gitk-go
3•kokada•17h ago•0 comments

Show HN: DuckDB for Kafka Stream Processing

https://sql-flow.com/docs/tutorials/intro/
76•dm03514•3d ago•13 comments

Show HN: I used Gemini 3 to turn 42 books into interactive webpages in 2 weeks

https://www.vibary.art/en
8•Rand_cat•20h ago•2 comments

Show HN: MCPShark – Traffic Inspector for Model Context Protocol

33•mywork-dev•1d ago•4 comments

Show HN: Mycodosing.wtf – Research‑Informed Psychedelic Dosage Calculator

https://mycodosing.wtf/
5•ing-norante•21h ago•2 comments

Show HN: Open-source UI components for apps that run inside ChatGPT

https://ui.manifest.build
3•stosssik•21h ago•0 comments