frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: We scored 50k PRs with AI – what we learned about code complexity

https://gitvelocity.dev
9•chuboy•1h ago
I'm a CTO with a ~16-person engineering team. Last year I wanted real data on what was actually shipping, not guesswork or story point theater. So we built GitVelocity.

Every merged PR gets scored 0–100 by Claude across six dimensions: scope (0–20), architecture (0–20), implementation (0–20), risk (0–20), quality (0–15), perf/security (0–5). Six dimensions added up, then scaled by change size — a 10-line fix scores lower than a 500-line refactor even at the same complexity. Full formula at gitvelocity.dev/scoring-guide.

After scoring 50,000+ PRs across TypeScript, Python, Rust, Go, Java, Elixir, and more, some things surprised us:

Big PRs don't automatically score high. An 800-line migration with low complexity scores worse than a 200-line architectural change. Size gets you the full multiplier, but the base score still has to earn it.

You can't score well without tests. The quality dimension (0–15) won't give you points without test coverage. At similar experience levels, this was the clearest separator between engineers.

Juniors started outscoring some seniors. They adopted AI tools faster and took on harder problems. Once they could see their own scores, they aimed higher.

We score AI-generated code the same as human-written code. Code is code. An engineer who uses AI to ship more complex work faster is more productive, and their scores reflect that.

Scoring consistency was the hardest technical problem. Without reference examples anchoring each dimension, Claude's scores drifted 15+ points between runs. With 18 calibrated anchors (three per dimension at low/mid/high), we got it down to 2–4 points on the same PR.

The thing we didn't expect was behavioral. We call it the Fitbit effect — the tool doesn't make you ship better code, but seeing the score does. Engineers started referencing their own scores in 1:1s unprompted, because the numbers matched what they already felt about their work. A junior who shipped a tricky concurrency fix could point to a score that proved it wasn't "just a small PR."

We recently added team benchmarks (gitvelocity.dev/demo/benchmarks). Once you're scoring PRs, you can see how your team compares to others across the dataset — about 1,000 engineers on 60 teams so far. Headline's team ships faster than roughly 95% of them, which was nice to confirm but also made us wonder who the other 5% are. The competitive angle surprised us: teams that were skeptical about individual scores got genuinely curious once they could measure themselves against the field.

Every score is fully visible to the engineer who wrote the PR, with per-dimension breakdowns and reasoning. There's no hidden dashboard that management sees and engineers don't.

Free, BYOK (your Anthropic API key). We default to Sonnet 4.6, which scores nearly as well as Opus 4.6 at a fraction of the cost — but you can switch models if you want. Pennies per PR either way. No source code stored, diffs analyzed and discarded. Works with GitHub, GitLab, and Bitbucket.

Ask me anything about the scoring methodology, how we solved calibration, or what it was actually like rolling this out to a team.

Airlines surprise passengers with 'flights' that are buses

https://www.washingtonpost.com/travel/2026/03/26/american-airlines-flight-bus-ride/
1•mooreds•33s ago•1 comments

Optimizing Session Frequency in EEG Biofeedback

https://www.mdpi.com/1424-8220/26/7/2077
1•PaulHoule•1m ago•0 comments

Dentaku Museum

http://www.dentaku-museum.com/
1•marukodo•1m ago•0 comments

Your Claude.md Is a Wish List, Not a Contract

https://techtrenches.dev/p/your-claudemd-is-a-wish-list-not
1•gpi•2m ago•0 comments

Bitwarden Integrates with OneCLI Agent Vault

https://www.onecli.sh/blog/bitwarden-agent-access-sdk-onecli
1•sudo_chmod•2m ago•0 comments

Lessons My Mistakes Taught Me Throughout My Career

https://comuniq.xyz/post?t=895
1•01-_-•2m ago•0 comments

An NSFW Filter for Marginalia Search

https://www.marginalia.nu/log/a_134_nsfw/
2•speckx•2m ago•0 comments

Lael Wilcox on her quest to become the fastest human to cycle round the world

https://www.theguardian.com/sport/2026/mar/30/lael-wilcox-cycling-circumnavigation-world-record
1•mitchbob•3m ago•0 comments

APL Performance

https://aplwiki.com/wiki/Performance
1•tosh•3m ago•0 comments

Improve Battery Life and Performance Using Intel's Battery Life Diagnostic Tool

https://medium.com/@oarevalo/improve-battery-life-and-performance-using-the-intel-battery-life-di...
1•oarevalo•3m ago•0 comments

Show HN: A 3D Body Scan for Nine Cents – Without SMPL

https://clad.you/blog/posts/body-pipeline/
1•arkadiuss•4m ago•0 comments

FastComments Is Ready for Space

https://blog.fastcomments.com/(03-29-2026)-fastcomments-is-ready-for-space.html
1•winrid•4m ago•0 comments

A Man Making Music with His Brain Implant

https://www.wired.com/story/meet-the-man-making-music-with-his-brain-implant/
1•Brajeshwar•5m ago•0 comments

AI giants tried to storm the last human stronghold: the math olympiads

https://english.elpais.com/technology/2026-03-28/how-ai-giants-tried-to-storm-the-last-stronghold...
1•tethys•6m ago•0 comments

Why millions of seniors have suddenly lost health care coverage

https://www.washingtonpost.com/health/2026/03/28/insurance-health-care-medicare-seniors/
4•bookofjoe•6m ago•1 comments

Empty Diagonal

https://en.wikipedia.org/wiki/Empty_diagonal
1•thunderbong•6m ago•0 comments

Stacktopolis: The authentic charity CTO experience (game)

https://techfreedom.eu/stacktopolis/
1•dajbelshaw•7m ago•0 comments

Show HN: ClamBot – AI agent that runs all LLM-generated code in a WASM sandbox

https://github.com/clamguy/clambot
1•theearlyadopter•8m ago•0 comments

PostalForm lets agents place a real print-and-mail order for their owner

https://postalform.com/agents
1•bobbiechen•8m ago•0 comments

Show HN: Paseo, an open source environment for coding agents

https://github.com/getpaseo/paseo
1•boudra•9m ago•0 comments

JD Vance says aliens are 'demons' and details obsession with UFOs

https://www.theguardian.com/us-news/2026/mar/30/jd-vance-alien-ufo-are-demons
4•mitchbob•10m ago•2 comments

Software Engineers Have Been Lied To

https://www.independentengineer.co/p/software-engineers-have-been-lied-to
1•sap1enz•10m ago•0 comments

Google Unveils AppFunctions to Connect AI Agents and Android Apps

https://www.infoq.com/news/2026/03/android-appfunctions-agents/
1•Jonathanfishner•11m ago•0 comments

Aurora Climbing takes down Kilter app and refuses to hand over user data

https://www.instagram.com/reel/DWXjy28iXt9/
1•pr0phet•11m ago•0 comments

FeedVault – I organize my YT subs into topic feeds because YouTube won't

1•erikvallart•12m ago•2 comments

In Expanding de Sitter Space, Quantum Mechanics Gets More Elusive

https://www.quantamagazine.org/in-expanding-de-sitter-space-quantum-mechanics-gets-even-more-elus...
2•lschueller•13m ago•0 comments

Show HN: Diffly – A Python package to compare polars dataframes

https://github.com/Quantco/diffly
1•ekarais•14m ago•0 comments

Java AI Ecosystem Guide

https://ai4jvm.com/
2•jlward4th•14m ago•0 comments

XSpec is a unit test framework for XSLT, XQuery, and Schematron

https://github.com/xspec/xspec
1•protomolecool•16m ago•0 comments

An Irish village with phone-free children

https://www.nytimes.com/2026/03/25/realestate/ireland-cell-phones-children.html
1•gmargari•19m ago•1 comments