Tokens are expensive, and companies want to make sure they are being used effectively. Fair enough. The problem is that Weave provides nearly zero insight into how its metrics work. They do tell you they use AI to do the analysis, so in practice it is likely LLMs analyzing the usage of other LLMs.
If your employer sets this up, your work with AI is reduced to a single number. From firsthand experience, these numbers are used to make firing decisions. The number is, laughably, presented as "Code output" xx.x / week. It doesn't tell you what the unit is or how they arrive at it.
They provide no assurances or proof that their model isn't biased. They claim their dataset is built from expert-level PRs, but they don't say what the dataset is or exactly how it was compiled. Does it cover native and non-native English speakers? Does it cover young developers' work as well as older developers' work? Did it pick up other biases from usernames, git commit email addresses, etc.?
Who knows!? But guess you're fired anyway because computer number go down.
The strawberry on top is that in their [blog posts][1] they claim both 94% accuracy and a 0.94 correlation while referring to the same thing. They don't even know the statistical difference between accuracy and correlation, yet their model gets to decide how good of a vibe coder you are.
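To be concrete about why that conflation matters: accuracy and Pearson correlation are different statistics and need not agree. A toy Python sketch (with made-up binary labels, nothing to do with Weave's actual data) shows the same predictions scoring 0.75 on one and 0.5 on the other:

```python
# Hypothetical ground-truth labels and classifier predictions.
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 1, 0, 0, 0]

# Accuracy: fraction of exact matches.
accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

# Pearson correlation: covariance normalized by standard deviations.
n = len(y_true)
mean_t = sum(y_true) / n
mean_p = sum(y_pred) / n
cov = sum((t - mean_t) * (p - mean_p) for t, p in zip(y_true, y_pred)) / n
std_t = (sum((t - mean_t) ** 2 for t in y_true) / n) ** 0.5
std_p = (sum((p - mean_p) ** 2 for p in y_pred) / n) ** 0.5
corr = cov / (std_t * std_p)

print(accuracy)  # 0.75
print(corr)      # 0.5
```

Same model, same data, two different numbers. Quoting "94%" and "0.94" as one figure means at least one of them is wrong.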
Weave's investors are:
- Moonfire — lead investor (European early-stage VC)
- Burst Capital — co-lead (early-stage VC)
- Y Combinator — participant (the accelerator program; 25% of their current batch uses Weave, per Weave's own blog)
- Pioneer Fund — participant (per PitchBook)
- Roar Ventures — participant (per PitchBook)
[0]: https://workweave.dev/
[1]: https://web.archive.org/web/20260212040803/https://workweave.dev/blog/weave-vs-linearb