Devin Review: AI to Stop Slop

https://cognition.ai/blog/devin-review
29•agtestdvn•2h ago

Comments

devin•1h ago
Apropos of nothing: I hate that they used my name for their product.
Y_Y•1h ago
People of the same name should have a union or something. It's apparently fine to annoy everyone in the world called Alexa, probably just because the world's Alexas can't act collectively.
swyx•1h ago
hahahah oh no
blibble•1h ago
so they graciously accept that "AI" agents generate slop (by the very title of the post)

so why would they be any better at reviewing?

Sharlin•1h ago
Easy! You just need a review-reviewer AI to check the review AI's output.
qayxc•1h ago
Bots all the way down...
gaodrew•1h ago
AI reviewing AI does lead to a cycle of ouroboros slop. That’s why Devin Review is actually more of a UI that makes code easier for you to read, unlike other code review tools that try to do all the reviewing themselves.

Disclaimer: I work here

rednafi•1h ago
I’m all in for building more intuitive UIs to make the review process less cumbersome. But with the current capabilities of LLMs, under no circumstances should we allow AI to be the final judge of whether something should be merged into trunk.

Code review is the last line of defense we have against our systems being invaded by the massive amount of slop that’s getting generated left and right.

Instead of trying to automate the code review process, maybe we should spend more energy on making the scaffolding around it better: better diff tools, semantically grouped files (as Devin mentioned), and better UI for large diffs (GitHub’s UI is horrible for anything beyond a thousand lines).

gaodrew•1h ago
Agreed!
ninjha•1h ago
(I work at Cognition, opinions my own etcetc)

True! Devin Review doesn’t make the kind of judgements you mention, it just does its best to find bugs and help you understand the code faster. I managed to review a PR on an airplane (without starlink) with it earlier this week lol

rednafi•1h ago
Yeah, I wasn’t alluding to Devin reviewing and merging the changelog. This was more of a general statement, since a lot of code review tools seem to get this part wrong.

A lot of energy is being spent on making reviews faster, when reviews are intentionally meant to scale sublinearly. The goal should be: how can we make the process more convenient and less error-prone?

illnewsthat•1h ago
> Devin Review is free and available for PRs on regular GitHub repositories (not GitHub Enterprise). Public PRs don’t require a Devin account.

I guess the tokens are cheap enough or their pockets are deep enough, but this still seems surprising. I guess they can chalk it up to a marketing cost.

briga•1h ago
I can foresee a future of induced demand, where by making PRs "easier" to review, you will end up with way more PRs to review, leaving PR backlogs as backed up as ever. Except now dev teams will have trust-me-bro LLM reviews convincing them that they don't actually need to do full code reviews on code they're putting into production. What could go wrong?
gaodrew•1h ago
Very good point. So when we designed this we actually had that in mind. Devin Review is not supposed to replace your judgment and “give the answer”. It just organizes the PR in a way that makes it way easier for YOU to understand.
briga•1h ago
I was being partly facetious and I think this is probably the way things are going. I guess it's just hard to stomach that devs will end up relying on these tools more than their own intuition. But I suppose that ship has sailed already for a lot of people.
servercobra•1h ago
Overall I've been really impressed with Devin. IMO it's the best tool for AI generating features if you know what you're looking for, have patterns to follow, etc. I suspect the context they build about your project helps a ton.

I was literally just working on a system, using Devin to do the review no less, to add a bunch of the rules we have that are outside of linting's capability to tackle the same kind of thing. Tools like Copilot and Qodo have very high noise ratios, but do occasionally catch legit bugs. Devin Review could be a great complement, and hopefully they'll make it so we can add our own rules soon.

xnx•1h ago
"Devin" has negative brand value.
hrimfaxi•1h ago
Why?
esafak•1h ago
I think the OP's alluding to the initial hype about Devin replacing software engineers.
gaodrew•1h ago
I work at Cognition, lmk any feedback, will share with the team!
devin•48m ago
I don’t like the name of the product.
libraryofbabel•1h ago
Devin? Now that's a name I've not heard in a long time...a lonnng time.

Seriously, in this age of Claude Code and Codex, does anyone use Devin, or even know someone who does? Do they have any users at all?

Ironically, their product has probably got massively better in the last couple of years, because the underlying LLMs got massively better at coding and long-context tasks. But that doth not a successful business model make, and unless you’re Cursor (and even then I’m not so sure) this is a very very hard space to succeed in without owning your own frontier model (i.e., being Anthropic, OpenAI, or Google).

esafak•1h ago
I use their deepwiki often.
ninjha•1h ago
yeah there is apparently not a lot of overlap between hn/twitter users and devin users, and we don’t really do marketing campaigns either

logos on website if you want to see some of our customers lol

Der_Einzige•1h ago
We wrote the actual paper on “stopping slop”

https://arxiv.org/abs/2510.15061

snowmobile•1h ago
> code review—not code generation—is now the bottleneck to shipping great products.

Unsurprising, since a human still needs to understand and verify the code, be that as it's written or as it's reviewed. AI's only managed to move the brainpower required from the fun part to the tedious and boring part.

sjajshha•43m ago
Eh, code review has _always_ been the bottleneck (both for the author and any other reviewers). Pulling the agent slot machine for anything remotely challenging is just inflicting pain for no reason on yourself - if quality matters. If not, let it rip.

Otherwise, you’re gonna have to read every line (including those not in the diff) anyways. Typing it out - or getting the AI to do it at a speed you can comprehend - isn’t a meaningful slowdown at all.

joshstrange•1h ago
I wanted to look into their pricing for Devin+ and I have to say, ACUs are entirely too opaque/confusing/complicated. The entire description of them is shrouded in mystery. And this part confuses me even more:

> Aside from the few ACUs required to keep the Devin VM running, Devin will not consume ACUs when:

> Waiting for your response

> Waiting for a test suite to run

> Setting up and cloning repositories

Ok, that kind of makes sense, but what does "the few ACUs required to keep the Devin VM running" mean? These cost $2.50/ea so "a few" means $5+ and on what time scale? Daily? Monthly?

The lowest plan comes with $20 of ACUs, but they don't list anywhere how far that gets you or even rough examples. I guess if you want to kick the tires, $20 isn't a crazy amount to test it out yourself, and maybe I'm just not the target market (I kind of feel like I am though?), but I wish their pricing made sense.

samyok•11m ago
Have been using Devin Review for a little bit, and I think it's the first of the many "code review" LLM-bots that have come out that doesn't actively feel like "slop". Seems like they must have some integrations with codemaps or deepwiki (the Cognition products I use most often) to power the insights.

My favorite feature has been organizing the files by "logical flow" rather than alphabetically, which feels like such a tiny change but it's such a huge QOL upgrade. A lot of the features seem inspired by Graphite, which is also really enjoyable.
