samfundev•9h ago
Glad to see that they brought in humans to validate and patch vulnerabilities, though I really wish they had linked to the actual patches. Here's what I could find:
https://cgit.ghostscript.com/cgi-bin/cgit.cgi/ghostpdl.git/c...
https://github.com/OpenSC/OpenSC/pull/3554
https://github.com/dloebl/cgif/pull/84
shoo•4m ago
Yeah, having a layer of human experts to sanity-check findings and weed out hallucinated false positives seems like an important part of this process:
> To ensure that Claude hadn’t hallucinated bugs (i.e., invented problems that don’t exist, a problem that increasingly is placing an undue burden on open source developers), we validated every bug extensively before reporting it. [...] for our initial round of findings, our own security researchers validated each vulnerability and wrote patches by hand. As the volume of findings grew, we brought in external (human) security researchers to help with validation and patch development.
Based on the experiences shared by curl's maintainers over the last year, I'd suggest the "growing risk of LLM-discovered [security issues]" is primarily maintainers being buried under a deluge of low-effort, LLM-hallucinated false-positive security reports whose submitters copy-paste LLM output without validation.
tznoer•1h ago
Grepping for strcat() is at the "forefront of cybersecurity"? The other one, which applied a GitHub comment to a different location, doesn't look too difficult either.
Everything that comes out of Anthropic is just noise, but their marketing team is unparalleled.
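For context, the kind of pattern a strcat() grep turns up looks roughly like this (a hypothetical sketch, not code from any of the linked patches), along with the usual bounded fix:

    /* Hypothetical example of the bug class, not taken from the linked patches. */
    #include <stdio.h>
    #include <string.h>

    /* Unbounded: overflows 'out' whenever dir + "/" + name exceeds its size. */
    void build_path_bad(char *out, const char *dir, const char *name) {
        strcpy(out, dir);
        strcat(out, "/");
        strcat(out, name);
    }

    /* Bounded: snprintf truncates instead of writing past the end of 'out'. */
    void build_path_safer(char *out, size_t outsz, const char *dir, const char *name) {
        snprintf(out, outsz, "%s/%s", dir, name);
    }

    int main(void) {
        char buf[16];
        build_path_safer(buf, sizeof buf, "/very/long/directory", "name.gif");
        puts(buf);   /* truncated, but no out-of-bounds write */
        return 0;
    }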
octoberfranklin•16m ago
This reads like an advertisement for Anthropic, not a technical article.
cyanydeez•7m ago
Is there a Polymarket on the first billion-dollar AI company to go to $0 through its own insecure model deployment?