frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

CVE-Bench: testing LLM agents on real-world vulnerability patches

https://giovannigatti.github.io/cve-bench/
4•logickkk1•41m ago

Comments

david_shaw•26m ago
The problem with Mythos and Glasswing related hype is that finding vulnerabilities isn't the problem for most organizations. It's great that Mythos and similar models can find vulnerabilities that remained undetected (and hopefully unexploited) for years. That's valuable, especially in open source projects, but it's never been the real challenge for software companies.

The real problem is balancing the need to fix vulnerabilities with the mandate of shipping new products and features. At every organization I've worked for or with, this has been the natural friction point. That's good: Product should make customers happy, and Security should keep the customers and their data safe.

Ultimately, the whole business should share these goals: everyone should strive for a resilient, useful product shipped quickly that delights customers. Easier said than done, but the friction should be tactical ("how do we spend engineering resources?") rather than strategic ("are security fixes important? do we care?").

Which is why I'm much more interested in automated (or semi-automated) PRs to actually fix discovered vulnerabilities rather than just identify them. But, as this project implies, it's not always that simple. It's easy to fix vulnerabilities if you don't care about breaking other functionality.

In my opinion, it's currently still necessary to have a human developer in the loop to make sure functionality in product is maintained, and potentially security in the loop to make sure the vulnerability is actually fixed and not just obfuscated.

Once this technology is sufficiently advanced -- and I think we're getting close -- my hope is that developer and security time will be spent thinking about resilient software design and architecture, not code-level vulnerabilities.

We'll see where it goes.

CachyOS Delivers Lead over Arch Linux Pop_OS and Ubuntu on System76 Thelio Major

https://www.phoronix.com/review/cachyos-thelio-major
1•Bender•1m ago•0 comments

Knowa – Open-Source LLM Context Optimizer

https://github.com/zzorphcreator/knowa
1•zzorphcreator•2m ago•0 comments

Ruminating about mutable value semantics

https://www.scattered-thoughts.net/writing/ruminating-about-mutable-value-semantics/
1•tripdout•2m ago•0 comments

Troops to get free tickets to White House UFC event, must meet weight standards

https://www.cnn.com/2026/05/29/politics/us-military-ufc-white-house-weight
2•Bender•3m ago•0 comments

Otari: Own Your AI Stack

https://blog.mozilla.ai/otari-own-your-ai-stack/
2•mwheeler•7m ago•0 comments

QEMU may allow AI-generated contributions in non-critical areas

https://www.phoronix.com/news/QEMU-Patch-Allows-Some-AI
1•Lihh27•8m ago•0 comments

The Internet Has Become Too American to Trust

https://thewalrus.ca/the-internet-has-become-too-american-to-trust/
2•Teever•10m ago•0 comments

A Postgres-native durable workflow system

https://earendil-works.github.io/absurd/
1•yogthos•11m ago•0 comments

The Last Technical Interview

https://steve-yegge.medium.com/the-last-technical-interview-bc13ddcf4564
1•headalgorithm•11m ago•0 comments

Pick of the 10 Hottest West Coast Startups in (2013)

https://thenextweb.com/news/west-coast-startups-to-look-out-for-in-2013
1•Caarticles•12m ago•0 comments

The California State Assembly Has Passed the 'Protect Our Games Act'

https://www.invenglobal.com/articles/22330/stop-killing-games-movement-gains-momentum-california-...
16•TechTechTech•14m ago•1 comments

What "Memory Compiler" Actually Means: From Bitcells to GDS Tiling

https://thecloudlet.github.io/technical/compiler/memory-compiler/
1•matt_d•16m ago•0 comments

Show HN: MigraDiff – maintained fork of migra (PostgreSQL schema diff)

https://github.com/migradiff/migra
1•lateos-ai•16m ago•0 comments

Show HN: 3 of Minutes of AI Anime Based on Korean Comics [video]

https://www.youtube.com/watch?v=OfkTGCVk-RQ
1•JimsonYang•16m ago•0 comments

Show HN: I brought back Airbnb categories

https://www.vibebnb.fyi/
1•giulioco•16m ago•0 comments

I Built RuntimeWire: A One-Person, Mostly-Autonomous AI Newsroom

https://blog.ryanmerket.com/how-i-built-runtimewire-a-one-person-mostly-autonomous-ai-newsroom-99...
1•ryanmerket•17m ago•0 comments

Paint.NET developer gets "paint.net" domain from scammers after 22 years

https://twitter.com/i/status/2060397901650825238
5•notRobot•17m ago•1 comments

MCP: defending the runtime layer of agent security

https://arcis-website.pages.dev/blog/posts/defending-agent-tool-calls
1•gagancm•19m ago•0 comments

A Theory of Everything Is Revolutionizing the Democratic Party

https://www.theatlantic.com/ideas/2026/05/antitrust-theory-barry-lynn/687287/
2•whatisabcdefgh•23m ago•0 comments

"Vibe coding" is not a metric

https://www.coreyguitar.com/blog/18/vibe-coding/
1•coreycr•26m ago•0 comments

Cybersecurity challenge: be nice to each other [IMPOSSIBLE]

https://sdomi.pl/weblog/29-please-do-better-thanks/
1•ambigious7777•27m ago•0 comments

AI and the Courts – A Cautionary Tale

https://www.bailii.org/ew/cases/EWHC/Ch/2026/1199.html
1•multjoy•28m ago•2 comments

Show HN: A 24h grid where one tap on a distracting app voids the whole hour

https://apps.apple.com/us/app/oh-my-hours/id6760450002
2•mindfulbun•30m ago•0 comments

Schrödinger's Kittens Are All Grown Up

https://nautil.us/schrodingers-kittens-are-all-grown-up-1281010
3•bookofjoe•31m ago•0 comments

Show HN: Tiny-vLLM – high performance LLM inference engine in C++ and CUDA

https://github.com/jmaczan/tiny-vllm
5•yu3zhou4•31m ago•0 comments

Microsoft 0-day feud escalates as researcher threatens another exploit dump

https://www.theregister.com/security/2026/05/28/microsoft-0-day-feud-escalates-as-researcher-thre...
9•Cider9986•32m ago•3 comments

Lohner-Porsche

https://en.wikipedia.org/wiki/Lohner%E2%80%93Porsche
1•lqet•33m ago•0 comments

Agents-Collab.md – A live handoff protocol for multi-agent projects

https://github.com/Rlealbarili/Agents-Collab.md
1•Rlealbarili•33m ago•0 comments

How the Community Trained Gemma to "Think" with Tunix and TPUs

https://developers.googleblog.com/how-the-community-trained-gemma-to-think-with-tunix-and-tpus/
1•simonpure•34m ago•0 comments

Ask HN: What was the best decision you made in your career?

5•chistev•36m ago•1 comments