frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Making frontier cybersecurity capabilities available to defenders

https://www.anthropic.com/news/claude-code-security
41•surprisetalk•1h ago

Comments

upghost•1h ago
Anakin: I'm going to save the world with my AI vulnerability scanner, Padme.

Padme: You're scanning for vulnerabilities so you can fix them, Anakin?

Anakin: ...

Padme: You're scanning for vulnerabilities so you can FIX THEM, right, Annie?

czbond•1h ago
Definitely will be a fight against bad actors pulling bulk open source software projects, npm packages, etc and running this for their own 0 days.

I hope Anthropic can place alerts for their team to look for accounts with abnormal usage pre-emptively.

tptacek•1h ago
You want frontier models to actively prevent people from using them to do vulnerability research because you're worried bad people will do vulnerability research?
czbond•1h ago
Not at all. I was suggesting if an account is performing source code level request scanning of "numerous" codebases - that it could be an account of interest. A sign of mis-use.

This is different than someones "npm audit" suggesting issues with packages in a build and updating to new revisions. Also different than iterating deeply on source code for a project (eg: nginx web server).

tptacek•1h ago
I don't understand the joke here.
drcongo•1h ago
I thought they'd noticed how many of my Claude tokens I've been burning trying to build defences against the AI bot swarms. Sadly not.
reconnecting•19m ago
Is it only crawlers or bots that abuse your product?

We have been developing our own system (1) for several years, and it's built by engineers, not Claude. Take a look — maybe it could be helpful for your case.

1. https://github.com/tirrenotechnologies/tirreno

deadbabe•1h ago
Solve a problem and everyone praises you.

No one knows you also caused that problem.

nadis•1h ago
> "Rather than scanning for known patterns, Claude Code Security reads and reasons about your code the way a human security researcher would: understanding how components interact, tracing how data moves through your application, and catching complex vulnerabilities that rule-based tools miss."

Fascinating! Our team has been blending static code analysis and AI for a while and think it's a clever approach for the security use case the Anthropic team's targeting here.

bink•1h ago
I hope this is better than their competitors products. So far I've been underwhelmed. They basically just find stuff that's already identified by static analysis tooling and toss in a bunch of false positives from the AI scans.
david_shaw•1h ago
There's a lot of skepticism in the security world about whether AI agents can "think outside the box" enough to replicate or augment senior-level security engineers.

I don't yet have access to Claude Code Security, but I think that line of reasoning misses the point. Maybe even the real benefit.

Just like architectural thinking is still important when developing software with AI, creative security assessments will probably always be a key component of security evaluation.

But you don't need highly paid security engineers to tell you that you forgot to sanitize input, or you're using a vulnerable component, or to identify any of the myriad issues we currently use "dumb" scanners for.

My hope is that tools like this can help automate away the "busywork" of security. We'll see how well it really works.

tptacek•55m ago
I am seeing something closer to the opposite of skepticism among vulnerability researchers. It's not my place to name names, but for every Halvar Flake talking publicly about this stuff, there are 4 more people of similar stature talking privately about it.
awestroke•50m ago
Claude Opus 4.6 has been amazing at identifying security vulnerabilities for us. Less than 50% falae positives.
ievans•1h ago
Not super surprising that Anthropic is shipping a vulnerability detection feature -- OpenAI announced Aardvark back in October (https://openai.com/index/introducing-aardvark/) and Google announced BigSleep in Nov 2024 (https://cloud.google.com/blog/products/identity-security/clo...).

The impact question is really around scale; a few weeks ago Anthropic claimed 500 "high-severity" vulnerabilities discovered by Opus 4.6 (https://red.anthropic.com/2026/zero-days/). There's been some skepticism about whether they are truly high severity, but it's a much larger number than what BigSleep found (~20) and Aardvark hasn't released public numbers.

As someone who founded a company in the space (Semgrep), I really appreciated that the DARPA AIxCC competition required players using LLMs for vulnerability discovery to disclose $cost/vuln and the confusion matrix of false positives along with it. It's clear that LLMs are super valuable for vulnerability discovery, but without that information it's difficult to know which foundation model is really leading.

What we've found is that giving LLM security agents access to good tools (Semgrep, CodeQL, etc.) makes them significantly better esp. when it comes to false positives. We think the future is more "virtual security engineer" agents using tools with humans acting as the appsec manager. Would be very interested to hear from other people on HN who have been trying this approach!

Death to Scroll Fade

https://dbushell.com/2026/01/09/death-to-scroll-fade/
1•birdculture•31s ago•0 comments

Your Transformer Is secretly an EOT Solver

https://elonlit.com/scrivings/your-transformer-is-secretly-an-eot-solver/
1•Anon84•1m ago•0 comments

Raison – Version control and real-time deployment for AI prompts

https://raison.ist
1•arbayi•2m ago•0 comments

Why my father ran the same small business for 30 years

https://siliconcanals.com/j-a-im-in-my-40s-and-i-finally-understand-why-my-father-ran-the-same-sm...
1•happy-go-lucky•3m ago•0 comments

Design time vs. Run time in Agentic engineering

https://twitter.com/taherchhabra/status/2024935862275113444
1•taherchhabra•4m ago•0 comments

Intuitive Intro to Reinforcement Learning for LLMs

https://mesuvash.github.io/blog/2026/rl_for_llm/
1•mesuvash•4m ago•0 comments

Komoot's decline after the Bending Spoons acquisition

https://usernebula.com/report/komoot-case-study
1•samberry•6m ago•0 comments

I built an agent that reads Jira tickets and opens pull requests automatically

https://github.com/ErezShahaf/Anabranch
1•ErezShahaf•8m ago•1 comments

Show HN: I built a Chrome extension to predict sun vs. shade for stadium seats

https://getsunscreen.com
1•evankaye•9m ago•0 comments

Show HN: pi.dev statusbar – macOS statusbar app for live pi agent status

https://github.com/jademind/pi-statusbar
1•jademind•10m ago•0 comments

Tomas Vondra on Talking Postgres: Why it's fun to hack on Postgres performance

https://talkingpostgres.com/episodes/why-its-fun-to-hack-on-postgres-performance-with-tomas-vondra
1•clairegiordano•12m ago•0 comments

How Reblogs Work

https://www.tumblr.com/engineering/809095477398323200/how-reblogs-work
1•Tomte•12m ago•0 comments

Permacomputing Principles

https://permacomputing.net/principles/
1•MindGods•12m ago•0 comments

"Million-year-old" fossil skulls from China are far older–and not Denisovans

https://arstechnica.com/science/2026/02/new-dates-on-chinese-fossils-raise-question-of-how-many-t...
1•alsetmusic•12m ago•0 comments

Reddit Ads support is leaking PII and actively crossing user sessions

2•arashvakil•14m ago•1 comments

Hacked my chess ELO ranking as a beginner, went from 0-700 in 12 sessions

1•smanna92•14m ago•0 comments

Skillflag: CLI flag convention for listing and installing agent skills

https://github.com/osolmaz/skillflag
1•hosolmaz•15m ago•0 comments

Embrace Your Laziness in the Age of AI

https://matthiasplappert.com/blog/2026/laziness-in-the-age-of-ai
2•cakefork•16m ago•0 comments

Cloudflare outage affecting many services

https://downdetector.co.uk/
1•andycloke•16m ago•0 comments

AWS outages caused by AI coding bot blunder, report claims

https://www.tomshardware.com/tech-industry/artificial-intelligence/multiple-aws-outages-caused-by...
4•strict9•18m ago•0 comments

Show HN: BeadHub, Beads-based coordination for multiple coding agents

https://github.com/beadhub/beadhub
1•juanre•21m ago•0 comments

Georgian wine culture dates back, uninterrupted, approximately 8k years

https://www.wsetglobal.com/knowledge-centre/blog/2023/july/05/exploring-georgian-wine-history-gra...
2•Anon84•22m ago•0 comments

Fall-from-grace: A prompt engineering functional programming language

https://github.com/Gabriella439/grace
1•bwestergard•25m ago•0 comments

Turns Out There Was Voter Fraud in Georgia–By Elon Musk

https://newrepublic.com/post/206857/georgia-voter-fraud-elon-musk
2•mandeepj•26m ago•2 comments

The AI security nightmare is here and it looks suspiciously like lobster

https://www.theverge.com/ai-artificial-intelligence/881574/cline-openclaw-prompt-injection-hack
2•cschick•27m ago•1 comments

YouTube tests 'conversational AI' on TV apps

https://9to5google.com/2026/02/19/youtube-tv-conversational-ai-test/
1•geox•29m ago•0 comments

Exploring Linux on a LoongArch Mini PC

https://www.wezm.net/v2/posts/2026/loongarch-mini-pc-m700s/
4•naves•30m ago•0 comments

Interview with Steve Klabnik

https://alexalejandre.com/programming/steve-klabnik-interview/
3•veqq•31m ago•0 comments

Radical Forces in Germany (1931)

https://www.foreignaffairs.com/articles/germany/1931-04-01/radical-forces-germany
2•jjmarr•33m ago•1 comments

Venting Doesn't Reduce Anger, but Something Else Does, Review Finds

https://www.sciencealert.com/venting-doesnt-reduce-anger-but-something-else-does-review-finds
1•PaulHoule•33m ago•0 comments