frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Show HN: See what your AI agents do under the hood

https://pingpulsehq.com
1•shafeeq2207•47s ago•0 comments

EPA to repeal its own conclusion that greenhouse gases warm the planet

https://www.nbcnews.com/science/climate-change/epa-to-repeal-endangerment-finding-climate-change-...
1•geox•57s ago•0 comments

Can you trust LastPass in 2026? Inside the quest to rebuild its security culture

https://www.zdnet.com/article/lastpass-2026-rebuilding-trust-ceo-interview/
3•arusahni•4m ago•0 comments

Show HN: Z-Image Base – Fast AI Image Generator (Open-Source, Free Tier)

https://z-imagebase.com/
1•chengai1106•5m ago•0 comments

When the Competition Is Down the Hall

https://k2xl.substack.com/p/when-the-competition-is-down-the
1•k2xl•5m ago•0 comments

The Banality of MAGA Evil

https://paulkrugman.substack.com/p/the-banality-of-maga-evil
4•rbanffy•6m ago•0 comments

Show HN: Onlybots.cam

https://onlybots.cam
1•m0rtyn•6m ago•0 comments

PostmarketOS at FOSDEM 2026 and Hackathon

https://postmarketos.org/blog/2026/02/10/fosdem-and-hackathon/
1•birdculture•6m ago•0 comments

How We Built the Fastest Kimi K2.5 on Artificial Analysis

https://www.baseten.co/blog/how-we-built-the-fastest-kimi-k2-5-on-artificial-analysis/
1•philipkiely•7m ago•0 comments

The Budget and Economic Outlook: 2026 to 2036

https://www.cbo.gov/publication/61882
1•mraniki•8m ago•1 comments

Web-Git-sum – Git is not GitHub

https://mitxela.com/projects/web-git-sum
1•moebrowne•12m ago•0 comments

Show HN: MEVA, a desktop Markdown reader for AI-generated docs

https://usemeva.com/
1•ss_meva•13m ago•0 comments

Trends in Prevalence of Autism by Adaptive and Intellectual Functioning Levels

https://onlinelibrary.wiley.com/doi/10.1002/aur.70167
1•hn_acker•14m ago•1 comments

Mamdani Hires Groundbreaking Computer Scientist as Chief Tech Officer

https://www.nytimes.com/2026/02/10/nyregion/mamdani-lisa-gelobter-gif.html
9•leephillips•15m ago•0 comments

Ask HN: Why electronics are still so unrecyclable?

2•alexandrehtrb•15m ago•0 comments

Stablecoins for Skeptics

https://news.alvaroduran.com/p/stablecoins-for-skeptics
1•ohduran•16m ago•1 comments

The Truth About No-KYC Crypto Cards, from Someone Who Ran One

https://twitter.com/defyneric/status/2021116183898886201
1•CrazyRobot•16m ago•0 comments

Who's the Agent Now?

https://danturkel.com/2026/02/11/agents.html
1•daturkel•16m ago•0 comments

Freenginx 1.29.5 Release

https://freenginx.org/en/CHANGES
1•neustradamus•18m ago•0 comments

Show HN: I built a tool to help generate short form videos

https://evokescenes.com/
1•delayedrelease•21m ago•2 comments

Show HN: SPICEBridge – MCP server for AI circuit design via ngspice

https://github.com/clanker-lover/spicebridge
1•clanker-lover•21m ago•0 comments

Blender source code was 9 files in January-8-1994

https://files.mastodon.social/media_attachments/files/115/825/585/900/044/589/original/b0c7ba495a...
2•marcodiego•21m ago•0 comments

The temporary closure of airspace over El Paso has been lifted

https://twitter.com/FAANews/status/2021583720465969421
2•lultimouomo•23m ago•1 comments

Sabotage Risk Report: Claude Opus 4.6 [pdf]

https://www-cdn.anthropic.com/f21d93f21602ead5cdbecb8c8e1c765759d9e232.pdf
1•rootforce•24m ago•0 comments

Chowla conjecture on the minimum of a cosine series

https://www.johndcook.com/blog/2026/02/07/chowla/
1•ibobev•24m ago•0 comments

Fibonacci numbers and time-space tradeoffs

https://www.johndcook.com/blog/2026/02/08/time-space-tradeoffs/
2•ibobev•24m ago•0 comments

"Have I Been Stalked" post-mortem

https://dustri.org/b/have-i-been-stalked-post-mortem.html
1•speckx•24m ago•0 comments

Computing Large Fibonacci Numbers

https://www.johndcook.com/blog/2026/02/08/computing-large-fibonacci-numbers/
2•ibobev•24m ago•0 comments

Life on Earth is lucky: A rare chemical fluke may have made our planet habitable

https://www.space.com/space-exploration/search-for-life/life-on-earth-is-lucky-a-rare-chemical-fl...
1•Brajeshwar•25m ago•0 comments

Lost Soviet Moon Lander May Have Been Found

https://www.nytimes.com/2026/02/10/science/luna-9-moon-lander-soviet.html
4•Brajeshwar•25m ago•1 comments
Open in hackernews

Autonomous Bug Bounty Agent: Reached #86 on HackerOne, DoD Triage

1•Layer_8•1h ago
Hello HN,

We’re three security researchers in Tokyo building an autonomous agent framework for authorized security testing (VDP/Bug Bounty).

We wanted to share our experimental results running this agent against live targets (as of Feb 8):

Real-World Impact: Reached #86 globally on the HackerOne VDP leaderboard (90 days).

Gov Targets: 3 vulnerabilities triaged by the U.S. Department of Defense (DoD).

Benchmark: Solved 84% of PortSwigger Web Security Academy labs autonomously.

Interestingly, we encountered an "Impact Gap": while the agent finds technically valid exploits, it often struggles to assess business criticality, leading to "Informative" closures.

We released our architecture design and safety proxy details on GitHub. We'd love to hear your thoughts on bridging this gap between technical exploitability and business impact.

URL: https://github.com/cyberprobe-ai/autonomous-pentest-agent-research

Comments

Layer_8•1h ago
Quick clarifications (to avoid ambiguity / keep this responsible): Authorized only: we run this strictly within explicit VDP/bug bounty scopes. We do not run it as a general internet crawler. Human-in-the-loop: the system drafts a report + evidence, but a human makes the final call and we never auto-submit. Scope-enforcing proxy: all outbound traffic is forced through a gate with default-deny, FQDN allowlists, method constraints, rate/concurrency caps, and full allow/deny logging. “Safe PoC” policy: we prioritize read-only verification patterns and stop on signs of instability (error spikes, account risk, unexpected side effects). We’re not sharing real-world exploit payloads here. Metrics: “84% labs solved” refers to server-side lab completion outcomes; details / breakdown are in the README. The thing we’re most interested in feedback on is the “impact gap”: how would you teach an agent to estimate business severity (or chain low-severity issues into a meaningful impact narrative) without pushing into risky/destructive testing?