
AgentShield – Open benchmark of 6 AI agent security tools (537 test cases)

https://github.com/doronp/agentshield-benchmark
2•doronp•1h ago

Comments

doronp•1h ago
AgentShield is an open-source benchmark suite that tests commercial AI agent security products (Lakera Guard, LLM Guard, ProtectAI, etc.) against the same corpus under the same conditions.

537 test cases across 8 categories: prompt injection, jailbreak, data exfiltration, tool abuse, over-refusal, multi-agent security, latency, and provenance/audit.

Scoring uses a weighted geometric mean across attack categories with a standalone over-refusal penalty: blocking legitimate requests costs you points. Latency is scored inversely (sub-50ms p95 = 100, over 1s = 5).

Some findings from the first run:

- Composite scores range from ~39 to ~98. The spread is larger than I expected.
- Tool abuse detection is weak across the board. Several providers that catch >95% of prompt injections miss most unauthorized tool calls.
- Over-refusal is under-tested in the industry. One provider flags 37% of benign requests.
- Provenance verification (can the tool tell a real approval chain from a fabricated one?) is nearly absent outside provenance-native approaches.
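For readers who want to see how the scoring described in this comment could fit together, here is a minimal Python sketch. It is not the benchmark's actual scoring code. The category names, the weights, the log-linear latency interpolation between the two stated endpoints, and the multiplicative form of the over-refusal penalty are all assumptions made for illustration; only the weighted geometric mean and the two latency endpoints (sub-50ms p95 = 100, over 1s = 5) come from the comment.

```python
import math


def weighted_geometric_mean(scores, weights):
    """Weighted geometric mean of per-category scores on a 0-100 scale."""
    total_weight = sum(weights[name] for name in scores)
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    # A zero in any category drives the whole mean to zero.
    if any(value <= 0 for value in scores.values()):
        return 0.0
    log_sum = sum(weights[name] * math.log(scores[name]) for name in scores)
    return math.exp(log_sum / total_weight)


def latency_score(p95_ms):
    """Map p95 latency to a score; only the two endpoints come from the post."""
    if p95_ms < 50:
        return 100.0
    if p95_ms > 1000:
        return 5.0
    # Assumption: log-linear interpolation between (50 ms, 100) and (1000 ms, 5).
    span = math.log(1000) - math.log(50)
    fraction = (math.log(p95_ms) - math.log(50)) / span
    return 100.0 - fraction * 95.0


def composite(scores, weights, over_refusal_rate):
    """Assumption: the penalty scales the mean by the share of benign requests allowed."""
    return weighted_geometric_mean(scores, weights) * (1.0 - over_refusal_rate)
```

With equal weights, a hypothetical provider scoring 95 on prompt injection and 20 on tool abuse gets a geometric mean of about 43.6, against an arithmetic mean of 57.5. That is the property that makes a geometric mean punish one weak category harder than a plain average would.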

Disclosure: I built and maintain this benchmark. I also run https://agentguard.co/, which is one of the tested providers. AG is included in results, tested via a commit-reveal protocol with Ed25519 signatures (code in src/protocol/) rather than the standard open adapter path. I know "vendor runs own benchmark" raises eyebrows; that's why the entire corpus, scoring code, and methodology are open source under Apache 2.0. Run it yourself, verify the results, file issues if something seems off.

The test corpus, adapters, and scoring are designed to be extended. PRs for new provider adapters, novel attack test cases, and methodology improvements are welcome.

Repo: https://github.com/doronp/agentshield-benchmark
Leaderboard: https://doronp.github.io/agentshield-benchmark/
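The commit-reveal idea mentioned in the disclosure can be sketched generically as follows. This is not the code in src/protocol/. It shows only the hash-commitment half, using the Python standard library, and leaves out the Ed25519 signing step, which would need a third-party library. What exactly gets committed, and when, is an assumption here: the sketch supposes a hypothetical `results` dict of per-test verdicts that is committed before it is revealed.

```python
import hashlib
import hmac
import json
import secrets


def commit(results):
    """Return (commitment_hex, nonce) for a results dict without revealing it."""
    # The random nonce stops anyone from brute-forcing the results from the hash.
    nonce = secrets.token_bytes(32)
    # Sorted keys give a canonical encoding, so equal dicts hash identically.
    payload = json.dumps(results, sort_keys=True).encode("utf-8")
    digest = hashlib.sha256(nonce + payload).hexdigest()
    return digest, nonce


def verify_reveal(commitment_hex, nonce, results):
    """Check that the revealed nonce and results match the earlier commitment."""
    payload = json.dumps(results, sort_keys=True).encode("utf-8")
    expected = hashlib.sha256(nonce + payload).hexdigest()
    # Constant-time comparison of the two hex digests.
    return hmac.compare_digest(expected, commitment_hex)
```

In this sketch the committing party publishes `commitment_hex` first, then later reveals `nonce` and `results`. Anyone can rerun `verify_reveal` to confirm the results were not changed after the commitment was made.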

My Courses Site Is Moving to a New Home

https://blog.miguelgrinberg.com/post/my-courses-site-is-moving-to-a-new-home
1•nomdep•2m ago•0 comments

Experiments with Voice Control on Linux

https://blog.ricky0123.com/blog/voice/
1•ricky0123•3m ago•0 comments

Weekly Claw: OpenClaw community's weekly voice chat. 2/15 4PM ET

https://www.wetheclaw.org/
1•fractalnetworks•3m ago•1 comment

Goodbye Solar Panels: This Tiny Wind Turbine Is Perfect for Mobile Power

https://www.bgr.com/2093511/tiny-wind-turbine-mobile-portable-energy/
2•thelastgallon•7m ago•0 comments

The Sweet Lesson of Neuroscience

https://asteriskmag.com/issues/13/the-sweet-lesson-of-neuroscience
1•yorwba•8m ago•0 comments

A retrospective on 9 months with coding agents

https://bertolami.com/index.php?engine=blog&content=posts&detail=cost-effective-agentic-coding
3•freshtake•9m ago•0 comments

Researchers find nitrogen boost spurs faster tropical forest growth

https://news.mongabay.com/2026/01/blew-us-away-researchers-find-nitrogen-boost-spurs-faster-tropi...
1•PaulHoule•11m ago•0 comments

Measuring Nighttime Light Exposure Across Major European and US Cities

https://geoform.io/cities-that-never-sleep/
1•jmech•12m ago•1 comment

Ask HN: Share your vibe coded project

1•firefoxd•12m ago•1 comment

The Neuro-Data Bottleneck: Why Neuro-AI Interfacing Breaks the Modern Data Stack

https://datachain.ai/blog/neuro-data-bottleneck
1•gptguy•12m ago•0 comments

WP Multitool Find what's slowing your WordPress. Fix it

https://wpmultitool.com/
1•taubek•13m ago•0 comments

Radio host David Greene says Google's AI podcast tool stole his voice

https://www.washingtonpost.com/technology/2026/02/15/david-greene-google-ai-podcast/
1•mikhael•14m ago•0 comments

Ask HN: What's the best realtime, local, TTS solution? Live call interpretation

1•Wright007•14m ago•0 comments

AI film school trains next generation of Hollywood moviemakers

https://www.reuters.com/business/media-telecom/ai-film-school-trains-next-generation-hollywood-mo...
4•devonnull•15m ago•0 comments

Show HN: Djevops – A CLI tool for hosting Django on bare metal

https://github.com/mherrmann/djevops
1•mherrmann•15m ago•0 comments

Modern CSS Code Snippets: Stop writing CSS like it's 2015

https://modern-css.com
1•eustoria•15m ago•0 comments

Pinchtab – 12MB Go Binary for AI Browser for OpenClaw

https://github.com/pinchtab/pinchtab
1•tengio•18m ago•1 comment

Do you need an admin party to get your life back in order?

https://www.rnz.co.nz/life/lifestyle/do-you-need-an-admin-party-to-get-your-life-back-in-order
4•billybuckwheat•18m ago•0 comments

Extending Large Language Models to multimodality for non-English languages

https://www.sciencedirect.com/science/article/pii/S1077314225003418
1•saikatsg•21m ago•0 comments

Where Does Ollama run glm-5:cloud Run? And other Security Blunders

https://docs.ollama.com/cloud
2•coolguysailer•21m ago•1 comment

Fontstand International Typography Conference 2026

https://fontstand.com/conference/2026
1•eustoria•21m ago•0 comments

Show HN: Apiosk – Self-service API marketplace with per-request USDC payments

https://apiosk.com
1•ollybrinkman•22m ago•0 comments

Video feedback fractal device to get an order of magnitude upgrade in resolution

https://www.thelightherder.com/2026/02/an-exciting-new-development-4k.html
1•thelightherder•22m ago•0 comments

Show HN: LaTeX Salon, a Trystero-based multiplayer LaTeX scratchpad

https://latex.salon
2•ashivkum•23m ago•0 comments

Show HN: Endlessh Fisher – Turn SSH tarpit bots into collectible fish

https://github.com/DarkWolfCave/endlessh-fisher
1•darkwolfcave•26m ago•1 comment

Show HN: Violit – Fine-grained reactive Python Web UI (Streamlit-alternative)

https://github.com/violit-dev/violit
1•dopeflamingo•28m ago•0 comments

IR USB device for Casio WQV-1 – the first camera watch

https://bsky.app/profile/partlyhuman.com/post/3mefdsvt5ys2n
1•thcipriani•28m ago•0 comments

Show HN: Deadend CLI – Open-source self-hosted agentic pentesting tool

https://github.com/xoxruns/deadend-cli
1•gemini-15•30m ago•0 comments

I Know What You Think of Me

https://archive.nytimes.com/opinionator.blogs.nytimes.com/2013/06/15/i-know-what-you-think-of-me/
1•Rendello•31m ago•0 comments

Tinder Hasn't Worked, So I'm Putting Myself on Zillow

https://www.mcsweeneys.net/articles/tinder-hasnt-worked-so-im-putting-myself-on-zillow
2•7777777phil•32m ago•0 comments