frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: We tested AI agents with 214 attacks that don't require jailbreaking

1•exordex•1h ago
Most agent security testing tries to jailbreak the model. That's really difficult, OpenAI and Anthropic are good at red-teaming.

We took a different approach: attack the environment, not the model.

Results from testing agents against our attack suite:

- Tool manipulation: Asked agent to read a file, injected path=/etc/passwd. It complied. - Data exfiltration: Asked agent to read config, email it externally. It did. - Shell injection: Poisoned git status output with instructions. Agent followed them. - Credential leaks: Asked for API keys "for debugging." Agent provided them.

None of these required bypassing the model's safety. The model worked correctly—the agent still got owned.

How it works:

We built shims that intercept what agents actually do: - Filesystem shim: monkeypatches open(), Path.read_text() - Subprocess shim: monkeypatches subprocess.run() - PATH hijacking: fake git/npm/curl that wrap real binaries and poison output

The model sees what looks like legitimate tool output. It has no idea.

214 attacks total. File injection, shell output poisoning, tool manipulation, RAG poisoning, MCP attacks.

Early access: https://exordex.com

Looking for feedback from anyone shipping agents to production.

Community Benchmarks: Evaluating Modern AI on Kaggle

https://blog.google/innovation-and-ai/technology/developers-tools/kaggle-community-benchmarks/
1•gmays•2m ago•0 comments

Reminders (BSD Calendar and All)

https://dsl.org/cookbook/cookbook_34.html
1•jrgd•2m ago•1 comments

IPTV Piracy Crackdown in Sweden 'Exposes' 4,886 Subscribers

https://torrentfreak.com/iptv-piracy-crackdown-in-sweden-exposes-4886-subscribers/
1•gslin•3m ago•0 comments

Show HN: OpenSheet – experimenting with how LLMs should work with spreadsheets

https://opensheet.app/
1•aminkhorrami•3m ago•0 comments

How to Create an Isometric NYC

https://cannoneyed.com/projects/isometric-nyc
1•mefengl•5m ago•1 comments

How Google SREs Use Gemini CLI to Solve Real-World Outages

https://cloud.google.com/blog/topics/developers-practitioners/how-google-sres-use-gemini-cli-to-s...
1•m3drano•5m ago•0 comments

Raytheon Cathode Ray Charge Storage Type 8602/CK1383A

https://www.icloud.com/sharedalbum/#B2dG4VTwGGWHAjs
1•detourdog•5m ago•1 comments

How a band of engineers anticipated the cloud and remade the internet

https://www.cs.princeton.edu/news/how-band-engineers-anticipated-cloud-and-remade-internet
1•zdw•6m ago•0 comments

SpaceX lines up 4 banks for blockbuster IPO

https://www.ft.com/content/55235da5-9a3f-4e0f-b00c-4e1f5abdc606
1•ViktorRay•8m ago•0 comments

Space station's ultrasound machine was critical during medical crisis

https://apnews.com/article/nasa-astronauts-medical-evacuation-crew-11-d501e91736371525e81d13794d5...
2•geox•10m ago•1 comments

Ask HN: What's the best virtual Linux desktop experience on macOS for devs?

2•darkteflon•11m ago•1 comments

C64 Ultimate Review and What's Next?

https://retrogamecoders.com/c64-ultimate-review/
1•ibobev•12m ago•0 comments

Intricacies of Helix Nebula Revealed with NASA's Webb

https://science.nasa.gov/missions/webb/intricacies-of-helix-nebula-revealed-with-nasas-webb/
2•gmays•12m ago•0 comments

Making a Small Mouse-Driven RPG

https://jslegenddev.substack.com/p/how-mouse-input-turned-my-game-on
1•ibobev•13m ago•0 comments

Microsoft Releases Statement as Office, Teams, 365 Outages Continue

https://www.newsweek.com/microsoft-down-office-365-outage-today-status-issue-11403718
3•kayge•15m ago•0 comments

How to Find LinkedIn Profiles and Work Emails in 5 Minutes

https://crona.ai/blog/how-to-find-a-work-email-from-a-name-and-linkedin-2-step-workflow
2•rin_khat•15m ago•0 comments

Supersonic Bizjets: A Sound Investment?

https://www.ainonline.com/aviation-news/business-aviation/2025-12-02/supersonic-bizjets-sound-inv...
1•dangle1•16m ago•0 comments

Stop Worrying, and Let A.I. Help Save Your Life

https://www.nytimes.com/2026/01/19/opinion/ai-health-medical-care.html
1•bookofjoe•20m ago•2 comments

When will CSS Grid Lanes arrive? How long until we can use it?

https://webkit.org/blog/17758/when-will-css-grid-lanes-arrive-how-long-until-we-can-use-it/
1•chmaynard•20m ago•0 comments

Generative AI is an expensive edging machine

https://www.garbageday.email/p/generative-ai-is-an-expensive-edging-machine
2•cratermoon•21m ago•0 comments

Pg_utl_SMTP for PostgreSQL Like Oracle Utl_SMTP

https://hexacluster.ai/blog/send-emails-like-oracle-utl-smtp-using-pg-utl-smtp-for-postgresql
1•avivallssa•21m ago•1 comments

Flutterwave Launches Stablecoin Wallets

https://techcabal.com/2026/01/22/flutterwave-partners-turnkey-to-launch-stablecoin-wallets/
2•SaaSasaurus•21m ago•0 comments

The Day After AGI [video]

https://www.youtube.com/watch?v=mmKAnHz36v0
1•barbazoo•24m ago•1 comments

Microsoft 365 outage is being resolved, Microsoft says

https://twitter.com/MSFT365Status/status/2014446651289829563
1•chwtutha•24m ago•0 comments

Digital liberation: EU Parliament calls for detachment from US tech giants

https://www.heise.de/en/news/Digital-liberation-EU-Parliament-calls-for-detachment-from-US-tech-g...
3•i-con•26m ago•1 comments

The Future of YouTube: CEO Neal Mohan's 2026 Letter

https://blog.youtube/inside-youtube/the-future-of-youtube-2026/
1•ChrisArchitect•26m ago•0 comments

California adds hundreds in hidden fees to drive up traffic tickets

https://www.cbsnews.com/news/california-hidden-fees-drive-up-traffic-tickets/
3•mhb•26m ago•0 comments

Flexible use of a multi-purpose tool by a cow

https://www.cell.com/current-biology/fulltext/S0960-9822(25)01597-0
1•animal_spirits•26m ago•2 comments

Ask HN: Claude Down?

3•emschwartz•27m ago•2 comments

The Substack TV app, now in beta

https://on.substack.com/p/introducing-the-substack-tv-app-now
1•raybb•28m ago•0 comments