frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Same AI agent, different prompts: 0% vs. 62% security pass rate

1•xsourcesec•2h ago
I've been testing production AI agents for vulnerabilities.

Interesting finding: System prompt design matters more than the model itself.

Same agent. Same task. Same attack vectors. Only difference: how the system prompt was structured.

Results: → Prompt A: 0% pass rate (failed every test) → Prompt B: 62.5% pass rate

No model change. No fine-tuning. Just prompt engineering.

Anyone else seeing this pattern? What's your approach to hardening AI agents?

Comments

chrisjj•1h ago
Surely this is no more than expected. Just as say a building might have some entrances secure and others not.

AI Maestro Agent Orchestration

https://github.com/23blocks-OS/ai-maestro
1•RyanShook•40s ago•0 comments

TIL: I am an open-source contributor

https://beasthacker.com/til/i-am-an-open-source-contributor.html
1•oumua_don17•50s ago•0 comments

Spotify Wrapped season, don't outsource your love of music to AI

https://www.theguardian.com/music/2025/dec/03/spotify-wrapped-ai-create-your-own-playlists
1•cdrnsf•5m ago•0 comments

Solving Agent Context Loss: A Beads and Claude Code Workflow for Large Features

https://jx0.ca/solving-agent-context-loss/
1•jarredkenny•9m ago•1 comments

IMS Toucan – Text-to-Speech for over 7000 Languages

https://github.com/DigitalPhonetics/IMS-Toucan
1•punnerud•9m ago•0 comments

Show HN: A flight simulator for difficult leadership conversations

https://shadowscoping.com/
2•rezat•11m ago•0 comments

Self-driving cars aren't nearly a solved problem

https://strangecosmos.substack.com/p/self-driving-cars-arent-nearly-a
1•el_nahual•14m ago•0 comments

Show HN: Snowflake Emulator – Local Snowflake Development with Go and DuckDB

https://github.com/nnnkkk7/snowflake-emulator
1•sr-white•15m ago•0 comments

Roundup of Events for Bootstrappers in January 2026

https://bootstrappersbreakfast.com/2025/12/23/roundup-of-january-2026-bootstrappers-events/
1•skmurphy•15m ago•1 comments

Lynkr – Multi-Provider LLM Proxy

https://github.com/Fast-Editor/Lynkr
1•vishalveera•15m ago•1 comments

How to read more? We might take instruction from a more leisurely age

https://www.historytoday.com/archive/out-margins/new-year-readers-resolutions
2•hhs•16m ago•0 comments

Common prefix skipping, adaptive sort

http://smalldatum.blogspot.com/2026/01/common-prefix-skipping-adaptive-sort.html
2•coffepot77•16m ago•0 comments

Home Assistant

https://www.home-assistant.io/
1•elsewhen•16m ago•0 comments

2026 will be my year of the Linux desktop

https://xeiaso.net/notes/2026/year-linux-desktop/
47•todsacerdoti•21m ago•20 comments

The Physics of Ideas: Reality as a Coordination Problem

https://fuck.fail
2•shoes_for_thee•22m ago•1 comments

Veil: Client-Side Steganography

https://veil.offseq.com/
2•jonbaer•22m ago•0 comments

Carbon Costs Quantified

https://www.astralcodexten.com/p/carbon-costs-quantified
1•thelastgallon•23m ago•0 comments

Show HN: Website that plays the lottery every second

https://lotteryeverysecond.lffl.me/
2•Loeffelmann•24m ago•0 comments

Wayfarer Labs is about to be OverWorld

https://wayfarerlabs.ai
3•overworld•26m ago•1 comments

Software Error Will Force 325,000 Californians to Replace Real IDs

https://www.nytimes.com/2026/01/02/us/california-real-id-dmv-error.html
3•bookofjoe•28m ago•1 comments

EmacsConf 2025 Notes

https://sachachua.com/blog/2026/01/emacsconf-2025-notes/
3•JNRowe•30m ago•0 comments

Tell HN: I shipped a script-based language filter and an onboarding tour

1•rankiwiki•31m ago•0 comments

One line, one agent: LLM-native language NERD goes agent-first

https://www.nerd-lang.org/agent-first
1•gnanagurusrgs•34m ago•1 comments

Show HN: Wip – Watch and reload any process using pluggable hooks

https://github.com/system32-ai/wip
2•debarshri•35m ago•0 comments

Microsoft kills official way to activate Windows 11/10 without internet

https://www.neowin.net/news/report-microsoft-quietly-kills-official-way-to-activate-windows-1110-...
5•josephcsible•35m ago•0 comments

Ask HN: Is there any way to lock apps on iPhone?

1•Quinzel•36m ago•1 comments

Liballocs: Meta-level run-time services for Unix processes

https://github.com/stephenrkell/liballocs
1•PaulHoule•36m ago•0 comments

Seminole Warriors Fought the US Military to a Stalemate

https://www.military.com/daily-news/investigations-and-features/2025/12/30/seminole-warriors-foug...
3•santadays•36m ago•0 comments

Stewart Cheifet–creator, exec producer&host of Computer Chronicles, dies at 87

https://computerchronicles.blog/post/stewart-cheifet-1938-2025/
1•bookofjoe•38m ago•0 comments

Show HN: A Bloomberg terminal for finding fresh powder (DuckDB WASM)

https://aryeh-snow.storage.googleapis.com/index.html
4•aribenjamin•38m ago•0 comments