5:["$","div",null,{"className":"flex-1 flex flex-col","children":[["$","header",null,{"className":"w-full flex items-center justify-between min-h-[52px] h-[52px] px-4 gap-4","children":[["$","$L10",null,{"href":"/newest","children":["$","svg",null,{"xmlns":"http://www.w3.org/2000/svg","width":24,"height":24,"viewBox":"0 0 24 24","fill":"none","stroke":"currentColor","strokeWidth":2,"strokeLinecap":"round","strokeLinejoin":"round","className":"lucide lucide-move-left","children":[["$","path","kyvwex",{"d":"M6 8L2 12L6 16"}],["$","path","1m8cig",{"d":"M2 12H22"}],"$undefined"]}]}],["$","a",null,{"className":"text-xs hover:underline","href":"https://news.ycombinator.com/item?id=46470164","target":"_blank","rel":"noopener noreferrer","children":"Open in hackernews"}]]}],["$","$Lc",null,{}],["$","div",null,{"className":"p-4 overflow-scroll","children":[["$","h1",null,{"className":"text-2xl font-extrabold mb-2 break-all","children":"Same AI agent, different prompts: 0% vs. 62% security pass rate"}],null,["$","div",null,{"className":"text-xs text-muted-foreground mt-4 font-bold flex gap-1","children":[["$","span",null,{"className":"flex items-center gap-1","children":[["$","svg",null,{"xmlns":"http://www.w3.org/2000/svg","width":14,"height":14,"viewBox":"0 0 24 24","fill":"none","stroke":"currentColor","strokeWidth":2,"strokeLinecap":"round","strokeLinejoin":"round","className":"lucide lucide-arrow-up","children":[["$","path","hav0vg",{"d":"m5 12 7-7 7 7"}],["$","path","x0mq9r",{"d":"M12 19V5"}],"$undefined"]}],1]}],["$","span",null,{"children":"•"}],["$","span",null,{"children":"xsourcesec"}],["$","span",null,{"children":"•"}],["$","span",null,{"children":"1mo ago"}]]}],["$","div",null,{"className":"mt-4 text-sm post-text","dangerouslySetInnerHTML":{"__html":"I've been testing production AI agents for vulnerabilities.

Interesting finding: System prompt design matters more than the model itself.

Same agent. Same task. Same attack vectors.\nOnly difference: how the system prompt was structured.

Results:\n→ Prompt A: 0% pass rate (failed every test)\n→ Prompt B: 62.5% pass rate

No model change. No fine-tuning. Just prompt engineering.

Anyone else seeing this pattern? What's your approach to hardening AI agents?

"}}],["$","section",null,{"className":"mt-4","children":[["$","h1",null,{"className":"font-bold text-lg","children":"Comments"}],[["$","details","46470455",{"className":"p-2 w-full comment pl-0","open":true,"data-id":46470455,"children":[["$","summary",null,{"children":[["$","span",null,{"className":"text-xs text-muted-foreground mr-1","children":"chrisjj"}],["$","span",null,{"className":"text-xs text-muted-foreground mr-1","children":"•"}],["$","span",null,{"className":"text-xs text-muted-foreground","children":"1mo ago"}]]}],["$","div",null,{"className":"whitespace-pre-wrap break-words text-sm pb-2","dangerouslySetInnerHTML":{"__html":"Surely this is no more than expected. Just as say a building might have some entrances secure and others not."}}],[]]}]]]}]]}]]}]

Flirt: The Native Backend

OpenAI's Latest Platform Targets Enterprise Customers

Goldman Sachs taps Anthropic's Claude to automate accounting, compliance roles

Ai.com bought by Crypto.com founder for $70M in biggest-ever website name deal

Big Tech's AI Push Is Costing More Than the Moon Landing

The AI boom is causing shortages everywhere else

Suno, AI Music, and the Bad Future [video]

Ask HN: How are researchers using AlphaFold in 2026?

Running the "Reflections on Trusting Trust" Compiler

Watermark API – $0.01/image, 10x cheaper than Cloudinary

Now send your marketing campaigns directly from ChatGPT

Queueing Theory v2: DORA metrics, queue-of-queues, chi-alpha-beta-sigma notation

Show HN: Hibana – choreography-first protocol safety for Rust

Haniri: A live autonomous world where AI agents survive or collapse

GPT-5.3-Codex System Card [pdf]

Atlas: Manage your database schema as code

Geist Pixel

Show HN: MCP to get latest dependency package and tool versions

The better you get at something, the harder it becomes to do

Show HN: WP Float – Archive WordPress blogs to free static hosting

Show HN: I Hacked My Family's Meal Planning with an App

Sony BMG copy protection rootkit scandal

The Future of Systems

NASA now allowing astronauts to bring their smartphones on space missions

Claude Code Is the Inflection Point

Show HN: MicroClaw – Agentic AI Assistant for Telegram, Built in Rust

Show HN: Omni-BLAS – 4x faster matrix multiplication via Monte Carlo sampling

The AI-Ready Software Developer: Conclusion – Same Game, Different Dice

AI Agent Automates Google Stock Analysis from Financial Reports

Voxtral Realtime 4B Pure C Implementation