frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

How to Red Team Your AI Agent in 48 Hours – A Practical Methodology

1•manuelnd•1h ago
We published the methodology we use for AI red team assessments. 48 hours, 4 phases, 6 attack priority areas.

This isn't theoretical — it's the framework we run against production AI agents with tool access. The core insight: AI red teaming requires different methodology than traditional penetration testing. The attack surface is different (natural language inputs, tool integrations, external data flows), and the exploitation patterns are different (attack chains that compose prompt injection into tool abuse, data exfiltration, or privilege escalation).

The 48-hour framework:

1. Reconnaissance (2h) — Map interfaces, tools, data flows, existing defenses. An agent with file system and database access is a fundamentally different target than a chatbot.

2. Automated Scanning (4h) — Systematic tests across 6 priorities: direct prompt injection, system prompt extraction, jailbreaks, tool abuse, indirect injection (RAG/web), and vision/multimodal attacks. Establishes a baseline.

3. Manual Exploitation (8h) — Confirm findings, build attack chains, test defense boundaries. Individual vulnerabilities compose: prompt injection -> tool abuse -> data exfiltration is a common chain.

4. Validation & Reporting (2h) — Reproducibility, business impact, severity, resistance score.

Some observations from running these:

- 62 prompt injection techniques exist in our taxonomy. Most teams test for a handful. The basic ones ("ignore previous instructions") are also the first to be blocked.

- Tool abuse is where the real damage happens. Parameter injection, scope escape, and tool chaining turn a successful prompt injection into unauthorized database queries, file access, or API calls.

- Indirect injection is underappreciated. If your AI reads external content (RAG, web search), that content is an attack surface. 5 poisoned documents among millions can achieve high attack success rates.

- Architecture determines priority. Chat-only apps need prompt injection testing first. RAG apps need indirect injection first. Agents with tools need tool abuse testing first.

The methodology references our open-source taxonomy of 122 attack vectors: https://github.com/tachyonicai/tachyonic-heuristics

Full post: https://tachyonicai.com/blog/how-to-red-team-ai-agent/

OWASP LLM Top 10 companion guide: https://tachyonicai.com/blog/owasp-llm-top-10-guide/

Retired Netflix Engineering Director on Regrets, Video Engineering, Hiring

https://www.youtube.com/watch?v=ApG9vjbHDCk
1•ksec•1m ago•0 comments

Toolspotting: A new way to measure engagement

https://www.toolspotting.com/spotlight
1•mrdalal•2m ago•0 comments

499 is a prime number with this property: 499⁴⁹⁹ ends in 499499

https://twitter.com/pickover/status/2023047194211701052
1•keepamovin•5m ago•0 comments

Ask HN: What is the best bang for buck budget AI coding?

1•LowResBudget•6m ago•1 comments

Teaching Claude to Write Pony

https://www.ponylang.io/blog/2026/02/teaching-claude-to-write-pony/
1•spooneybarger•7m ago•0 comments

Browse Code by Meaning

https://haskellforall.com/2026/02/browse-code-by-meaning
1•romac•7m ago•0 comments

A remote control for your agents

https://www.restate.dev/blog/a-remote-control-for-your-agents
1•stsffap•7m ago•1 comments

Data Is Your Moat

https://www.parseable.com/blog/data-is-your-moat
1•tiwarinitish86•7m ago•2 comments

Capita taps Microsoft Copilot to dig it out from UK pensions backlog

https://www.theregister.com/2026/02/17/capita_microsoft_copilot_pensions/
1•jjgreen•10m ago•1 comments

Show HN: Nibble a fast and easy to use network scanner

https://github.com/backendsystems/nibble
1•saberd•11m ago•0 comments

Capitalist Countries 2026

https://worldpopulationreview.com/country-rankings/capitalist-countries
1•ksec•12m ago•0 comments

Two Bits Are Better Than One: making bloom filters 2x more accurate

https://floedb.ai/blog/two-bits-are-better-than-one-making-bloom-filters-2x-more-accurate
4•matheusalmeida•13m ago•0 comments

I broke into my own AI system in 10 minutes. I built it

2•mohith_km•14m ago•0 comments

Cascade standalone DNSSEC signer in Rust from NLnet

https://blog.nlnetlabs.nl/cascade/
1•xvilka•15m ago•0 comments

The Infrastructure of Jeffrey Epstein's Power

https://www.nytimes.com/2026/02/13/opinion/ezra-klein-podcast-anand-giridharadas.html
1•rbanffy•16m ago•0 comments

The Cost of Staying

https://twitter.com/amytam01/status/2023593365401636896
1•canadianhacker•18m ago•0 comments

Chinese Memory Penetrates Global PC Supply Chains

https://www.chosun.com/english/industry-en/2026/02/08/ZHVGQTPLQ5CQ5BE2YBT22GTS2M/
2•Qem•19m ago•0 comments

Show HN: CleanCloud – 20 rules to find what's costing you money in AWS and Azure

1•sureshcsdp•19m ago•1 comments

Maybe America Needs Some New Cities

https://www.nytimes.com/2026/02/12/business/economy/america-new-cities-irvine.html
1•woldemariam•21m ago•0 comments

The Rev. Jesse Jackson, pioneering civil rights activist, dies at 84

https://www.cnn.com/2026/02/17/us/reverend-jesse-jackson-death
2•rmason•22m ago•1 comments

I attacked my own LangGraph agent system. All 6 attacks worked

1•mohith_km•23m ago•2 comments

OpenFactBook – The World Factbook

https://openfactbook.org/
1•bovermyer•24m ago•0 comments

Show HN: Free domain health monitoring tool

https://check-server.iqtechnology.io/
1•bodyast1010•24m ago•0 comments

'All records broken' as storm leaves swaths of France under water

https://www.france24.com/en/france/20260214-all-records-broken-storm-nils-leaves-swathes-southwes...
1•geox•25m ago•0 comments

A phone is stolen in London every seven to eight minutes

https://www.bbc.com/news/articles/cdx4762znr6o
1•woldemariam•27m ago•0 comments

Hunt Globally

https://arxiv.org/abs/2602.15019
1•salkahfi•27m ago•0 comments

Fast Sorting, Branchless by Design

https://00f.net/2026/02/17/sorting-without-leaking-secrets/
1•jedisct1•30m ago•0 comments

You Only Debug Once? Think Again

https://singularitynow.substack.com/p/you-only-debug-once-think-again
1•danduma•38m ago•1 comments

How Mitchell Hashimoto Builds Ghostty [video]

https://www.youtube.com/watch?v=ljoNEH39lyw
2•TheWiggles•42m ago•0 comments

OpenAI Tapped for Voice Control Tech in US Drone Swarm Challenge

https://www.bloomberg.com/news/articles/2026-02-13/openai-tapped-for-voice-control-tech-in-us-dro...
1•macleginn•43m ago•1 comments