frontpage.

Hey HN, If you're building LangChain agents, you've probably seen them break in creative ways - prompt injection bypassing your chain logic, tools getting called with bad parameters, or cascading failures when an API times out mid-chain.

I built Khaos to test these failure modes before production.

Example LangChain agent: ```python from langchain.agents import AgentExecutor, create_openai_functions_agent from khaos import khaosagent

  @khaosagent(name="research-agent", framework="langgraph")
  def agent(query: str) -> dict:
      executor = AgentExecutor(agent=agent, tools=tools)
      result = executor.invoke({"input": query})
      return {"response": result["output"]}

Test it: pip install khaos-agent khaos discover khaos run research-agent --pack security

Khaos injects: - 242+ security attacks - Prompt injection variations that bypass LangChain's prompt templates - Tool misuse - Malicious parameters in tool calls (e.g., os.system injection in code execution tools) - Chain failures - What happens when your 3rd step in a 5-step chain times out? - LLM faults - Rate limits, token overflows, model unavailability

  Why this matters for LangChain specifically:

  LangChain's abstraction layers can hide vulnerabilities:
  - Prompt templates can still be injected via tool outputs
  - AgentExecutor doesn't validate tool parameters
  - Chains fail silently or propagate corrupted state
  - ReAct/Plan-and-Execute patterns have unique attack surfaces

  Works with LangGraph, LCEL chains, and classic LangChain agents. Auto-instruments your chains to inject faults at each step.

  Repo: https://github.com/ExordexLabs/khaos-sdk
  Examples: https://github.com/ExordexLabs/khaos-examples/tree/master/code-execution-agent

Building takes shorter than writing about it

Building a TUI is easy now

GPU, Accelerator Powered Analytical Engine

NYC gets its first 'free grocery store'

Show HN: InfiniteGPU, An open-source AI compute network,now supporting training

The Future of Programmers (2015)

Show HN: TextureFast – Generate PBR textures for 3D models in seconds

OpenAI Claims DeepSeek Distilled US Models to Gain an Edge

Pg_stat_ch: We built low-overhead Postgres metrics exporter to ClickHouse

Show HN: Kumiki – A Bento.me Clone

Moving Away from Nextcloud

Opus 4.6: long haul breakthrough

Developing ethical, social, and cognitive competence (2015)

Apple's Next Two Products Are Coming Soon

Show HN: Clawlet – Ultra-Lightweight&Efficient Alternative to OpenClaw, Nanobot

Show HN: My agent started its own online store

The problem isn't OpenClaw. it's the architecture

Regulation Is a Service Problem

14-Year-Old Is Using Origami to Imagine Emergency Shelters

Using the Ralph Wiggum loop to execute Kiro specs

AI Bots Are Making Anonymity Untenable

Wikipedia controversy with archive.is resulted from attempt to doxx site owner

Zero-Downtime Ingress Controller Migration in Kubernetes

Show HN: Free OSS cold email bulk sender and management

Simile: A simulation platform for human behavior

Your Turn

I built a Claude.md that solves the compaction/context loss problem

Let's Build an AI Assistant That Remembers

OMLX – LLM Inference Server for Apple Silicon (Ollama for MLX)

Performance and reliability pitfalls of eBPF [video,pdf]

LangChain Agent Testing Guide Tool (Free)