Hi HN, I built AIR because my own AI agents went off the rails.
I run an e-commerce store and deployed agents to handle customer communications. They started sending wrong information, making promises we couldn't keep, and handling complaints badly. When I tried to figure out what happened, I had scattered logs across services with no way to prove what the agents actually said.
AIR is a flight recorder for AI systems. It's an OpenAI-compatible reverse proxy (written in Go) that sits between your code and your LLM provider. Every prompt, completion, and tool call gets recorded with HMAC-SHA256 tamper-evident audit chains — modify one record and the chain breaks.
What makes this different from Langfuse/Helicone/LangSmith: accountability, not just observability. Tamper-evident chains instead of mutable logs. Content stays on your infra (S3/MinIO). 22 compliance controls auto-mapped to SOC 2, ISO 27001, and EU AI Act. Deterministic replay of any AI decision.
shotwellj•1h ago
AIR is a flight recorder for AI systems. It's an OpenAI-compatible reverse proxy (written in Go) that sits between your code and your LLM provider. Every prompt, completion, and tool call gets recorded with HMAC-SHA256 tamper-evident audit chains — modify one record and the chain breaks.
What makes this different from Langfuse/Helicone/LangSmith: accountability, not just observability. Tamper-evident chains instead of mutable logs. Content stays on your infra (S3/MinIO). 22 compliance controls auto-mapped to SOC 2, ISO 27001, and EU AI Act. Deterministic replay of any AI decision.
Python SDK integrates with OpenAI, LangChain, and CrewAI: pip install air-blackbox-sdk Interactive demo (no signup): https://nostalgicskinco.github.io/air-blackbox-gateway/air-d...
Everything is Apache-2.0. 200+ tests. CI on every push.