frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Lightbox – Flight recorder for AI agents (record, replay, verify)

https://uselightbox.app/
3•Berticus12•2h ago
I built Lightbox because I kept running into the same problem: an agent would fail in production, and I had no way to know what actually happened.

Logs were scattered, the LLM’s “I called the tool” wasn’t trustworthy, and re-running wasn’t deterministic.

This week, tons of Clawdbot incidents have driven the point home. Agents with full system access can expose API keys and chat histories. Prompt injection is now a major security concern.

When agents can touch your filesystem, execute code, and browse the web…you probably need a tamper-proof record of exactly what actions it took, especially when a malicious prompt or compromised webpage could hijack the agent mid-session.

Lightbox is a small Python library that records every tool call an agent makes (inputs, outputs, timing) into an append-only log with cryptographic hashes. You can replay runs with mocked responses, diff executions across versions, and verify the integrity of logs after the fact.

Think airplane black box, but for your hackbox.

*What it does:*

- Records tool calls locally (no cloud, your infra)

- Tamper-evident logs (hash chain, verifiable)

- Replay failures exactly with recorded responses

- CLI to inspect, replay, diff, and verify sessions

- Framework-agnostic (works with LangChain, Claude, OpenAI, etc.)

*What it doesn’t do:* - Doesn’t replay the LLM itself (just tool calls) - Not a dashboard or analytics platform - Not trying to replace LangSmith/Langfuse (different problem)

*Use cases I care about:*

- Security forensics: agent behaved strangely, was it prompt injection? Check the trace.

- Compliance: “prove what your agent did last Tuesday”

- Debugging: reproduce a failure without re-running expensive API calls

- Regression testing: diff tool call patterns across agent versions

As agents get more capable and more autonomous (Clawdbot/Molt, Claude computer use, Manus, Devin), I think we’ll need black boxes the same way aviation does.

This is my attempt at that primitive.

It’s early (v0.1), intentionally minimal, MIT licensed.

Site: <https://uselightbox.app> install: `pip install lightbox-rec`

GitHub: <https://github.com/mainnebula/Lightbox-Project>

Would love feedback, especially from anyone thinking about agent security or running autonomous agents in production.

Your favorite work tools are now interactive inside Claude

https://claude.com/blog/interactive-tools-in-claude
1•consumer451•1m ago•0 comments

Pandas 3.0 Released

https://pandas.pydata.org/docs/whatsnew/v3.0.0.html
2•k1next•5m ago•0 comments

Why the Pumpernickel Bagel Is Disappearing

https://www.grubstreet.com/article/why-the-pumpernickel-bagel-is-disappearing-in-nyc.html
2•randycupertino•5m ago•1 comments

Printable WiFi Code Ornament

https://excamera.substack.com/p/printable-wifi-code-ornament
1•jamesbowman•7m ago•0 comments

Show HN: Manager List is now live

https://managerlist.com
1•miketu•7m ago•0 comments

Split – The Coin of Fate

https://split.displace.tech
1•eamann•7m ago•0 comments

Anthropic and OpenAI CEOs condemn ICE violence, praise Trump

https://techcrunch.com/2026/01/27/anthropic-and-openai-ceos-condemn-ice-violence-praise-trump/
3•SilverElfin•8m ago•0 comments

Anker Nano Charger (45W, Smart)(A121D) Testing and Exploration

https://www.lttlabs.com/articles/2026/01/27/anker-nano-charger-45w-testing
1•LabsLucas•9m ago•1 comments

The Underground Internet of the 1980s [video]

https://www.youtube.com/watch?v=T3wVhCE4j1c
2•oldnetguy•9m ago•0 comments

Good Taste as a Super Power

https://emsh.cat/good-taste/
1•Dangeranger•10m ago•0 comments

Scientist studies ultra-processed foods. Here's what he eats in a day

https://www.washingtonpost.com/wellness/2025/10/01/kevin-hall-ultra-processed-foods/
1•paulpauper•10m ago•0 comments

What will AI do to your career? (Maxim Fateev – CEO Temporal)

https://temporal.io/blog/what-will-ai-do-to-your-career
1•dpflan•13m ago•0 comments

Hedge funds are tapping prediction markets and their data for an edge

https://www.businessinsider.com/how-hedge-funds-are-using-prediction-markets-data-2026-1
1•paulpauper•13m ago•0 comments

Meta and Amazon shift to output-based performance reviews

https://www.bragdoc.ai/blog/output-over-effort-changes-everything
2•nataliaherself•13m ago•0 comments

China hacked Downing Street phones for years

https://www.telegraph.co.uk/news/2026/01/26/china-hacked-downing-street-phones-for-years/
4•croes•14m ago•0 comments

Using Gemini to draft DOT regulations

https://www.propublica.org/article/trump-artificial-intelligence-google-gemini-transportation-reg...
2•paulpauper•14m ago•0 comments

Show HN: We are building Git for data

3•mmnb•14m ago•0 comments

One Year Since the "DeepSeek Moment"

https://huggingface.co/blog/huggingface/one-year-since-the-deepseek-moment
1•ibobev•14m ago•0 comments

Alyah: Toward Robust Evaluation of Emirati Dialect Capabilities in Arabic LLMs

https://huggingface.co/blog/tiiuae/emirati-benchmarks
1•ibobev•14m ago•0 comments

Architectural Choices in China's Open-Source AI Ecosystem

https://huggingface.co/blog/huggingface/one-year-since-the-deepseek-moment-blog-2
1•ibobev•15m ago•0 comments

Why did the developer go broke?

1•oxqbldpxo•15m ago•2 comments

In less than a year, the resistance against returning to the office collapsed

https://www.theglobeandmail.com/business/careers/article-in-less-than-a-year-the-resistance-again...
1•charles_f•15m ago•1 comments

Generative AI failed to replace SaaS

1•AIFairy•16m ago•0 comments

Video of Assault on Peaceful ICE Observer

https://twitter.com/ScooterCasterNY/status/2010469244560146488
5•boplicity•17m ago•3 comments

Show HN: The Gallery – A collaborative 3D time capsule

https://the-gallery-f51a8.web.app
1•PUXABRIGA•19m ago•0 comments

Try text scaling support in Chrome Canary

https://www.joshtumath.uk/posts/2026-01-27-try-text-scaling-support-in-chrome-canary/
2•linolevan•20m ago•0 comments

Codenotary's Free SBoM Service Tackles the AI Software Supply Chain

https://devops.com/codenotarys-free-sbom-service-tackles-the-ai-software-supply-chain/
1•CrankyBear•20m ago•0 comments

DeepSeek-OCR 2: Visual Causal Flow

https://github.com/deepseek-ai/DeepSeek-OCR-2
1•nickthegreek•20m ago•0 comments

How to Fail as an Organization in 2026

https://jott.live/markdown/how_to_fail_2026
1•brrrrrm•22m ago•0 comments

Mozilla is building an AI 'rebel alliance' to take on industry heavweights

https://www.cnbc.com/2026/01/27/mozilla-building-an-ai-rebel-alliance-to-take-on-openai-anthropic...
5•thm•22m ago•2 comments