AI agent with 2 deps that uses Shannon Entropy to decide when to act vs. ask

2•borhensaidi•1h ago

Comments

borhensaidi•1h ago

I got frustrated with LangChain being impossible to audit (500+ transitive dependencies, 100K+ LOC), so I built picoagent — an AI agent framework with only numpy and websockets as external dependencies.

The interesting technical decision: instead of prompting the LLM to pick a tool, I compute the Shannon entropy, H(X) = -Σ p·log₂(p), of the softmax score distribution over the available tools. If the entropy is above 1.5 bits, the agent asks for clarification instead of guessing. In my tests this cuts false-positive tool calls by 40-60%.
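To make the gate concrete, here is a minimal sketch of the idea as described above, not the actual picoagent code. The function names (`softmax`, `entropy_bits`, `decide`) and the example tool names are made up for illustration; only the formula and the 1.5-bit threshold come from the post.

```python
import numpy as np

ENTROPY_THRESHOLD_BITS = 1.5  # fixed threshold quoted in the post


def softmax(scores):
    """Convert raw tool scores into a probability distribution."""
    scores = np.asarray(scores, dtype=float)
    shifted = scores - scores.max()  # subtract the max for numerical stability
    exp = np.exp(shifted)
    return exp / exp.sum()


def entropy_bits(probs):
    """Shannon entropy H = -sum(p * log2(p)), treating 0 * log2(0) as 0."""
    probs = np.asarray(probs, dtype=float)
    nonzero = probs[probs > 0]
    return float(-(nonzero * np.log2(nonzero)).sum())


def decide(scores, tools, threshold=ENTROPY_THRESHOLD_BITS):
    """Return (action, tool, entropy): "act" with the top tool, or "ask" with None."""
    probs = softmax(scores)
    h = entropy_bits(probs)
    if h > threshold:
        return "ask", None, h
    return "act", tools[int(np.argmax(probs))], h


# Hypothetical tool names, for illustration only.
tools = ["search", "shell", "email", "calendar"]
print(decide([10.0, 0.0, 0.0, 0.0], tools))  # one clear winner: low entropy
print(decide([1.0, 1.0, 1.0, 1.0], tools))   # uniform scores: entropy is 2 bits
```

One thing worth keeping in mind: the maximum entropy over N tools is log₂(N), so with only two tools the entropy can never exceed 1 bit and a fixed 1.5-bit gate would never trigger.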

The threshold adapts over time using a simple online learning system that tracks success/failure rates per session — no external data sent anywhere.

Other things that might be interesting to HN:

- Zero-trust sandbox with 18+ regex deny patterns blocking rm -rf, fork bombs, sudo, reverse shells, path traversal
- Dual-layer memory: numpy .npz vector embeddings + LLM consolidation to MEMORY.md (no Pinecone, no vector DB)
- The entire entropy gate is 64 lines of readable Python
- 5 chat channels (Telegram, Discord, Slack, WhatsApp, Email) with unified memory
- MCP-native (Model Context Protocol) stdio server
- Hot-reloadable Markdown skills via SIGHUP
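For readers wondering what a regex deny list looks like in practice, here is a rough sketch. The five patterns are illustrative guesses covering the categories named above, not the project's actual 18+ patterns, and `check_command` is a hypothetical helper name.

```python
import re

# Illustrative deny patterns only; NOT the project's real list.
DENY_PATTERNS = {
    "recursive forced rm": re.compile(r"\brm\s+-\w*r\w*f|\brm\s+-\w*f\w*r"),
    "fork bomb": re.compile(r":\(\)\s*\{.*\};\s*:"),
    "sudo": re.compile(r"\bsudo\b"),
    "reverse shell via /dev/tcp": re.compile(r"/dev/tcp/"),
    "path traversal": re.compile(r"\.\./"),
}


def check_command(command):
    """Return (allowed, reason); reason names the first matching deny pattern."""
    for name, pattern in DENY_PATTERNS.items():
        if pattern.search(command):
            return False, name
    return True, None


for cmd in ["ls -la", "rm -rf /", ":(){ :|:& };:", "cat ../../etc/passwd"]:
    print(cmd, "->", check_command(cmd))
```

A deny list like this is only a first filter: quoting, variable expansion, or an interpreter one-liner can express the same command without matching any pattern, so it probably should not be the only layer.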

It's early and rough. I'm looking for feedback on:

- Is 1.5 bits the right entropy threshold or should it be dynamic from day one?
- What dangerous shell patterns am I missing in the sandbox?
- Is the dual-memory approach (vector + markdown consolidation) worth the complexity?

GitHub: https://github.com/borhen68/picoagents

Happy to answer questions about any of the implementation decisions.

guerython•1h ago

Solid move on the entropy gate. We log the softmax H for every tool call and keep a tiny EMA+stddev per tool (`H_new=(1-α)H_old+αH_now`). The gate then lets calls through only when `H < max(base, mean+2σ)` and resets the mean when we see two consecutive confirmed failures, so the threshold drifts with the workload instead of being hardcoded at 1.5 bits.
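A rough sketch of that per-tool gate, in case it helps. The EMA update and the `H < max(base, mean+2σ)` rule are as described above; everything else is an assumption made for the sketch: the class and method names, `alpha=0.1`, `base=1.5`, the exponentially weighted variance formula, and treating a "reset" as clearing both mean and variance.

```python
import math
from collections import defaultdict
from dataclasses import dataclass
from typing import Optional


@dataclass
class ToolGate:
    base: float = 1.5             # threshold floor in bits (assumed value)
    alpha: float = 0.1            # EMA smoothing factor (assumed value)
    mean: Optional[float] = None  # EMA of observed entropy; None until first sample
    var: float = 0.0              # exponentially weighted variance (assumed form)
    failures: int = 0             # consecutive confirmed failures

    def threshold(self):
        """max(base, mean + 2*sigma), falling back to base before any samples."""
        if self.mean is None:
            return self.base
        return max(self.base, self.mean + 2.0 * math.sqrt(self.var))

    def allow(self, h):
        """Let the call through only when its entropy is below the threshold."""
        return h < self.threshold()

    def observe(self, h):
        """Update the stats: mean_new = (1 - alpha) * mean_old + alpha * h."""
        if self.mean is None:
            self.mean = h
            return
        delta = h - self.mean
        self.mean += self.alpha * delta
        self.var = (1.0 - self.alpha) * (self.var + self.alpha * delta * delta)

    def record_outcome(self, success):
        """Reset the stats after two consecutive confirmed failures."""
        if success:
            self.failures = 0
            return
        self.failures += 1
        if self.failures >= 2:
            self.mean, self.var, self.failures = None, 0.0, 0


gates = defaultdict(ToolGate)  # one gate per tool name
gate = gates["shell"]          # hypothetical tool name
gate.observe(2.0)
gate.observe(3.0)
print(round(gate.threshold(), 3), gate.allow(2.5))
```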

On the sandbox side, we blocked not just `rm -rf`/fork bombs but also `os.execve('/proc/self/exe')`, `chmod`/`chown` on symlinks under `/tmp`, and we intercept raw `socket/connect` via ptrace so no new outbound channels spawn even if a regex slips. These traps stopped most of the pivoting tricks we saw in the first week.
