The problem: robot learning datasets contain bad demos (jerky movements, hesitation, inconsistent timing). Training on these hurts policy performance. Manual review doesn't scale.
pip install democlean
democlean analyze lerobot/pusht
democlean scores each episode by the mutual information (MI) between states and actions, i.e., how predictable the actions are given the states. Smooth, purposeful motion scores high; jerky, inconsistent motion scores low.

Validation: I correlated MI scores with motion metrics on lerobot/pusht (human teleoperation data). High-MI episodes had 12% lower jerk (p=0.02) and 24% higher state-action correlation (p=0.03). I did not train policies to measure downstream improvement.
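For anyone curious what the scoring boils down to, here's a minimal sketch of a raw KSG estimate of I(states; actions) for one episode (the estimator mentioned in the limitations below). This is my illustration of the general technique, not democlean's actual code; the `ksg_mi` name and the `k=3` default are my assumptions.

    import numpy as np
    from scipy.spatial import cKDTree
    from scipy.special import digamma

    def ksg_mi(states, actions, k=3):
        """KSG estimator (Kraskov et al. 2004, alg. 1) of I(states; actions).

        states: (N, d_s) array, actions: (N, d_a) array, one row per timestep.
        Higher MI = actions are more predictable from states.
        """
        n = len(states)
        joint = np.hstack([states, actions])
        # Distance to each point's k-th nearest neighbor in the joint space (max-norm).
        eps = cKDTree(joint).query(joint, k=k + 1, p=np.inf)[0][:, -1]
        # Shrink the radius one ulp so marginal counts are strictly inside it,
        # as the KSG estimator requires.
        eps = np.nextafter(eps, 0)
        s_tree, a_tree = cKDTree(states), cKDTree(actions)
        n_s = np.array([len(s_tree.query_ball_point(states[i], eps[i], p=np.inf)) - 1
                        for i in range(n)])
        n_a = np.array([len(a_tree.query_ball_point(actions[i], eps[i], p=np.inf)) - 1
                        for i in range(n)])
        return digamma(k) + digamma(n) - np.mean(digamma(n_s + 1) + digamma(n_a + 1))

    # Hypothetical usage: one score per episode, then rank and drop the low tail.
    # scores = [ksg_mi(ep["observation.state"], ep["action"]) for ep in episodes]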
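On the validation side, "jerk" is the third time derivative of the trajectory. A per-episode version can be as simple as this sketch (my assumption: actions are position targets sampled at a fixed rate; the 30 Hz default is a placeholder, adjust to the dataset's actual control rate):

    import numpy as np

    def mean_abs_jerk(actions, dt=1.0 / 30):
        """Mean magnitude of the third finite difference of the trajectory.

        actions: (T, d) array of commanded positions, sampled every dt seconds.
        """
        jerk = np.diff(actions, n=3, axis=0) / dt**3
        return np.abs(jerk).mean()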
Limitations I want to be upfront about:
- MI correlates with episode length (r≈0.8). Longer episodes score higher.
- This measures motion smoothness, not task success.
- Works best with 50+ episodes from a single task.
- Inspired by DemInf (Hejna et al., RSS 2025) but uses raw KSG estimation instead of their VAE pipeline. Simpler, probably less accurate for high-dimensional observations.
Complements score_lerobot_episodes, which catches visual issues (blur, lighting); democlean catches behavioral issues.
GitHub: https://github.com/dipampaul17/democlean
Happy to answer questions about the approach or validation.