I’m Ashu, founder of VideoDB. I’ve spent a big chunk of my life building video infrastructure. Not video creation. Video plumbing.
The stuff you only learn after production breaks: timebases, VFR, keyframes, audio sync drift, container quirks, partial uploads, live streams, retries, backpressure, codecs, ffmpeg flags, cost blowups, and “why is this clip unseekable on one player but fine on another”.
This week we released VideoDB Skills, a skill pack that lets AI agents call those infra primitives directly, instead of wiring pipelines out of screenshots and FFmpeg glue.
Repo: https://github.com/video-db/skills
What it enables (infra level):
- Ingest videos and live streams
- Index and search moments
- Return playable evidence links
- Run server-side edits and transforms
- Trigger automations from video events
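To make "playable evidence links" concrete: a search hit with start/end offsets can become a URL that seeks straight to the moment, e.g. via W3C Media Fragments (`#t=start,end`). This helper is a hypothetical sketch of the idea, not the actual VideoDB API.

```python
# Hypothetical sketch: turn a (stream_url, start, end) search hit into a
# playable "evidence link" using W3C Media Fragments (#t=start,end).
# Illustrative only -- not how VideoDB necessarily constructs its links.

def evidence_link(stream_url: str, start_s: float, end_s: float) -> str:
    """Append a temporal media fragment so players seek to the moment."""
    return f"{stream_url}#t={start_s:g},{end_s:g}"

print(evidence_link("https://example.com/stream.m3u8", 12.5, 34.0))
# -> https://example.com/stream.m3u8#t=12.5,34
```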
Why this matters for agents:
Agents can reason, write code, and browse. But continuous media is still mostly invisible to them. In an agentic world, perception needs to be a first-class interface, not a manual workflow.
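Here is one way a minimal perception interface might be typed. The names and shape are mine, to make the abstraction question below concrete; they are not VideoDB's API.

```python
from dataclasses import dataclass
from typing import Iterable, Protocol, runtime_checkable

# A hypothetical sketch of "perception as a first-class interface".
# All names here are illustrative assumptions, not the VideoDB API.

@dataclass
class Moment:
    source_id: str      # which video/stream the moment came from
    start_s: float      # offset into the media, in seconds
    end_s: float
    label: str          # what was detected or matched
    evidence_url: str   # playable link an agent (or human) can verify

@runtime_checkable
class Perception(Protocol):
    def ingest(self, url: str) -> str: ...                        # returns a source_id
    def search(self, query: str) -> Iterable[Moment]: ...         # indexed moments
    def subscribe(self, source_id: str) -> Iterable[Moment]: ...  # live events
```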
Try it quickly:
npx skills add video-db/skills
Then inside your agent: /videodb setup
A few prompts to test:
1. “Upload this URL and give me a playable stream link”
2. “Search this folder for scenes with <keyword> and return clips”
3. “Capture my screen for 2 minutes and give me a structured summary”
4. “Monitor this RTSP feed and log events with timestamps”
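For prompt 4, a structured, timestamped log is what makes the output useful downstream. A sketch of what one logged event could look like; this schema is an assumption, not VideoDB's actual format.

```python
import json
from dataclasses import asdict, dataclass

# Hypothetical shape of one event from "monitor this RTSP feed and log
# events with timestamps". An illustrative assumption, not VideoDB's schema.

@dataclass
class FeedEvent:
    ts: str        # wall-clock timestamp, ISO 8601
    feed: str      # RTSP source URL
    event: str     # what was detected
    clip_url: str  # playable evidence link for the moment

e = FeedEvent("2025-01-01T12:00:00Z", "rtsp://cam1/stream",
              "person_entered", "https://example.com/clip/abc")
print(json.dumps(asdict(e)))
```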
What I’m looking for from HN:
1. Does this feel like the right abstraction layer for perception in agent stacks?
2. What would you consider the minimum viable “perception API”?
3. Where do you think this fails in the real world: latency, cost, privacy, reliability?
If you try it and it breaks, tell me the agent, OS, and the error output. I’ll fix it.