This started from a pretty personal use case.
There was this very technical person I follow who would go live on YouTube from time to time. He has a ton of experience, and would casually drop really good insights about software architecture, engineering tradeoffs, and just general "you only learn this after years" kind of stuff. He also posts shorter clips, but I wanted something else: I wanted that knowledge to be always there, queryable whenever I needed it.
At the same time, I was also trying to understand what RAG (retrieval-augmented generation) actually is in practice, and how to learn applied AI by building something real instead of just reading about it.
My first thought was: ok, this probably has to be fully local. I assumed that if I wanted to query my own stuff locally, I needed a local LLM. So I looked into Ollama and thought, alright, I can build this on top of that and just query everything on my machine. At that point I also had some pretty wrong assumptions about local models and resource usage.
The first version worked, but the result felt a bit underwhelming. Retrieval itself was useful, but the final answer didn't feel as smart as I expected. I use Codex and Claude Code a lot in my daily workflow, so maybe I was unfairly expecting something that felt more "intelligent", or at least looked that way.
Then, after a lot of tests with agents (Codex and Claude Code, I use both), I realized something kind of obvious: the Ollama part was mostly just taking the retrieved chunks and turning them into a proper answer. And if that's the job, why couldn't an agent do the same thing?
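To make that concrete, here's a minimal sketch of the retrieve-then-generate loop as I ended up thinking about it. Everything here is illustrative, not the project's actual code: the word-overlap scorer stands in for real embedding similarity, and the example chunks are made up. The point is that the last step is just prompt assembly, and that's the part the LLM (or an agent) takes over.

```python
# Minimal sketch of retrieve-then-generate. The scoring function is a
# deliberately crude stand-in: a real setup would embed the query and
# chunks and rank by vector similarity instead of word overlap.

def score(query: str, chunk: str) -> int:
    """Crude relevance score: count shared lowercase words."""
    return len(set(query.lower().split()) & set(chunk.lower().split()))

def retrieve(query: str, chunks: list[str], k: int = 2) -> list[str]:
    """Return the k chunks most relevant to the query."""
    return sorted(chunks, key=lambda c: score(query, c), reverse=True)[:k]

def build_prompt(query: str, chunks: list[str]) -> str:
    """Assemble the prompt that the LLM turns into a final answer."""
    context = "\n---\n".join(retrieve(query, chunks))
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"

# Hypothetical chunks from an indexed transcript.
transcript_chunks = [
    "Prefer boring technology; most outages come from unproven tools.",
    "Cache invalidation is where most architecture diagrams fall apart.",
    "Ship the walking skeleton first, then harden each component.",
]
print(build_prompt("what architecture advice about caching?", transcript_chunks))
```

Once you see it this way, the generation step isn't tied to any particular model: whatever consumes `build_prompt`'s output does the "smart" part.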
So I tried wiring it through MCP.
That was the moment the project really clicked for me. The answers became way better structured, the whole thing felt much smarter, and more importantly, it fit directly into how I already work. Instead of having a separate tool where I go ask questions, the knowledge just becomes available inside the agent workflow itself. The agent can retrieve it, use it, suggest things, and continue the task.
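The MCP wiring can be sketched roughly like this, assuming the official MCP Python SDK (its `FastMCP` helper). The tool name, the in-memory chunk list, and the overlap scoring are all hypothetical stand-ins; the import is guarded so the retrieval logic also runs without the SDK installed.

```python
# Sketch of exposing retrieval as an MCP tool, so an agent (Codex,
# Claude Code) can call it mid-task. Assumes the official MCP Python
# SDK; the import is guarded so the search logic works without it too.
try:
    from mcp.server.fastmcp import FastMCP
except ImportError:  # SDK not installed: keep the retrieval part usable
    FastMCP = None

KNOWLEDGE = [  # stand-in for the indexed transcript chunks
    "Prefer boring technology; most outages come from unproven tools.",
    "Cache invalidation is where most architecture diagrams fall apart.",
]

def search_knowledge(query: str, k: int = 2) -> list[str]:
    """Rank stored chunks by naive word overlap (real code: embeddings)."""
    words = set(query.lower().split())
    return sorted(KNOWLEDGE,
                  key=lambda c: len(words & set(c.lower().split())),
                  reverse=True)[:k]

if FastMCP is not None:
    server = FastMCP("personal-knowledge")  # hypothetical server name

    @server.tool()
    def search(query: str) -> list[str]:
        """Search my indexed videos, podcasts, and notes."""
        return search_knowledge(query)

    # server.run()  # launched by the agent's MCP client config (stdio)
```

The nice property is that the server only does retrieval; structuring the answer, and deciding when to even call the tool, is left entirely to the agent.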
That was exactly what I wanted, maybe even better than what I had in mind when I started.
The best part for me is that once it's set up, it kinda disappears. I just keep adding YouTube videos, podcasts, and files, and then that context is available while I'm working with AI agents. It stops feeling like "a RAG demo" and starts feeling like part of the actual workflow.
What started as a small local RAG experiment ended up turning into something much more useful than I originally imagined.