Just a little vibe-coded CLI file search tool I whipped up for my own use. It uses ffmpeg, libreoffice, and mupdf to break down a multitude of file formats, then feeds the extracted content to YAMNet for audio classification, Whisper for transcription and translation, and Qwen 2.5 Omni 7B for music classification. Jina Embeddings V4 then produces the embeddings used for search.
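For anyone curious what the last step might look like, here is a rough sketch of searching over embeddings. It assumes cosine similarity and a plain in-memory dict of path to vector; the post doesn't say how the tool actually stores or compares vectors, and the names `cosine`, `search`, and `index` are made up for illustration. The toy vectors stand in for real model output.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)

def search(query_vec, index, top_k=5):
    """Return up to top_k (score, path) pairs, best match first.

    `index` is a hypothetical dict mapping file path -> embedding vector.
    """
    scored = [(cosine(query_vec, vec), path) for path, vec in index.items()]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:top_k]

# Toy 2-dimensional vectors; real embeddings would come from the embedding model.
index = {
    "notes.pdf": [1.0, 0.0],
    "song.mp3": [0.0, 1.0],
    "report.docx": [1.0, 1.0],
}
results = search([1.0, 0.0], index, top_k=2)
for score, path in results:
    print(f"{score:.3f}  {path}")
```

A linear scan like this is fine for a personal collection; a larger index would probably want an approximate nearest-neighbor structure instead.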
timschmidt•3h ago
I hope someone else finds it useful.