A fully local Retrieval-Augmented Generation (RAG) implementation for querying 25 years of Swiss Teletext news (~500k articles in German language) — no APIs, no data leaving your machine.
Why? I thought it's a cool type of dataset (short/high density news summaries) to test some local RAG approaches.