While tinkering with RAG ideas I've thoroughly processed the entire MDN Web Docs original content, pre-ingested it into LanceDB, uploaded the 50k+ rows dataset (https://huggingface.co/datasets/deepsweet/mdn) to HuggingFace, and published a RAG-MCP server (https://github.com/deepsweet/mdn) ready for semantic search with hybrid vector (1024-d) and full‑text (BM25) retrieval.
A screenshot is worth a thousand words: https://raw.githubusercontent.com/deepsweet/mdn/main/example...