I built askdocs-mcp to give agents a more direct route to searching a project's source-of-truth documents. My design constraints: it must run 100% locally (some of my manuals are under NDA), start up fast, and let me experiment with different embedding and language models. It was built with ollama in mind, but if you can't run models locally, it works with any OpenAI-compatible endpoint.
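As an aside on the endpoint flexibility: ollama serves an OpenAI-compatible API under `/v1`, so the same client code can talk to a local model or a hosted one. Here's a minimal sketch of that idea in Python; the model name and prompt are placeholders, and this is not askdocs-mcp's actual client code:

```python
from openai import OpenAI

# Point at a local ollama server; swap base_url/api_key for a hosted
# OpenAI-compatible endpoint and nothing else has to change.
client = OpenAI(base_url="http://localhost:11434/v1", api_key="unused")

resp = client.chat.completions.create(
    model="llama3.1",  # placeholder: any chat model the server exposes
    messages=[{"role": "user", "content": "What does page 12 of the manual say?"}],
)
print(resp.choices[0].message.content)
```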
Features:
- Incrementally builds and caches the set of docs. The initial startup can take a while as PDFs are chunked and run through an embedding model, but after that, startup is near-instant.
- Uses the filesystem as the database (there's a sketch of this scheme after the list) - you only need `ollama` running somewhere so the tool can access an embedding model and a language model.
- Provides an `ask_docs` tool that returns natural-language answers about what the documentation says, annotated with the page numbers the information came from. Those page numbers can be passed to the `get_doc_page` tool to retrieve the full page if the agent needs additional context.
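To make the "filesystem as the database" idea concrete, here's a hedged sketch of how an incremental, file-backed embedding cache can work. The cache layout, file names, and the `chunk_pdf`/`embed` helpers are illustrative assumptions, not askdocs-mcp's internals:

```python
import hashlib
import json
from pathlib import Path

import numpy as np

CACHE = Path(".askdocs-cache")  # assumed cache directory name

def cache_key(pdf: Path) -> str:
    # Hash the file bytes so an edited PDF invalidates only its own entry.
    return hashlib.sha256(pdf.read_bytes()).hexdigest()[:16]

def load_or_embed(pdf: Path, chunk_pdf, embed):
    """Return (chunks, vectors), computing and caching them on first run."""
    stem = CACHE / cache_key(pdf)
    chunks_file = stem.with_suffix(".json")
    vecs_file = stem.with_suffix(".npy")
    if chunks_file.exists() and vecs_file.exists():
        # Cache hit: startup cost is just reading two files back.
        return json.loads(chunks_file.read_text()), np.load(vecs_file)
    # Cache miss: chunk and embed once, then persist as plain files.
    chunks = chunk_pdf(pdf)  # e.g. [{"text": ..., "page": ...}, ...]
    vecs = np.array([embed(c["text"]) for c in chunks])
    CACHE.mkdir(exist_ok=True)
    chunks_file.write_text(json.dumps(chunks))
    np.save(vecs_file, vecs)
    return chunks, vecs
```

Answering a question then reduces to a similarity search over the cached vectors plus one language-model call, which is why restarts after the first run are cheap.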
Because I'm providing the exact set of documents that apply to my project, I see fewer hallucinations and less rabbit-hole chasing. The agent isn't relying (as much) on its latent space to answer questions, and it avoids reaching for a web search tool that might turn up subtly different part numbers or protocol versions. It saves precious context as well, because the parent agent gets a concise answer to what it's looking for, instead of doing the "searching" itself by loading large chunks of the document into its context window.

I'm sure there are improvements that can be made, e.g. to the document chunking or the "system prompt" the tool gives to the language model. I'd love to hear your feedback, especially if you find this useful. Thanks!