We’re building an MCP that lets you chat with long PDFs (e.g. textbooks with hundreds of pages) on platforms like Claude, Cursor, and other AI agents — without worrying about context limits.
Comments
LoMoGan•4mo ago
Interesting, is this based on an external Vector DB to store and process the PDF?
mingtianzhang•4mo ago
Thanks for the great question! We actually use a reasoning-based, vectorless approach. In short, it follows this process:
1. Generate a table of contents (ToC) for the document.
2. Read the ToC to select a relevant section.
3. Extract relevant information from the selected section.
4. If enough information has been gathered, provide the answer; otherwise, return to step 2.
We believe this approach closely mimics how a human would navigate and read long PDFs.
LoMoGan•4mo ago
mingtianzhang•4mo ago
LoMoGan•4mo ago
mingtianzhang•4mo ago