Most extractors are either fast but lose structure (markitdown, pymupdf4llm) or accurate but slow (docling). Ours ties with docling on accuracy but is orders of magnitude faster.
https://github.com/pspdfkit/pdf-to-markdown
We'd love feedback on it, and ofc send us files that break it.