Now that llama.cpp has vision support, I tried out PDFs with it locally (via LM Studio), but the results weren't as good as I'd hoped. One time it insisted it couldn't do "OCR", but then gave me an example of what the data _could_ look like - which was the actual data.
The other major problem is that some PDFs are actually made up of images, and it got super confused by those as well.
Given this is so new, I'm struggling to find any tools that make this easier.
raymond_goo•5h ago