Over the weekend, I built a separate version that does exactly that:
Point it to a local file (PDF, DOCX, JPG, TXT)
Describe the dataset you want
It extracts text → finds relevant parts via semantic search → applies your instructions through a generated schema → outputs a clean dataset.
roscas•1h ago
If yes, amazing, I might use it.
If no, thanks but I won't use it because it makes no sense to send your PDF/DOC to an online service to be used to feed their AI models.