Standard OCR + Regex was too brittle. So I built Scanny AI.
It listens for Drive webhooks, uses a vision model to extract keys (like "Total Value") regardless of layout, enforces a strict JSON schema, and patches the HubSpot API.
It handles about 5k pages/hour.
Docs and API keys: scanny-ai.com