You can upload multiple photos or a single video of cards. For videos, it extracts frames in the browser, lets you select/deselect frames, then uses Google Gemini (schema-constrained JSON) to extract structured fields (name/title/company/email/phone/website/address). Results are editable, deduplicated, and exportable to Excel, CSV, and vCard (.vcf).
Repo: https://github.com/vassiliylakhonin/bizcard-ai-scanner
Notes:
Runs locally with Vite/React/TS. There’s an optional minimal Node backend proxy so you don’t have to ship an API key to the browser if you deploy it.