frontpage.

A friend of mine was struggling with manually extracting data from PDFs, so I built a REST API to automate it.

Main features: - Text extraction (full document or specific pages) - Table extraction → JSON with headers/rows - Invoice parsing → vendor, amounts, line items, tax (auto language detection) - Resume parsing → contact info, skills, work experience, education - Metadata, links, embedded images extraction

API endpoint: https://pdfpull-895295000838.europe-west1.run.app/docs Playground: https://bnacar.dev/pdfpull-landing/playground.html

Tech: FastAPI, PyMuPDF, pdfplumber. Rule-based extraction (no LLM API calls).

The invoice/resume parsers detect language automatically (EN/DE/TR) and extract fields without per-template configuration.

Demo key for testing: sk_demo_123456789

Example request:

    curl -X POST "https://pdfpull-895295000838.europe-west1.run.app/api/v1/parse/invoice" \
      -H "X-API-Key: sk_demo_123456789" \
      -F "file=@invoice.pdf"

Returns structured JSON:

    {
      "vendor_name": "ACME Corp",
      "invoice_number": "INV-2024-001", 
      "total_amount": 1250.00,
      "currency": "USD",
      "line_items": [...],
      "confidence": 0.92
    }

Free to try (100 requests with demo key). Looking for feedback on the API design and what document types to add next.

Satellites Have a Lot of Room

1980s Farm Crisis

Show HN: FSID - Identifier for files and directories (like ISBN for Books)

Show HN: Holy Grail: Open-Source Autonomous Development Agent

Show HN: Minecraft Creeper meets 90s Tamagotchi

Show HN: Termiteam – Control center for multiple AI agent terminals

The only U.S. particle collider shuts down

Ask HN: Why do purchased B2B email lists still have such poor deliverability?

Show HN: Remotion directory (videos and prompts)

Portable C Compiler

Show HN: Kokki – A "Dual-Core" System Prompt to Reduce LLM Hallucinations

Software Engineering Transformation 2026

Microsoft purges Win11 printer drivers, devices on borrowed time

Lunch with the FT: Tarek Mansour

Old Mexico and her lost provinces (1883)

'AI' is a dick move, redux

The source code was the moat. But not anymore

Does anyone else feel like their inbox has become their job?

An AI model that can read and diagnose a brain MRI in seconds

Dev with 5 of experience switched to Rails, what should I be careful about?

AlphaFace: High Fidelity and Real-Time Face Swapper Robust to Facial Pose

Scientists discover “levitating” time crystals that you can hold in your hand

Rammstein – Deutschland (C64 Cover, Real SID, 8-bit – 2019) [video]

Tell HN: Yet Another Round of Zendesk Spam

Postgres Message Queue (PGMQ)

Show HN: Django-rclone: Database and media backups for Django, powered by rclone

NY lawmakers proposed statewide data center moratorium

OpenClaw AI chatbots are running amok – these scientists are listening in

Show HN: AI agent forgets user preferences every session. This fixes it

Introduce the Vouch/Denouncement Contribution Model