frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Filling PDF forms with AI using client-side tool calling

https://copilot.simplepdf.com/?share=a7d00ad073c75a75d493228e6ff7b11eb3f2d945b6175913e87898ec96ca8076&form=w9&lang=en
4•nip•1h ago
Hey HN!

I built SimplePDF Copilot: an AI assistant that can interact with the PDF editor. It fills fields, answers questions, focuses on a specific field, adds fields, deletes pages, and so on.

It's built on top of SimplePDF that I started 7 years ago, pioneering privacy-respecting client-side pdf editing, now used monthly by 200k+ people.

As for the privacy model: the PDF itself never leaves the browser. Parsing, rendering, and field detection all run client-side.

The text the model needs (and your messages) goes to whatever LLM you point at. By default that's our demo proxy (DeepSeek V4 Flash, rate-capped), but you can BYOK and point it at any cloud provider, or go fully local (I've been testing with LM Studio).

Unlike the existing "Chat with PDF" tools that only retrieve the text/OCR layer, Copilot can act on the PDF: filling fields, adding fields (detected client-side using CommonForms by Joe Barrow [1], jbarrow on HN with some post-processing heuristics I added on top), focusing on fields, deleting pages, and so on.

I built this because SimplePDF is mostly used by healthcare customers where document privacy is paramount, and I wanted an AI experience that didn't require shipping PII to a third party. Stack is pretty standard:

- Tanstack Start

- AI SDK from Vercel

- Tailwind (I personally prefer CSS modules, I'm old-school but the goal since I open source it, I figured that Tailwind would be a better fit)

The more interesting part is the client-side tool calling: events are passed back and forth via iframe postMessage.

If you're not familiar with "tool calling" and "client-side tool calling", a quick primer:

Tool calling is what LLMs use to take actions. When Claude runs grep or ls, or hits an MCP server, those are tool calls.

Client-side tool calling means the intent to call a tool comes from the LLM, but the execution happens in the browser.

That matters for: speed, you can't go faster than client-to-client operations and also gives you the ability to limit the data you expose to the LLM. For the demo I do feed the content of the document to the LLM, but that connection could be severed as simply as removing the tool that exposes the content data.

The demo is fully open source, available on Github [2] and the demo is the same as the link of this post [3]

What's not open source is SimplePDF itself (loaded as the iframe).

I could talk on and on about this, let me know if you have any questions, anything goes!

[1] https://github.com/jbarrow/commonforms

[2] https://github.com/SimplePDF/simplepdf-embed/tree/main/copil...

[3] https://copilot.simplepdf.com/?share=a7d00ad073c75a75d493228...

Comments

nip•1h ago
Just to be clear, this is a technical demo showing what's possible with client-side tool calling + local models: LLM-assisted form filling where no document data has to leave the user's machine.

Use cases range from:

- Filling foreign-language forms

- Navigating a contract before signing: "can I trust ALL the clauses here?"

- Pre-filling repetitive forms from existing data sources (CRM, EHR, etc. via MCP/RAG)

Copilot is designed to be embedded; our customers ship it white-labeled inside their own products.

Leading journal finds AI is flooding academic publishing with lower quality work

https://phys.org/news/2026-05-journal-ai-academic-publishing-quality.html
1•pseudolus•1m ago•0 comments

Why TUIs are making a comeback

https://wiki.alcidesfonseca.com/blog/why-tuis-are-back/
1•alcidesfonseca•3m ago•0 comments

ICANN opens applications for new top-level domains for the first time since 2012

https://www.theregister.com/2026/05/01/icann_new_gtld_applications/
1•thunderbong•4m ago•0 comments

Inverse Sapir-Whorf and programming languages

https://lukeplant.me.uk/blog/posts/inverse-sapir-whorf-and-programming-languages/
2•birdculture•7m ago•0 comments

The AI Race Is Charged by the Fear of Being Left Behind

https://thewalrus.ca/the-ai-race-is-charged-by-the-fear-of-being-left-behind/
1•pseudolus•13m ago•0 comments

"Gazump"

https://notoneoffbritishisms.com/2026/05/01/gazump/
1•jjgreen•17m ago•0 comments

We migrated 100 services from Nginx to Envoy in one month

https://www.qovery.com/blog/alan-from-nginx-to-envoy-what-actually-happens-when-you-swap-your-pro...
1•ev0xmusic•17m ago•0 comments

Bep/gallerydeluxe: Fast Hugo gallery theme/module suitable for lots of images

https://github.com/bep/gallerydeluxe
1•Tomte•23m ago•0 comments

PgAdmin: The Most Popular PostgreSQL Admin Tool

https://www.pgadmin.org/
1•doener•24m ago•0 comments

Some Notes on AI

https://www.math.columbia.edu/~woit/wordpress/?p=15672
1•jjgreen•28m ago•0 comments

RAG isn't memory. It's Ctrl+F with embeddings

https://medium.com/@vbcherepanov/rag-isnt-memory-it-s-ctrl-f-with-embeddings-c461b90ac7b1
2•vbcherepanov•30m ago•0 comments

•31m ago

How GitHub lost its way

https://substack.com/@usiddique09/p-196195940
2•usmansidd•31m ago•0 comments

clang-format configurator v2

https://clang-format-configurator.site/
1•gjvc•34m ago•0 comments

Apple just gave a clue that a big AI acquisition may be in the cards

https://www.marketwatch.com/story/apple-just-gave-a-subtle-clue-that-a-splashy-ai-acquisition-may...
2•dalvrosa•35m ago•0 comments

First Nations students are teaching themselves

https://www.cbc.ca/news/canada/edmonton/frog-lake-cree-language-app-9.7185348
2•01-_-•35m ago•0 comments

Convicted former Harvard scientist rebuilds brain computer lab in China

https://www.reuters.com/world/china/convicted-former-harvard-scientist-rebuilds-brain-computer-la...
3•01-_-•36m ago•0 comments

Looking for Employers for the job fair and hiring with Meeting C++

https://www.meetingcpp.com/meetingcpp/news/items/Looking-for-Employers-for-the-job-fair-and-hirin...
1•dalvrosa•37m ago•0 comments

Gall's Law – Yagnipedia

https://yagnipedia.com/wiki/galls-law
3•ankitg12•41m ago•0 comments

Neomd: A minimal terminal email client for people who write in Markdown

https://neomd.ssp.sh/
1•handfuloflight•42m ago•0 comments

The Discord migration that didn't happen

https://productimpossible.com/articles/discord-migration-that-didnt-happen/
2•sebakubisz•48m ago•0 comments

Show HN: Autorank – Rank on Google and AI search while you sleep

https://www.getautorank.ai/
1•alokjnv10•48m ago•0 comments

How fast is a macOS VM, and how small could it be?

https://eclecticlight.co/2026/05/02/how-fast-is-a-macos-vm-and-how-small-could-it-be/
11•moosia•49m ago•0 comments

ZenBusiness Data Breach

https://haveibeenpwned.com/Breach/ZenBusiness
1•amazonhut•51m ago•0 comments

How Casey Newton is revamping his newsletter to compete with AI

https://www.niemanlab.org/2026/04/more-scoops-less-aggregation-and-analysis-how-casey-newton-is-r...
1•giuliomagnifico•51m ago•0 comments

US to Withdraw Troops from Germany

https://www.dw.com/en/us-to-withdraw-thousands-of-troops-from-germany/a-77016071
2•pera•55m ago•0 comments

Dazzle Camouflage

https://en.wikipedia.org/wiki/Dazzle_camouflage
2•tosh•56m ago•0 comments

AMD Posts HDMI 2.1 FRL Patches for Their Amdgpu Linux Driver

https://www.phoronix.com/news/AMDGPU-HDMI-2.1-FRL-Patches
2•type0•58m ago•0 comments

Andrej Karpathy: From Vibe Coding to Agentic Engineering

https://www.youtube.com/watch?v=96jN2OCOfLs
3•swolpers•1h ago•0 comments

Study: AI models that consider user's feeling are more likely to make errors

https://arstechnica.com/ai/2026/05/study-ai-models-that-consider-users-feeling-are-more-likely-to...
1•rbanffy•1h ago•0 comments