Show HN: Browser-based PDF form fields detection (YOLO-based)

https://commonforms.simplepdf.com/
15•nip•6h ago
Hey HN!

Last week, Joe Barrow released CommonForms [1], a set of open models for automatically detecting form fields in PDFs.

He trained two models, FFDNet-S and FFDNet-L, on a dataset of 55k documents. You can read more about his approach in the arXiv paper [2].

As someone who's been searching for reliable models to auto-detect form fields (one of the last hard problems in PDF form filling), I was seriously impressed by the quality of these models. I wanted to give them the attention and distribution they deserve, so I created a fully browser-based implementation that handles both detection and field addition.

My implementation relies on his models and ONNX Runtime Web, plus some post-processing. I plan to publish a small browser library that encapsulates it in the coming days, to make it easier to deploy anywhere (currently you'd have to fork or copy my code).
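The post does not say what the post-processing step consists of. For YOLO-style detectors it commonly means dropping low-confidence boxes and then applying non-maximum suppression (NMS) so that each form field ends up with a single box. The sketch below illustrates that general technique only. The `Detection` shape, the class names, and the threshold defaults are assumptions made for illustration, not taken from CommonForms or from the implementation described here.

```typescript
// Hypothetical detection record; the real model output layout may differ.
interface Detection {
  x1: number; y1: number; x2: number; y2: number; // box corners, in pixels
  score: number; // model confidence in [0, 1]
  label: string; // illustrative class name, e.g. "text" or "checkbox"
}

// Intersection-over-union of two axis-aligned boxes.
function iou(a: Detection, b: Detection): number {
  const w = Math.max(0, Math.min(a.x2, b.x2) - Math.max(a.x1, b.x1));
  const h = Math.max(0, Math.min(a.y2, b.y2) - Math.max(a.y1, b.y1));
  const inter = w * h;
  const areaA = (a.x2 - a.x1) * (a.y2 - a.y1);
  const areaB = (b.x2 - b.x1) * (b.y2 - b.y1);
  const union = areaA + areaB - inter;
  return union > 0 ? inter / union : 0;
}

// Greedy per-class NMS: keep the highest-scoring box, then discard any
// same-class box that overlaps an already kept box too much.
// Threshold defaults are assumed values, not tuned for any real model.
function nonMaxSuppression(
  detections: Detection[],
  scoreThreshold = 0.3,
  iouThreshold = 0.5,
): Detection[] {
  const candidates = detections
    .filter((d) => d.score >= scoreThreshold)
    .sort((a, b) => b.score - a.score);
  const kept: Detection[] = [];
  for (const c of candidates) {
    const duplicate = kept.some(
      (k) => k.label === c.label && iou(k, c) > iouThreshold,
    );
    if (!duplicate) kept.push(c);
  }
  return kept;
}
```

In a browser pipeline, a function like this would run on boxes decoded from the model's raw output tensor, before the surviving boxes are mapped back to PDF page coordinates and turned into form fields.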

Happy to answer any questions about the browser-based implementation!

Questions about the models themselves should be directed to Joe, who I believe is also on HN [3].

[1] https://github.com/jbarrow/commonforms
[2] https://arxiv.org/abs/2509.16506
[3] https://news.ycombinator.com/user?id=jbarrow

Comments

jbarrow•6h ago
Hey, Benjamin, thanks for the attribution! Happy to field any questions HN users have.

It's really gratifying to see people building on the work, and I love that it's possible to do browser-side/on-device.

Novo Nordisk's Canadian Mistake

https://www.science.org/content/blog-post/novo-nordisk-s-canadian-mistake
151•jbm•2h ago•62 comments

Original C64 Lode Runner Source Code

https://github.com/Piddewitt/Loderunner
17•indigodaddy•56m ago•6 comments

Doing well in your courses: Andrej's advice for success (2013)

https://cs.stanford.edu/people/karpathy/advice.html
318•peterkshultz•6h ago•116 comments

Dosbian: Boot to DOSBox on Raspberry Pi

https://cmaiolino.wordpress.com/dosbian/
82•indigodaddy•3h ago•31 comments

Duke Nukem: Zero Hour N64 ROM Reverse-Engineering Project Hits 100%

https://github.com/Gillou68310/DukeNukemZeroHour
34•birdculture•2h ago•11 comments

Airliner hit by possible space debris

https://avbrief.com/united-max-hit-by-falling-object-at-36000-feet/
135•d_silin•5h ago•65 comments

Deterministic multithreading is hard (2024)

https://www.factorio.com/blog/post/fff-415
24•adtac•13h ago•2 comments

Compare Single Board Computers

https://sbc.compare/
93•todsacerdoti•5h ago•39 comments

GNU Octave Meets JupyterLite: Compute Anywhere, Anytime

https://blog.jupyter.org/gnu-octave-meets-jupyterlite-compute-anywhere-anytime-8b033afbbcdc
95•bauta-steen•7h ago•16 comments

The working-class hero of Bletchley Park you didn't see in the movies

https://www.theguardian.com/world/2025/oct/12/move-over-alan-turing-meet-the-working-class-hero-o...
68•hansmayer•1w ago•17 comments

The Spilhaus Projection: A world map according to fish

https://southernwoodenboatsailing.com/news/the-spilhaus-projection-a-world-map-according-to-fish
79•zynovex•1w ago•10 comments

Comparing the power consumption of a 30 year old refrigerator to a new one

https://ounapuu.ee/posts/2025/10/14/fridge-power-consumption/
86•furkansahin•5d ago•124 comments

Could the XZ backdoor have been detected with better Git/Deb packaging practices?

https://optimizedbyotto.com/post/xz-backdoor-debian-git-detection/
51•ottoke•5h ago•42 comments

The Trinary Dream Endures

https://www.robinsloan.com/lab/trinary-dream/
34•FromTheArchives•6h ago•47 comments

The Cancer Imaging Archive (TCIA)

https://www.cancerimagingarchive.net/
9•1970-01-01•6d ago•0 comments

Abandoned land drives dangerous heat in Houston, study finds

https://stories.tamu.edu/news/2025/10/07/abandoned-land-drives-dangerous-heat-in-houston-texas-am...
111•PaulHoule•9h ago•115 comments

How to Assemble an Electric Heating Element from Scratch

https://solar.lowtechmagazine.com/2025/10/how-to-build-an-electric-heating-element-from-scratch/
80•surprisetalk•9h ago•50 comments

Infisical (YC W23) Is Hiring Full Stack Engineers

https://www.ycombinator.com/companies/infisical/jobs/0gY2Da1-full-stack-engineer-global
1•vmatsiiako•6h ago

Show HN: Duck-UI – Browser-Based SQL IDE for DuckDB

https://demo.duckui.com
172•caioricciuti•11h ago•54 comments

The macOS LC_COLLATE hunt: Or why does sort order differently on macOS and Linux (2020)

https://blog.zhimingwang.org/macos-lc_collate-hunt
67•g0xA52A2A•10h ago•15 comments

Enchanting Imposters

https://daily.jstor.org/enchanting-imposters/
3•Petiver•2d ago•0 comments

Show HN: Pyversity – Fast Result Diversification for Retrieval and RAG

https://github.com/Pringled/pyversity
60•Tananon•8h ago•6 comments

Ask HN: What are people doing to get off of VMware?

98•jwithington•5h ago•71 comments

The case for the return of fine-tuning

https://welovesota.com/article/the-case-for-the-return-of-fine-tuning
125•nanark•13h ago•69 comments

Redis Backplane for Hubots

https://github.com/hubot-friends/hubot-redis-backplane
6•gijoeyguerra•5d ago•3 comments

Designing EventQL, an Event Query Language

https://docs.eventsourcingdb.io/blog/2025/10/20/designing-eventql-an-event-query-language/
7•goloroden•2h ago•1 comment

Scheme Reports at Fifty

https://crumbles.blog/posts/2025-10-18-scheme-reports-at-fifty.html
44•djwatson24•8h ago•17 comments

Improving PixelMelt's Kindle Web Deobfuscator

https://shkspr.mobi/blog/2025/10/improving-pixelmelts-kindle-web-deobfuscator/
85•ColinWright•10h ago•14 comments

Show HN: Open-Source Voice AI Badge Powered by ESP32+WebRTC

https://github.com/VapiAI/vapicon-2025-hardware-workshop
38•Sean-Der•1w ago•4 comments

Xubuntu.org Might Be Compromised

https://old.reddit.com/r/Ubuntu/comments/1oa4549/xubuntuorg_might_be_compromised/
294•kekqqq•8h ago•125 comments