frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

ClawPDF – Open-Source Virtual/Network PDF Printer with OCR and Image Support

https://github.com/clawsoftware/clawPDF
192•miles•10mo ago

Comments

criddell•10mo ago
Why use Tesseract for this? Windows' built-in OCR is so much better in my experience.
Oras•10mo ago
Yeah, tesseract has lots of issues especially identifying tables
skeeter2020•10mo ago
I suspect because of the vintage of this project. This is built on .net Framework 4.x, hence windows only.

edit: and goes deep into COM for device interfaces. Wow! blast from the past.

wolfi1•10mo ago
.Net Framework is mostly a wrapper for COM
PeterStuer•10mo ago
That's a bit of a streach. Yes, .Net was MS's next gen of component tech following (D)COM, but it grew way past that from the start.
jeroenhd•10mo ago
Microsoft's OCR engine supports Windows 10.0.10240.0 and up. This project intends to support Windows 7 and up.

In theory you could maintain code paths for both, offering a slimmer package for Windows 10+, but that'd also cost more time and effort to maintain.

Also, not many people know Windows comes with an OCR API. It's extremely underused in my opinion.

atmanactive•10mo ago
Windows OCR is used by PowerToys.

https://github.com/microsoft/PowerToys

hoistbypetard•10mo ago
That looks really useful.

But, also, wow! Windows-only and AGPLv3 is not a combination I think I've ever seen before.

sirjaz•10mo ago
We need more things like this. I know people don't like Windows Server because it is not open source, but it is simple to use and get up and running. Also, user management is easy.
yndoendo•10mo ago
I don't like Microsoft products, such as Windows, because I used them through out the years and find all the edge cases where they don't hold up. Windows OS is too fragile with their kludge of internal designs. Corrupt registry or WMI repository bakes systems with ease. This has nothing to do with Open Source.

OSes that use plain text configuration files are easy to resurrect. Windows is fixed with reinstall the OS. Linux and BSD are fixed with editing a config file or reinstalling a single corrupt application / library.

Example of bad versus good design is DirectX shader compilation. Windows can only perform this while the game is running. Linux with WINE can perform this without the game running. Windows will have bad FPS during the first run / scene with many games because of this.

PS. Windows print system is really bad in the industrial environment because they do not follow label markup language stands. Number of label DSLs have a print quantity setting to save memory. Want 1000 copies printed, one print job with print quantity set to 1000. Windows spools up 1000 copies of the label and sends each to the printer. This eats up the memory on printers in no time. It also brakes the ability to clear the print queue just on the printer. Extra steps require the Windows print job to be canceled and they the printer's queue to be cleared. Otherwise The printer will receive the next 990 of the 1000 print job.

Tika2234•9mo ago
Short answer is you not familiar with Windows but quite good with Linux. Hence the "not like" part. Plenty of Windows developers I know (that is way more than Linux developers statiscally) love Windows. The apps they designed and built simply way better or even non existent on Linux. The same reason too for them, they don't know Linux and near God-level tier with Windows from MFC to assembbly.
yndoendo•9mo ago
Assumptions .... I was an IT/Network Consultant for a number of years before going to product development. Started with DOS on 5 1/2 dual floppy and then Win 3.1.

Example of bad API designs by Microsoft that gets pushed into production is `GetPrivateProfileString`[0]. This function returns a single key value from an INI file. This function will 1) Open the file, 2) Search the file for the Key, 3) Close the file. A better design would be to abstract the file so it is only open and closed once versus how many key values must be read from an INI file. It is like reading one BYTE of an IC at a time instead of batching the process.

NTFS cannot even free master file table space. Creating a lot of small files make it expand and never shrink.

Windows does not properly handle STDIN and STDOUT. Because of DOS being an applications versus a SHELL a person must compile an application as a GUI or CMD flagged, that is also bad design because a command line application must be re-design and re-complied as a GUI to hide the DOS console from showing when it runs and brakes all STDIN and STDOUT logging methods.

Microsoft still does not have proper offline updating. For some reason they falsely believe that everyone connects their computer to the Internet. Lot of air-gap machines in the automation industry. Big reason to move product host OS to BSD or Linux.

It is not fun trying to fix a corrupt registry or WMI repository. Even Microsoft sent out a Windows update to stop auto-backup of the registry because their low-end Surface laptops didn't have the hard-drive space to store them.

[0] https://learn.microsoft.com/en-us/windows/win32/api/winbase/...

sowbug•10mo ago
OT: someone please make a RPi image that "prints" a page to an eink display. I want to duct-tape an RPi Zero and a rechargeable battery to the back of a display, then be able to print recipes to it while cooking. Other people might print board-game rules or speech notes while rehearsing -- anything that you'd typically print and then throw away after brief usage.

I know I could make a PDF, sideload it to a Kindle, etc. Too many steps. I just want the display to appear as a printer on my phone.

IlikeKitties•10mo ago
Sounds pretty vibe codable, why don't you try it yourself?
xrendan•10mo ago
I have some really old code that pretty much does this, I'll see if I can find it.
xrendan•10mo ago
Ugh, I don't have it. It was from before I used git.

Basically to do this you have a cups server that exposes itself as a network printer that prints to a specified PDF directory and then you have a program watching that directory for new files and if there's a new one it opens up whatever pdf viewer you want in full screen.

Setup a shared pdf printer: https://askubuntu.com/questions/1310867/how-to-set-up-shared...

navane•10mo ago
I always wanted to tackle this use case with receipt printers, those thermal narrow paper rol ones. But those things are freaking expensive!
colechristensen•10mo ago
Restaurants are going out of business all the time, there's your source
literalAardvark•10mo ago
Thermal paper has some pretty horrible effects on your health, I'd avoid that.
whartung•10mo ago
Just curious if the folks at CVS chart particularly high on these horrible effects, considering the no doubt thousands of feet of receipts they handle each day there.

For those unaware, at the CVS Pharmacy if you walk in and buy so much as a pack of gum, you're likely to walk out with at least 3 feet of receipt. They use them to tack on ads and coupons.

literalAardvark•10mo ago
Probably, idk if there are such specific studies.

The thermal sensitive layer contains very large amounts of BPA in a dusty form that will easily contaminate your hands.

BPA is a major endocrine disruptor. They might say BPA-free, which would be technically correct, but that just means they'll use a near identical BPA variant that isn't proven to be an endocrine disruptor yet.

Handle with care, wash your hands, don't put them in the kitchen.

turtlebits•10mo ago
You could use the "share" sheet on your phone to send to an RPI over BT via obexpushd, then process it on device -> eink display
kittikitti•10mo ago
This is an incredible idea! I really like it because it sounds so obvious after being exposed to it but I never thought of it before! I wonder what other ways we could integrate GPT's, LLM's, and other AI into the simple "Print" functionality across all our devices.
mathfailure•10mo ago
For Windows only.

Abandonware.

npodbielski•10mo ago
Looks like it is .NET Framework, so there is possibility to port it to .net core and possibly use via dll or .so as library inside other, linux desktop framework (or in something more portable like Flutter).
cryptonector•10mo ago
Could get ported.
johnea•10mo ago
Just another poster child of windoze suk.

Of course, CUPS based printing has had built in print to PDF for years...

[1] Common Unix Printing System

tonyedgecombe•10mo ago
Windows has had a built in PDF driver for a long time as well.

A Decade of Slug

https://terathon.com/blog/decade-slug.html
416•mwkaufma•6h ago•36 comments

Python 3.15's JIT is now back on track

https://fidget-spinner.github.io/posts/jit-on-track.html
268•guidoiaquinti•7h ago•100 comments

Microsoft's 'unhackable' Xbox One has been hacked by 'Bliss'

https://www.tomshardware.com/video-games/console-gaming/microsofts-unhackable-xbox-one-has-been-h...
537•crtasm•10h ago•206 comments

Get Shit Done: A Meta-Prompting, Context Engineering and Spec-Driven Dev System

https://github.com/gsd-build/get-shit-done
189•stefankuehnel•5h ago•112 comments

Mistral AI Releases Forge

https://mistral.ai/news/forge
138•pember•4h ago•11 comments

Show HN: Sub-millisecond VM sandboxes using CoW memory forking

https://github.com/adammiribyan/zeroboot
32•adammiribyan•12h ago•7 comments

Launch an autonomous AI agent with sandboxed execution in 2 lines of code

https://amaiya.github.io/onprem/examples_agent.html
6•wiseprobe•47m ago•0 comments

Kagi Small Web

https://kagi.com/smallweb/
705•trueduke•16h ago•196 comments

Launch HN: Kita (YC W26) – Automate credit review in emerging markets

27•rheamalhotra1•6h ago•3 comments

More than 135 open hardware devices flashable with your own firmware

https://openhardware.directory
9•iosifnicolae2•4d ago•0 comments

Electron microscopy shows 'mouse bite' defects in semiconductors

https://news.cornell.edu/stories/2026/03/electron-microscopy-shows-mouse-bite-defects-semiconductors
22•hhs•4d ago•2 comments

It Took Me 30 Years to Solve This VFX Problem – Green Screen Problem [video]

https://www.youtube.com/watch?v=3Ploi723hg4
167•yincrash•4d ago•73 comments

Unsloth Studio

https://unsloth.ai/docs/new/studio
169•brainless•10h ago•37 comments

Chrome extension adjusts video speed based on how fast the speaker is talking

https://github.com/ywong137/speech-speed
95•MrBuddyCasino•4d ago•29 comments

Show HN: Fatal Core Dump – A debugging murder mystery played with GDB

https://www.robopenguins.com/fatal_core_dump/
27•axlan•4d ago•1 comments

Why AI systems don't learn – On autonomous learning from cognitive science

https://arxiv.org/abs/2603.15381
22•aanet•4h ago•8 comments

The Pleasures of Poor Product Design

https://www.inconspicuous.info/p/the-pleasures-of-poor-product-design
4•NaOH•56m ago•0 comments

Edge.js: Run Node apps inside a WebAssembly sandbox

https://wasmer.io/posts/edgejs-safe-nodejs-using-wasm-sandbox
98•syrusakbary•7h ago•29 comments

Honda is killing its EVs

https://techcrunch.com/2026/03/14/honda-is-killing-its-evs-and-any-chance-of-competing-in-the-fut...
199•sylvainkalache•2d ago•415 comments

Node.js needs a virtual file system

https://blog.platformatic.dev/why-nodejs-needs-a-virtual-file-system
228•voctor•11h ago•194 comments

A tale about fixing eBPF spinlock issues in the Linux kernel

https://rovarma.com/articles/a-tale-about-fixing-ebpf-spinlock-issues-in-the-linux-kernel/
3•y1n0•1h ago•0 comments

Ryugu asteroid samples contain all DNA and RNA building blocks

https://phys.org/news/2026-03-ryugu-asteroid-samples-dna-rna.html
185•bookofjoe•13h ago•96 comments

Spice Data (YC S19) Is Hiring a Product Specialist

https://www.ycombinator.com/companies/spice-data/jobs/P0e9MKz-product-specialist-new-grad
1•richard_pepper•8h ago

'The Secret Agent': Exploring a Vibrant, yet Violent Brazil (2025)

https://theasc.com/articles/the-secret-agent-cinematography
124•tambourine_man•10h ago•60 comments

Show HN: I built an interactive 3D three-body problem simulator in the browser

https://structuredlabs.github.io/threebodyproblem/
19•amrutha_•4d ago•10 comments

Torturing Rustc by Emulating HKTs

https://www.harudagondi.space/blog/torturing-rustc-by-emulating-hkts/
50•g0xA52A2A•3d ago•6 comments

Meta and TikTok let harmful content rise to drove engagement, say whistleblowers

https://www.bbc.com/news/articles/cqj9kgxqjwjo
201•1vuio0pswjnm7•5h ago•128 comments

OpenSUSE Kalpa

https://kalpadesktop.org/
177•ogogmad•12h ago•78 comments

Show HN: Horizon – GPU-accelerated infinite-canvas terminal in Rust

https://github.com/peters/horizon
54•petersunde•7h ago•21 comments

Java 26 is here

https://hanno.codes/2026/03/17/java-26-is-here/
176•mfiguiere•7h ago•136 comments