frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Igalia, Servo, and the Sovereign Tech Fund

https://www.igalia.com/2025/10/09/Igalia,-Servo,-and-the-Sovereign-Tech-Fund.html
94•robin_reala•1h ago•10 comments

Show HN: I invented a new generative model and got accepted to ICLR

https://discrete-distribution-networks.github.io/
190•diyer22•5h ago•23 comments

PSA: Always use a separate domain for user content

https://www.statichost.eu/blog/google-safe-browsing/
53•ericselin•46m ago•43 comments

OpenGL is getting mesh shaders as well, via GL_EXT_mesh_shader

https://www.supergoodcode.com/mesh-shaders-in-the-current-year/
19•pjmlp•2h ago•1 comments

A small number of samples can poison LLMs of any size

https://www.anthropic.com/research/small-samples-poison
1015•meetpateltech•22h ago•382 comments

Ohno Type School

https://ohnotype.co/blog/ohno-type-school-a
71•tobr•4d ago•28 comments

My approach to building large technical projects (2023)

https://mitchellh.com/writing/building-large-technical-projects
216•mad2021•10h ago•29 comments

Parallelizing Cellular Automata with WebGPU Compute Shaders

https://vectrx.substack.com/p/webgpu-cellular-automata
34•ibobev•4h ago•3 comments

Nobel Peace Prize 2025: María Corina Machado

https://www.nobelprize.org/prizes/peace/2025/summary/
373•pykello•5h ago•363 comments

Weave (YC W25) is hiring a founding AI engineer

https://www.ycombinator.com/companies/weave-3/jobs/SqFnIFE-founding-ai-engineer
1•adchurch•2h ago

I Switched from Htmx to Datastar

https://everydaysuperpowers.dev/articles/why-i-switched-from-htmx-to-datastar/
204•ksec•7h ago•132 comments

Python 3.14 is here. How fast is it?

https://blog.miguelgrinberg.com/post/python-3-14-is-here-how-fast-is-it
629•pjmlp•1d ago•448 comments

Htmx, Datastar, Greedy Developer

https://drshapeless.com/blog/posts/htmx,-datastar,-greedy-developer.html
21•KolmogorovComp•3h ago•1 comments

Bringing Desktop Linux GUIs to Android: The Next Step in Graphical App Support

https://www.linuxjournal.com/content/bringing-desktop-linux-guis-android-next-step-graphical-app-...
9•sipofwater•3h ago•4 comments

Static Bundle Object: Modernizing Static Linking

https://medium.com/@eyal.itkin/static-bundle-object-modernizing-static-linking-f1be36175064
18•ingve•2d ago•10 comments

Examples Are the Best Documentation

https://rakhim.exotext.com/examples-are-the-best-documentation
253•Bogdanp•18h ago•87 comments

An MVCC-like columnar table on S3 with constant-time deletes

https://www.shayon.dev/post/2025/277/an-mvcc-like-columnar-table-on-s3-with-constant-time-deletes/
15•shayonj•3d ago•1 comments

My first contribution to Linux

https://vkoskiv.com/first-linux-patch/
611•vkoskiv•4d ago•72 comments

A Story About Bypassing Air Canada's In-Flight Network Restrictions

https://ramsayleung.github.io/en/post/2025/a_story_about_bypassing_air_canadas_in-flight_network_...
72•samray•6h ago•56 comments

Multi-Core by Default

https://www.rfleury.com/p/multi-core-by-default
45•kruuuder•7h ago•18 comments

Figure 03, our 3rd generation humanoid robot

https://www.figure.ai/news/introducing-figure-03
370•lairv•1d ago•363 comments

Show HN: I've built a tiny hand-held keyboard

https://github.com/mafik/keyer
362•mafik•22h ago•99 comments

LLMs are mortally terrified of exceptions

https://twitter.com/karpathy/status/1976077806443569355
271•nought•20h ago•121 comments

A built-in 'off switch' to stop persistent pain

https://penntoday.upenn.edu/news/select-neurons-brainstem-may-hold-key-treating-chronic-pain
179•gmays•17h ago•80 comments

Origami Patterns Solve a Major Physics Riddle

https://www.quantamagazine.org/origami-patterns-solve-a-major-physics-riddle-20251006/
4•westurner•3d ago•0 comments

The Prairie Farmers Preserving the Most Threatened Ecosystem – Forever

https://reasonstobecheerful.world/prairie-farmers-preserve-most-threatened-ecosystem-forever/
4•PaulHoule•15m ago•0 comments

Subway Builder: A realistic subway simulation game

https://www.subwaybuilder.com/
269•0xbeefcab•20h ago•115 comments

A beginner's guide to deploying LLMs with AMD on Windows using PyTorch

https://gpuopen.com/learn/pytorch-windows-amd-llm-guide/
92•beckford•4d ago•26 comments

Interactive Double Pendulum Playground

https://theabbie.github.io/DoublePendulum/
51•melector•2d ago•18 comments

ScribeOCR – Web interface for recognizing text, OCR, & creating digitized docs

https://github.com/scribeocr/scribeocr
89•atomicnature•4d ago•13 comments
Open in hackernews

ScribeOCR – Web interface for recognizing text, OCR, & creating digitized docs

https://github.com/scribeocr/scribeocr
89•atomicnature•4d ago

Comments

ranger_danger•10h ago
This is awesome. Only issue was I had to disable my JShelter extension because it would freeze the page using 100% CPU forever.
aidenn0•9h ago
This is my first encounter with Scribe.js; since I have many book scans I always try OCRing them when I see this. Compared to Tesseract (which is the best I have so far), it gets the words right slightly more, but the paragraph segmentation is many times worse. On a book where every paragraph is indented, it reliably decides two consecutive one-line paragraphs are the same paragraph, which is understandable, but a downgrade from Tesseract which gets the paragraph segmentation as correct as possible (It doesn't handle paragraphs that spanpage-breaks, since I'm feeding it one page at a time)
Elucalidavah•8h ago
> Tesseract (which is the best I have so far)

Have you looked at EasyOCR?

aidenn0•22m ago
EasyOCR is significantly worse than Tesseract for clean printed text and , while being orders of magnitude slower; far better than Tesseract for low-quality clean scans and extracting text from pictures (e.g. comics), which Tesseract does not as well.
zihotki•7h ago
Scribe is Tesseract. It uses tesseract.js which is a Web Assembly port of Tesseract. So they in theory should be equal. In practice custom settings or older versions could make a difference.
criddell•1h ago
What's the motivation for doing this in the browser? It seems like intentionally choosing a more difficult path to create an inferior result.

A native MacOS or Windows application could use the OCR facilities of the operating system and, in my experience, both produce results that are far better than Tesseract.

Zardoz84•50m ago
Generate the OCR on the fly, in the browser, when you do not have the proper OCR info. As someone that works on public web libraries, I see it useful (but wasteful)
aidenn0•20m ago
This is only true in the "speed" mode; in the "quality" mode it claims better word recognition than Tesseract on clean scans (which matches my tests): https://github.com/scribeocr/scribe.js/blob/master/docs/scri...
zihotki•8h ago
According to what I read in the documentation, it uses Tesseract underneath. I've used Tesseract v3 in the past and it was pain. Tesseract 4 uses LSTM neural net. How good is the performance and quality of the recognition nowadays in v4? Could anyone share his experience?
graynk•5h ago
I use paperless-ngx for digitizing all my documents, it also uses Tesseract. The result is not perfect, but more than acceptable, if I scan at 600dpi
fodkodrasz•7h ago
I really like the idea, but unfortunately it could not cope with my usecase.

I have some lecture slides as image-only PDF (Hungarian language with a sparkle of English and Latin (biology)). I tried the tool on it and I had the following experience:

- proofreading with the overlay seems like a good idea, actually it is unusable when the original text has colors, and you need to recognize diacritic marks. Being able to show the original in grayscale or black&white could help. (BW worked, but Grayscale left everything colored)

- For proofreading the ebook mode was the most useful, I immediately spotted lots of errors that I could not see with overlay. A quick switch between the modes would be useful

- Editing text is not efficient when error rate is high (Hungarian language is not supported, that caused it mostly I guess), the interface has high overhead for mass corrections.

Very good idea, I think after a little polish it would even fit my usecase. For more traditional OCR usecases than mine it is probably already great.

constantinum•7h ago
anyone looking for an ocr or text pre-processor that maintains the layout(tables, forms) try LLMWhisperer > https://pg.llmwhisperer.unstract.com/
Zardoz84•4h ago
If it would generate ALTO XML files... IF!