frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Rolling your own serverless OCR in 40 lines of code

https://christopherkrapu.com/blog/2026/ocr-textbooks-modal-deepseek/
23•mpcsb•4d ago

Comments

voidUpdate•57m ago
Wouldn't "Serverless OCR" mean something like running tesseract locally on your computer, rather than creating an AI framework and running it on a server?
cachius•54m ago
Serverless means spinning compute resources up on demand in the cloud vs. running a server permanently.
dsr_•21m ago
~99.995% of the computing resources used on this are from somebody else's servers, running the LLM model.
normie3000•24m ago
Thanks for noting this - for a moment I was excited.
ddtaylor•39m ago
How does this compare to Tesserect?
coolness•35m ago
Slight tangent: i was wondering why DeepSeek would develop something like this. In the linked paper it says

> In production, DeepSeek-OCR can generate training data for LLMs/VLMs at a scale of 200k+ pages per day (a single A100-40G).

That... doesn't sound legal

apwheele•24m ago
Question for the crowd -- with autoscaling, when a new pod is created it will still download the model right from huggingface?

I like to push everything into the image as much as I can. So in the image modal, I would run a command to trigger downloading the model. Then in the app just point to the locally downloaded model. So bigger image, but do not need to redownload on start up.

kbyatnal•11m ago
Deepseek OCR is no longer state of the art. There are much better open source OCR models available now.

ocrarena.ai maintains a leaderboard, and a number of other open source options like dots [1] or olmOCR [2] rank higher.

[1] https://www.ocrarena.ai/compare/dots-ocr/deepseek-ocr

[2] https://www.ocrarena.ai/compare/olmocr-2/deepseek-ocr

MessageFormat: Unicode standard for localizable message strings

https://github.com/unicode-org/message-format-wg
76•todsacerdoti•3h ago•35 comments

I want to wash my car. The car wash is 50 meters away. Should I walk or drive?

https://mastodon.world/@knowmadd/116072773118828295
776•novemp•7h ago•493 comments

I’m joining OpenAI

https://steipete.me/posts/2026/openclaw
1181•mfiguiere•15h ago•869 comments

Qwen3.5: Towards Native Multimodal Agents

https://qwen.ai/blog?id=qwen3.5
86•danielhanchen•4h ago•34 comments

Thanks a lot, AI: Hard drives are sold out for the year, says WD

https://mashable.com/article/ai-hard-drive-hdd-shortages-western-digital-sold-out
86•dClauzel•1h ago•46 comments

Rolling your own serverless OCR in 40 lines of code

https://christopherkrapu.com/blog/2026/ocr-textbooks-modal-deepseek/
23•mpcsb•4d ago•8 comments

Anthropic tries to hide Claude's AI actions. Devs hate it

https://www.theregister.com/2026/02/16/anthropic_claude_ai_edits/
82•beardyw•2h ago•35 comments

picol: A Tcl interpreter in 500 lines of code

https://github.com/antirez/picol
64•tosh•5h ago•39 comments

The Israeli spyware firm that accidentally just exposed itself

https://ahmedeldin.substack.com/p/the-israeli-spyware-firm-that-accidentally
56•0x54MUR41•1h ago•7 comments

Magnus Carlsen Wins the Freestyle (Chess960) World Championship

https://www.fide.com/magnus-carlsen-wins-2026-fide-freestyle-world-championship/
317•prophylaxis•15h ago•213 comments

Vim-pencil: Rethinking Vim as a tool for writing

https://github.com/preservim/vim-pencil
42•gurjeet•3d ago•8 comments

Modern CSS Code Snippets: Stop writing CSS like it's 2015

https://modern-css.com
533•eustoria•19h ago•209 comments

Expensively Quadratic: The LLM Agent Cost Curve

https://blog.exe.dev/expensively-quadratic
62•luu•3d ago•33 comments

1,300-year-old world chronicle unearthed in Sinai

https://www.heritagedaily.com/2026/02/1300-year-old-world-chronicle-unearthed-in-sinai/156948
70•telotortium•4d ago•10 comments

Arm wants a bigger slice of the chip business

https://www.economist.com/business/2026/02/12/arm-wants-a-bigger-slice-of-the-chip-business
107•andsoitis•11h ago•70 comments

LT6502: A 6502-based homebrew laptop

https://github.com/TechPaula/LT6502
373•classichasclass•20h ago•179 comments

Audio is the one area small labs are winning

https://www.amplifypartners.com/blog-posts/arming-the-rebels-with-gpus-gradium-kyutai-and-audio-ai
234•rocauc•3d ago•61 comments

Show HN: I generated a "stress test" of 200 rare defects from 7 real photos

5•jmalevez•3d ago•2 comments

Show HN: Microgpt is a GPT you can visualize in the browser

https://microgpt.boratto.ca
221•b44•19h ago•23 comments

JavaScript-heavy approaches are not compatible with long-term performance goals

https://sgom.es/posts/2026-02-13-js-heavy-approaches-are-not-compatible-with-long-term-performanc...
112•luu•13h ago•128 comments

I gave Claude access to my pen plotter

https://harmonique.one/posts/i-gave-claude-access-to-my-pen-plotter
225•futurecat•2d ago•148 comments

Building SQLite with a small swarm

https://kiankyars.github.io/machine_learning/2026/02/12/sqlite.html
79•kyars•8h ago•66 comments

EU bans the destruction of unsold apparel, clothing, accessories and footwear

https://environment.ec.europa.eu/news/new-eu-rules-stop-destruction-unsold-clothes-and-shoes-2026...
1082•giuliomagnifico•20h ago•721 comments

Hard problems in social media archiving

https://alexwlchan.net/2025/hard-problems-in-social-media-archiving/
14•surprisetalk•3d ago•2 comments

Designing a 36-key custom keyboard layout (2021)

https://peterxjang.medium.com/designing-a-36-key-custom-keyboard-layout-24498a0eecd4
25•speckx•2d ago•12 comments

Lost Soviet Moon Lander May Have Been Found

https://www.nytimes.com/2026/02/10/science/luna-9-moon-lander-soviet.html
73•Brajeshwar•4d ago•48 comments

Gwtar: A static efficient single-file HTML format

https://gwern.net/gwtar
256•theblazehen•22h ago•78 comments

Real-time PathTracing with global illumination in WebGL

https://erichlof.github.io/THREE.js-PathTracing-Renderer/
175•tobr•3d ago•15 comments

Show HN: Knock-Knock.net – Visualizing the bots knocking on my server's door

https://knock-knock.net
180•djkurlander•20h ago•73 comments

Pocketblue – Fedora Atomic for mobile devices

https://github.com/pocketblue/pocketblue
117•nikodunk•21h ago•36 comments