
Ask HN: Recommendations for self hostable OCR to extract code from images

3•vivzkestrel•1h ago
- Requirements

- No per-inference fees: the model must be self-hostable

- It can run on AWS EC2

- It extracts code from images with very high accuracy

- What are the most accurate OCR models for extracting code from images?

Comments

vivzkestrel•1h ago
- As you know, most OCR models are trained on PDFs, receipts, ordinary text, etc.

- That training does not carry over well to structured text like code

- What are the state-of-the-art self-hostable OCR models that can extract code from images with very high accuracy?

- I have tried Tesseract and it is not very good at this. Even if you are not familiar with any other model, perhaps you can suggest a Tesseract pipeline I can follow to improve extraction accuracy

- Currently, my pipeline looks like this:

- For every input image, check whether it is light text on a dark background or dark text on a light background

- Tesseract is trained mostly on dark text on a light background, so I invert dark-background images before passing them to Tesseract

- Are there other steps you think I need to include?
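
For reference, the two steps above can be sketched roughly like this. It is a stdlib-only illustration, not the actual code from my pipeline: it assumes the image is already decoded to grayscale rows of 0-255 values, and the names `has_dark_background`, `normalize_polarity` and the cutoff of 128 are hypothetical choices made for the example.

```python
from statistics import median

Gray = list[list[int]]  # rows of 0-255 luminance values (assumed input format)


def has_dark_background(img: Gray, threshold: int = 128) -> bool:
    """Guess polarity. Background pixels usually outnumber glyph pixels,
    so the median luminance approximates the background colour."""
    pixels = [p for row in img for p in row]
    return median(pixels) < threshold  # threshold is an assumed cutoff


def normalize_polarity(img: Gray, threshold: int = 128) -> Gray:
    """Return a dark-text-on-light copy, inverting only dark-background images."""
    if has_dark_background(img, threshold):
        return [[255 - p for p in row] for row in img]
    return [row[:] for row in img]


# Toy 3x4 "screenshot": dark theme with one bright glyph stroke.
dark_theme = [
    [20, 20, 230, 20],
    [20, 20, 230, 20],
    [20, 20, 20, 20],
]
print(normalize_polarity(dark_theme)[0])  # [235, 235, 25, 235]
```

The normalized image is what would then be handed to Tesseract. The median is used instead of the mean so that a few bright glyph pixels do not skew the background estimate.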

treetalker•15m ago
Not sure about running it in AWS, but this works well even on Intel Macs:

https://github.com/LESIM-Co-Ltd/CoreOCR

There are other similar wrappers for the macOS Vision framework; just search on GitHub.