frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Show HN: Ocrbase – pdf → .md/.json document OCR and structured extraction API

https://github.com/majcheradam/ocrbase
40•adammajcher•3h ago•11 comments

Show HN: Preloop – An MCP proxy for human-in-the-loop tool approvals

https://preloop.ai
2•yconst•10m ago•0 comments

Show HN: www.kitty.cards – Make your own Apple Wallet cards

https://www.kitty.cards
2•xenodium•40m ago•2 comments

Show HN: Orcheo – a Python n8n‑like workflow engine built for AI agents

https://github.com/ShaojieJiang/orcheo
2•NeuralNotwork•1h ago•0 comments

Show HN: Claude Skill Editor

https://github.com/mtct/skill-editor
2•mtct88•1h ago•0 comments

Show HN: APIsec MCP Audit – Audit what your AI agents can access

https://github.com/apisec-inc/mcp-audit
2•rajaramr7•1h ago•0 comments

Show HN: Mother MCP – Manage your Agent Skills like a boss-Auto provision skills

https://github.com/dmgrok/mcp_mother_skills
2•DavidGraca•1h ago•0 comments

Show HN: Artificial Ivy in the Browser

https://da.nmcardle.com/grow
87•dnmc•13h ago•16 comments

Show HN: An interactive physics simulator with 1000’s of balls, in your terminal

https://github.com/minimaxir/ballin
64•minimaxir•22h ago•14 comments

Show HN: repere – Local-first SQL data explorer using DuckDB WASM

https://repere.ai
2•mattismegevand•3h ago•0 comments

Show HN: Subth.ink – write something and see how many others wrote the same

https://subth.ink/
76•sonnig•21h ago•41 comments

Show HN: Pipenet – A Modern Alternative to Localtunnel

https://pipenet.dev/
105•punkpeye•1d ago•19 comments

Show HN: E80: an 8-bit CPU in structural VHDL

https://github.com/Stokpan/E80
28•Axonis•2d ago•1 comments

Show HN: Munimet.ro – ML-based status page for the local subways in SF

https://munimet.ro/
12•MrEricSir•4d ago•6 comments

Show HN: A creative coding library for making art with desktop windows

https://github.com/willmeyers/window-art
33•willmeyers•20h ago•4 comments

Show HN: Movieagent.io – An agent for movie recommendations (with couple mode)

https://movieagent.io
19•roknovosel•21h ago•5 comments

Show HN: Lume 0.2 – Build and Run macOS VMs with unattended setup

https://cua.ai/docs/lume/guide/getting-started/introduction
145•frabonacci•1d ago•41 comments

Show HN: LangGraph architecture that scales (hexagonal pattern, 110 tests)

https://github.com/cleverhoods/sagecompass
3•cleverhoods•8h ago•0 comments

Show HN: AWS-doctor – A terminal-based AWS health check and cost optimizer in Go

https://github.com/elC0mpa/aws-doctor
51•elC0mpa•1d ago•21 comments

Show HN: Beats, a web-based drum machine

https://beats.lasagna.pizza
152•kinduff•1d ago•48 comments

Show HN: Figma-use – CLI to control Figma for AI agents

https://github.com/dannote/figma-use
112•dannote•2d ago•37 comments

Show HN: ChunkHound, a local-first tool for understanding large codebases

https://github.com/chunkhound/chunkhound
113•NadavBenItzhak•2d ago•30 comments

Show HN: Streaming gigabyte medical images from S3 without downloading them

https://github.com/PABannier/WSIStreamer
160•el_pa_b•3d ago•48 comments

Show HN: GibRAM an in-memory ephemeral GraphRAG runtime for retrieval

https://github.com/gibram-io/gibram
60•ktyptorio•2d ago•9 comments

Show HN: Opal Editor, free Obsidian alternative for markdown and site publishing

https://github.com/rbbydotdev/opal
33•rbbydotdev•1d ago•6 comments

Show HN: Lite Bible – A fast, minimalist Bible reader

https://litebible.org/
9•foxinthebox•19h ago•2 comments

Show HN: HTTP:COLON – A quick HTTP header/directive inspector and reference

https://httpcolon.dev/
35•ultimoo•1d ago•4 comments

Show HN: Xenia – A monospaced font built with a custom Python engine

https://github.com/Loretta1982/xenia
74•xeniafont•2d ago•41 comments

Show HN: Intent Layer: A context engineering skill for AI agents

https://www.railly.dev/blog/intent-layer/
28•Hunter17•1d ago•4 comments

Show HN: Speed Miners – A tiny RTS resource mini-game

https://speedminers.fun/
47•nickponline•2d ago•8 comments
Open in hackernews

Show HN: Ocrbase – pdf → .md/.json document OCR and structured extraction API

https://github.com/majcheradam/ocrbase
40•adammajcher•3h ago

Comments

mechazawa•2h ago
Is only bun supported or also regular node?
hersko•1h ago
I have a flow where i extract text from a pdf with pdf-parse and then feed that to an ai for data extraction. If that fails i convert it to a png and send the image for data extraction. This works very well and would presumably be far cheaper as i'm generally sending text to the model instead of relying on images. Isn't just sending the images for ocr significantly more expensive?
mimim1mi•1h ago
By definition, OCR means optical character recognition. It depends on the contents of the PDF what kind of extraction methodology can work. Often some available PDFs are just scans of printed documents or handwritten notes. If machine readable text is available your approach is great.
trollbridge•47m ago
I always render an image and OCR that so I don’t get odd problems from invisible text and it also avoids being affected by anything for SEO.
saaaaaam•45m ago
There was an interesting discussion on here a couple of months back about images vs text, driven by this article: https://www.seangoedecke.com/text-tokens-as-image-tokens/

Discussion is here: https://news.ycombinator.com/item?id=45652952

sgc•1h ago
How does this compare to dots.ocr? I got fantastic results when I tested dots.

https://github.com/rednote-hilab/dots.ocr

mjrpes•59m ago
Ocrbase is CUDA only while dots.ocr uses vLLM, so should support ROCm/AMD cards?
actionfromafar•9m ago
How about CPU?
v3ss0n•54m ago
How this is better over Surya/Marker or kreuzberg https://github.com/kreuzberg-dev/kreuzberg.
jadbox•21m ago
Sounds like someone needs to run their own test cases and report back on which solution does a better job...
sync•2m ago
This is essentially a (vibe-coded?) wrapper around PaddleOCR: https://github.com/PaddlePaddle/PaddleOCR

The "guts" are here: https://github.com/majcheradam/ocrbase/blob/7706ef79493c47e8...