frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Docuglean – Extract Structured Data from PDFs/Images Using AI

https://github.com/cernis-intelligence/docuglean-ocr
2•victorevogor•3h ago
Hi HN! I built Docuglean, an open-source SDK for intelligent document processing that works with OpenAI, Mistral, Google Gemini, and Hugging Face models.

The idea came from repeatedly writing boilerplate code to extract structured data from invoices, receipts, and other documents. Instead of wrestling with different API formats, I wanted a unified interface that:

- Extracts structured data using Zod/Pydantic schemas - Classifies and splits multi-section documents (e.g., medical records) - Processes documents in batches with automatic error handling - Works locally without APIs (for PDFs, DOCX, XLSX, etc.)

Key features: - Available for both TypeScript and Python - Batch processing with concurrent requests - Document classification (splits 100+ page docs by category) - Local parsers (no API needed for basic extraction) - Apache 2.0 licensed

Currently supports OpenAI, Mistral, Gemini, and Hugging Face. Planning to add Together AI, Anthropic, and more.

Would love feedback on the API design and what features would be most useful

Show HN: F32 – An Extremely Small ESP32 Board

https://github.com/PegorK/f32
180•pegor•1d ago•27 comments

Show HN: My hobby OS that runs Minecraft

https://astral-os.org/posts/2025/10/31/astral-minecraft.html
117•avaliosdev•3d ago•16 comments

Show HN: A game where you invest into startups from history

https://startupgambit.com
33•vire00•5d ago•22 comments

Show HN: Tangent – Security log pipeline powered by WASM

https://github.com/telophasehq/tangent
17•ethanblackburn•8h ago•2 comments

Show HN: MCP Traffic Analysis Tool

https://github.com/mcp-shark/mcp-shark
27•o4isec•3d ago•0 comments

Show HN: Docuglean – Extract Structured Data from PDFs/Images Using AI

https://github.com/cernis-intelligence/docuglean-ocr
2•victorevogor•3h ago•0 comments

Show HN: GitPulse – AI-powered tool to discover open source projects

https://git-pulsee.vercel.app
2•Indri-Fazliji•4h ago•0 comments

Show HN: Roundible – A Space for Anonymous Discussions

https://roundible.com
2•Oxidome•4h ago•0 comments

Show HN: I made a down detector for down detector

https://downdetectorsdowndetector.com
564•gusowen•2d ago•164 comments

Show HN: Supabase-Test – Fast Isolated Postgres DBs for Testing Supabase RLS

https://www.npmjs.com/package/supabase-test
15•pyramation•6h ago•6 comments

Show HN: Search London StreetView panoramas by text

https://london.publicinsights.uk
4•dfworks•6h ago•2 comments

Show HN: RowboatX – open-source Claude Code for everyday automations

https://github.com/rowboatlabs/rowboat
124•segmenta•2d ago•40 comments

Show HN: I built a synth for my daughter

https://bitsnpieces.dev/posts/a-synth-for-my-daughter/
1267•random_moonwalk•1w ago•209 comments

Show HN: Awesome J2ME

https://github.com/hstsethi/awesome-j2me
67•catstor•13h ago•48 comments

Show HN: OctoDNS, Tools for managing DNS across multiple providers

https://octodns.readthedocs.io/en/latest/
26•gardnr•1d ago•2 comments

Show HN: DNS Benchmark Tool – Compare and monitor resolvers

https://github.com/frankovo/dns-benchmark-tool
53•ovo101•1d ago•27 comments

Show HN: Chrome Store–featured extension that writes X replies via DOM observers

https://www.xinsight.me/
4•shashankshukla•9h ago•0 comments

Show HN: Browser-based interactive 3D Three-Body problem simulator

https://trisolarchaos.com/?pr=O_8(0.6)&n=3&s=5.0&so=0.00&im=rk4&dt=1.00e-4&rt=1.0e-6&at=1.0e-8&bs...
240•jgchaos•2d ago•111 comments

Show HN: Guts – convert Golang types to TypeScript

https://github.com/coder/guts
103•emyrk•2d ago•30 comments

Show HN: Parqeye – A CLI tool to visualize and inspect Parquet files

https://github.com/kaushiksrini/parqeye
162•kaushiksrini•3d ago•35 comments

Show HN: ESPectre – Motion detection based on Wi-Fi spectre analysis

https://github.com/francescopace/espectre
208•francescopace•3d ago•50 comments

Show HN: A subtly obvious e-paper room air monitor

https://www.nicolin-dora.ch/blog/en-epaper-room-air-monitor-part-1/
64•nomarv•2d ago•28 comments

Show HN: CTON: JSON-compatible, token-efficient text format for LLM prompts

https://github.com/davidesantangelo/cton
8•daviducolo•16h ago•1 comments

Show HN: Continuous Claude – run Claude Code in a loop

https://github.com/AnandChowdhary/continuous-claude
163•anandchowdhary•5d ago•60 comments

Show HN: Marimo VS Code extension – Python notebooks built on LSP and uv

https://github.com/marimo-team/marimo-lsp
61•manzt•1d ago•5 comments

Show HN: Vibe Prolog

https://github.com/nlothian/Vibe-Prolog
27•nl•1d ago•4 comments

Show HN: Wasda – Experience transformer attention as music

https://github.com/farukalpay/wasda
4•kinders•14h ago•0 comments

Show HN: Lamina – A compiler backend that is not LLVM or Cranelift

https://github.com/SkuldNorniern/lamina
5•skuldnorniern•14h ago•0 comments

Show HN: Interactive research papers (a big step up from ArXiv HTML)

https://sciencestack.ai
9•cjlooi•14h ago•5 comments

Show HN: Long Courrier – A custom web player for a 1h Barber Beats mix

https://monosky.mateo-siam.com/
2•Mateleo•15h ago•0 comments