Show HN: I built a unified inference layer for Document Processing Models

https://github.com/adithya-s-k/Omnidocs
2•Adithya-Kolavi•1h ago
Hey HN,

I’m Adithya, a 22-year-old researcher from India. I work with a lot of document processing models while building AI pipelines, and one pain kept repeating: every model has its own inference code, preprocessing steps, and output format. Swapping models or testing new ones meant rewriting a lot of boilerplate each time.

So I built Omnidocs, an open-source library for running document processing models through a simple, unified API, with a vision-first approach to understanding documents.

Key features:

> Pick a task and a model, run inference with one interface
> Supports common document tasks: Text extraction, OCR, Table extraction, Layout analysis and Structured Extraction ...
> 16+ models supported out of the box (many more integrations to come)
> Runs locally on Mac or GPUs (MLX and vLLM backends supported)
> Works with VLM APIs like GPT, Claude, Gemini and many more that support Open Responses API spec
> Designed to quickly build and test document processing pipelines
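
To make the "pick a task and a model, run inference with one interface" idea concrete, here is a minimal Python sketch of how such a layer could be structured. It is not Omnidocs' actual API: every name below (`DocumentResult`, `register`, `run`, and the two dummy models) is hypothetical and exists only for illustration. The assumption is that each backend is wrapped in a small adapter that returns the same result type.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Tuple


@dataclass
class DocumentResult:
    """Common output format shared by every backend (hypothetical)."""
    task: str
    model: str
    text: str


# Registry mapping (task, model) -> function doing the model-specific work.
_REGISTRY: Dict[Tuple[str, str], Callable[[bytes], str]] = {}


def register(task: str, model: str):
    """Decorator that registers a model-specific runner under a task name."""
    def wrap(fn: Callable[[bytes], str]) -> Callable[[bytes], str]:
        _REGISTRY[(task, model)] = fn
        return fn
    return wrap


@register("ocr", "dummy-upper")
def _dummy_upper(data: bytes) -> str:
    # Stand-in for a real OCR model: decodes the bytes and upper-cases them.
    return data.decode("utf-8").upper()


@register("ocr", "dummy-reverse")
def _dummy_reverse(data: bytes) -> str:
    # A second stand-in model with different behaviour.
    return data.decode("utf-8")[::-1]


def run(task: str, model: str, data: bytes) -> DocumentResult:
    """Single entry point: pick a task and a model, get a uniform result."""
    try:
        runner = _REGISTRY[(task, model)]
    except KeyError:
        raise ValueError(
            f"no model {model!r} registered for task {task!r}"
        ) from None
    return DocumentResult(task=task, model=model, text=runner(data))
```

With this shape, swapping models is a one-argument change, for example `run("ocr", "dummy-upper", data)` versus `run("ocr", "dummy-reverse", data)`, and the calling code that consumes `DocumentResult` stays the same.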

This has helped me prototype document workflows much faster and compare models easily.

Would love feedback on the API design, developer experience, and what integrations would make this more useful.

Repo: https://github.com/adithya-s-k/omnidocs

Computer History Museum Recovers Rare Unix History

https://www.youtube.com/watch?v=-xlq_MPWNKk
1•todsacerdoti•28s ago•0 comments

Watching a Robotics Startup Die from the Inside

https://ruixu.us/posts/six-things-robotics-startup
1•gkolli•37s ago•0 comments

TranslateGemma now runs 100% in the browser on WebGPU with Transformers.js v4

https://huggingface.co/spaces/webml-community/TranslateGemma-WebGPU
1•tzury•1m ago•1 comment

What Holds America Together?

https://walkingtheworld.substack.com/p/what-holds-america-together
1•VelNZ•2m ago•0 comments

Show HN: Elev8or Run Creator Marketing Like Paid Ads

https://www.elev8or.io
1•Sourabhsinr•3m ago•0 comments

Michael Burry Reveals Accounting Tricks of Mag 7 Firms to Inflate Earnings

https://www.ibtimes.co.uk/michael-burry-criticizes-tech-giants-ai-accounting-1781491
1•ironyman•4m ago•0 comments

Show HN: Draw on Screen – a modern screen annotation tool with webcam

https://drawonscreen.com/vs/epicpen/
3•markjivko•5m ago•0 comments

DataClaw

https://huggingface.co/datasets?other=dataclaw
1•notsahil•5m ago•0 comments

Spotify Urn

https://liquiddeath.com/pages/spotify-urn
1•giancarlostoro•6m ago•0 comments

US judge dismisses xAI trade-secrets lawsuit against rival OpenAI for now

https://finance.yahoo.com/news/us-judge-dismisses-xai-trade-201030751.html
1•pinewurst•6m ago•0 comments

IMockupper: I built an AI tool to automate App Store asset generation

1•damdafayton•7m ago•0 comments

GitHub Copilot CLI is now generally available

https://github.blog/changelog/2026-02-25-github-copilot-cli-is-now-generally-available/
1•chrfritsch•7m ago•0 comments

Show HN: Rediflow – SSR project management, one source of truth, no spreadsheet

https://gitlab.com/rediflow_eu/rediflow
1•janipaijanen•9m ago•0 comments

The State of AI Agents in 2026: $211B VC Funding, 92% Drop in Inference Costs

https://meditations.metavert.io/p/the-state-of-ai-agents-in-2026
1•Ross00781•10m ago•0 comments

A Style Guide for AI Agent Skills

https://github.com/mgechev/skills-best-practices
1•mgechev•11m ago•0 comments

SambaNova Eyes 10T Parameter Models for Agentic AI with New Chip

https://www.hpcwire.com/2026/02/24/sambanova-eyes-10-trillion-parameter-models-for-agentic-ai-wit...
1•rbanffy•11m ago•0 comments

Show HN: An agent that records Loom-style demos

https://www.rundown.video/
1•guico•11m ago•0 comments

We Opensourced xAI's Macrohard: SOTA 82% on OSWorld

https://coasty.ai/
1•PrateekJ17•12m ago•0 comments

Luciano Floridi on the LLM "writing style"

https://www.facebook.com/luciano.floridi.2025/posts/if-you-have-read-a-sufficient-number-of-llm-w...
1•danielam•13m ago•0 comments

Claude Code Scheduler

https://github.com/jshchnz/claude-code-scheduler
1•jshchnz•14m ago•0 comments

Oops, You Wrote a Database

https://dx.tips/oops-database
1•swyx•14m ago•0 comments

Mycelial turnover and persistence of wood-decay fungi at the microscale

https://nph.onlinelibrary.wiley.com/doi/10.1111/nph.70957
2•PaulHoule•16m ago•0 comments

Attyx – tiny and fast GPU-accelerated terminal emulator written in Zig

https://github.com/semos-labs/attyx
2•nicholasrq•18m ago•0 comments

Rippled: Decentralized cryptocurrency blockchain daemon, XRP Ledger in C++

https://github.com/XRPLF/rippled
1•klaussilveira•19m ago•0 comments

Show HN: TypeDB Studio's AI agent for schema exploration and query generation

https://typedb.com/blog/vibe-querying-with-typedb-studio
2•flyingsilverfin•19m ago•1 comment

Anthropic acquires Vercept_AI to advance Claude's computer use capabilities

https://twitter.com/AnthropicAI/status/2026705792033026465
1•bigwheels•19m ago•1 comment

Russia sends migrants into Europe through secret tunnels

https://www.telegraph.co.uk/world-news/2026/02/25/russia-sends-migrants-into-europe-through-secre...
3•breppp•19m ago•1 comment

SysNav – An Intelligent Cockpit for DevOps (Local-First)

https://www.sysnav.ai/
1•sys_ravi•20m ago•1 comment

Transnormalism

https://www.gleech.org/enhance
1•speckx•21m ago•0 comments

Mojo 1.0 and compiler open-sourcing planned for 2026

https://twitter.com/Modular/status/2026703863215174028
1•ivell•21m ago•0 comments