frontpage.

Hi HN,

I built Sieves, an open-source Python library that makes it easy to build structured document AI pipelines without locking yourself into any specific LLM framework.

You can mix and match model frameworks (Outlines, LangChain, DSPy, GLiNER2, Transformers) for different tasks while keeping one declarative pipeline definition. E.g. fast local models for classification and frontier LLMs for more challenging tasks.

Includes:

  - unified task and schema abstractions 
  - pipeline for task chaining 
  - execution across multiple backends  
  - built-in evaluation, optimization, and conditional task execution  
  - support for distillation to smaller models

Full motivation, design rationale, and examples in the linked blog.

tl;dr: I was doing a lot of consulting/prototyping for document AI projects and kept running into the same lock-in and boilerplate issues, so I decided to write a library that addresses this.

If you're working a lot with document AI in greenfield projects, this may be interesting to you.

Happy to answer questions or feedback!

Repo: https://github.com/MantisAI/sieves / docs: https://sieves.ai

What I learned launching a simple ad marketplace in 24 hours

From Claude Code to Figma: Turning production code into editable Figma designs

The Obscure Media Theory That Explains '99% of Everything'

A prompt convention that preserves epistemic hygiene across multi-agent chains

Claude Code is powerful. Pilot makes it reliable

How Americans view Elon Musk and Mark Zuckerberg

Melbourne man sets up West Gate Bridge livestream from his driveway

Chrome Goldmine: Expired Chrome Extensions as Micro-SaaS

Where Christian nationalism is most dominant in the U.S.

Pulsar Found Near the Center of the Milky Way Could Test Einstein's Theories

Restish is a CLI for interacting with REST-ish HTTP APIs

Audible Launches Immersion Reading for Deeper Engagement with Books

Gwt-zsh – Stupidly simple Git worktree management

Project Paperclip. The time has come

Forget DeepSeek, dying alone is China's latest tech obsession

Connected Papers: Explore connected papers in a visual graph

Six Claude Code Strategies for a Productive Workflow

Linux 7.0 Showing Some Early Performance Regressions on Intel Panther Lake

Reacting to news is basically a cheat code for LinkedIn

Two mechanisms for dynamic type checks

Show HN: SkillForge – Turn screen recordings into agent skill files

Mark Zuckerberg testifies in L.A. trial over social media harms

How Unitree Trained Robots to Master Real Kung Fu Moves

Show HN: Zenhub Analytics for Multiple Workspaces

Kernel-enforced sandbox App and SDK for AI agents, MCP and LLM workloads

India tells univ. to leave AI summit after presenting Chinese robot as its own

Womens Sizing

Show HN: macOS native DAW with Git branching model

Genome-wide association study of major anxiety disorders in 122,341 people

A terminal you can curl

Show HN: Sieves, a unified interface for structured document AI