frontpage.

Made with ♥ by @iamnishanth

Open Source @Github


Show HN: LLM-schema-guard – Rust proxy enforcing JSON schemas on LLM outputs

https://github.com/AncientiCe/llm-schema-guard
1•iCeGaming•1h ago
Hey everyone, I built llm-schema-guard because LLMs are amazing at spitting out JSON... until they suddenly aren't. Even with JSON mode or function calling, you still get missing fields, wrong types, or just plain broken syntax that kills your agents, RAG flows, or any tool-calling setup.

This is a lightweight Rust HTTP proxy that sits in front of any OpenAI-compatible API (think Ollama, vLLM, LocalAI, OpenAI itself, Groq, you name it). It grabs the generated output, checks it against a JSON Schema you provide, and only lets it through if it's valid. If the output is invalid, strict mode kicks back a clean 400 with details, while permissive mode auto-retries a few times, tweaking the prompt with a fix instruction and backing off exponentially between attempts.

Everything else stays the same: full streaming support (it buffers the response to validate), plus Prometheus metrics so you can monitor validation failures, retries, latency, and more. Config is simple YAML for upstreams, schemas per model, rate limiting, caching, etc. There's even an offline CLI if you just want to test schemas locally.

It's built with Axum and Tokio for low latency and high throughput, with jsonschema-rs under the hood. Docker Compose makes it dead simple to spin up with Ollama.
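To make the strict vs. permissive behaviour described above concrete, here is a rough Python sketch of the control flow. It is not the project's code (the proxy itself is written in Rust and uses jsonschema-rs). The names `guard`, `validate`, and `call_model`, the retry count, the base delay, and the wording of the fix instruction are all assumptions made for illustration. The tiny validator only understands `required` and top-level property `type`, and stands in for a real JSON Schema validator.

```python
import json
import time

# Hypothetical stand-in for a real JSON Schema validator: it only checks
# "required" and the "type" of top-level properties.
_TYPES = {
    "string": str, "integer": int, "number": (int, float),
    "boolean": bool, "object": dict, "array": list,
}


def _matches(value, type_name):
    if type_name in ("integer", "number") and isinstance(value, bool):
        return False  # bool is a subclass of int in Python
    expected = _TYPES.get(type_name)
    return expected is None or isinstance(value, expected)


def validate(text, schema):
    """Return a list of error strings; an empty list means the output is valid."""
    try:
        data = json.loads(text)
    except json.JSONDecodeError as exc:
        return [f"invalid JSON: {exc.msg}"]
    if not isinstance(data, dict):
        return ["expected a JSON object"]
    errors = [f"missing field: {name}"
              for name in schema.get("required", []) if name not in data]
    for name, spec in schema.get("properties", {}).items():
        if name in data and not _matches(data[name], spec.get("type")):
            errors.append(f"wrong type for {name}: expected {spec['type']}")
    return errors


def guard(prompt, schema, call_model, mode="strict",
          max_retries=3, base_delay=0.5, sleep=time.sleep):
    """Return (status, body): 200 with the model output, or 400 with details.

    Strict mode never retries. Permissive mode (assumed behaviour) retries up
    to max_retries times, appending a fix instruction to the prompt and
    doubling the delay before each new attempt.
    """
    retries = max_retries if mode == "permissive" else 0
    current_prompt = prompt
    errors = []
    for attempt in range(retries + 1):
        output = call_model(current_prompt)
        errors = validate(output, schema)
        if not errors:
            return 200, output
        if attempt < retries:
            sleep(base_delay * 2 ** attempt)  # base, 2x base, 4x base, ...
            problems = "; ".join(errors)
            current_prompt = (
                prompt
                + "\n\nYour previous reply was rejected ("
                + problems
                + "). Reply with valid JSON only."
            )
    return 400, json.dumps({"error": "schema validation failed",
                            "details": errors})
```

Passing `call_model` and `sleep` in as parameters keeps the sketch testable without a running upstream: `call_model` can be any function that takes a prompt string and returns the raw model text, and `sleep` can be replaced with a recorder to inspect the backoff delays.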

This grew out of my earlier schema-gateway project, and I'm happy to add stuff like Anthropic support, tool calling validation, or better streaming fixes if people find it useful. Stars or contributions are very welcome!

Thanks for taking a look :)

The monorepo: my personal software infrastructure

https://iafisher.com/blog/2026/01/monorepo-2026
1•thornewolf•2m ago•1 comment

NASA Demolishes Historic Test Stands That Built the Space Age

https://www.universetoday.com/articles/nasa-bids-farewell-to-historic-test-stands-that-built-the-...
1•oldnetguy•2m ago•0 comments

The Project G Stereo Was the Definition of Groovy

https://spectrum.ieee.org/project-g-stereo
1•oldnetguy•3m ago•0 comments

Show HN: Oauth2-Proxy-Injector

https://github.com/spacemule/oauth2-proxy-injector
1•spacemule•8m ago•0 comments

Addressing the Lack of Built-In Storage in UK Housing

https://dreamhomestore.co.uk/collections/bedroom-furniture-sets
1•tonypaterson•8m ago•2 comments

The birth of a WATERBEAR 1000x magnification [video]

https://www.youtube.com/shorts/d3jvvqwaaIE
1•surprisetalk•9m ago•0 comments

Snowflake Koch Fractal

https://xaya-e.bearblog.dev/11-xaya-advent/
1•surprisetalk•9m ago•0 comments

The 80s Movies Rewind

https://www.fast-rewind.com/index.html
1•surprisetalk•10m ago•0 comments

Most Code Is Just Cache

https://blog.sshh.io/p/most-code-is-just-cache
1•surprisetalk•10m ago•0 comments

Glass Onion by US Soccer, an Identifier Synchronization Tool

https://www.americansocceranalysis.com/home/2026/1/25/introducing-glass-onion-by-us-soccer-an-ide...
1•donohoe•11m ago•0 comments

A 3-part series on SQL performance optimisations

https://ohdear.app/news-and-updates/our-3-part-series-on-sql-performance-optimisations
1•Mojah•13m ago•0 comments

Any recent fiction with software engineer protagonist?

1•yedidmh•14m ago•0 comments

Gonemaster: A Golang clone of the Zonemaster DNS checker

https://codeberg.org/pawal/gonemaster
1•fanf2•14m ago•1 comment

Library of Congress Pywb Wayback Machine

https://webarchive-beta.loc.gov/playback-services/general/
1•fuggh•16m ago•0 comments

Adversarial AIs debate your question, a judge picks the winner

https://mindhalo.ai/debates/
1•ivanmikhnenkov•17m ago•0 comments

Apple to Launch These 20 Products This Year

https://www.macrumors.com/2026/01/25/rumored-apple-products/
1•01-_-•18m ago•0 comments

Self-Recognition in Conversational Agents

https://arxiv.org/abs/2002.02334
1•elasolova•20m ago•0 comments

Joyus: I Tried Datastar

https://ajmoon.com/posts/joyus-i-tried-datastar
1•alex-moon•24m ago•0 comments

Ultimate Guide to LLM Memory

https://fastpaca.com/blog/ultimate-guide-to-llm-memory/
1•cpluss•24m ago•0 comments

Spinning Around: Please Don't

https://www.siliceum.com/en/blog/post/spinning-around/
2•todsacerdoti•24m ago•0 comments

Explainable machine learning: k many q-dimensional best-fit polytopes

https://figshare.com/articles/media/A_demo_of_k-polytopes/7991909?file=14884349
1•elasolova•26m ago•0 comments

Show HN: MWP – Open spec for making web content AI-readable

https://github.com/mbumpus/mwp-spec
1•digitalegoai•27m ago•0 comments

Lawsuit Alleges That WhatsApp Has No End-to-End Encryption

https://www.pcmag.com/news/lawsuit-alleges-that-whatsapp-has-no-end-to-end-encryption?test_uuid=0...
3•OutOfHere•28m ago•0 comments

What is 9bizub, how it works, and how Nigerians should use it

https://9bizub.com/archives/3624
1•CapricornQueen•28m ago•0 comments

Let the Chinese Cars In

https://www.noahpinion.blog/p/let-the-chinese-cars-in
2•xrd•29m ago•0 comments

The Importance of Diversity

https://geohot.github.io//blog/jekyll/update/2026/01/27/the-importance-of-diversity.html
3•aabiji•30m ago•0 comments

Aarte: Personal AI Assistant

https://www.aarte.co
1•haeli05•31m ago•0 comments

Simpler JVM Project Setup with Mill 1.1.0

https://mill-build.org/blog/17-simpler-jvm-mill-110.html
1•lihaoyi•31m ago•0 comments

The Inverted Panopticon: China Weaponized the West's Own Wiretap Infrastructure

https://shanakaanslemperera.substack.com/p/the-inverted-panopticon
2•jc_811•31m ago•0 comments

I went off the deep end with AI

https://newsletter.masilotti.com/p/i-went-off-the-deep-end-with-ai
1•joemasilotti•32m ago•1 comment