I got tired of LLM outputs silently failing in pipelines, so I built a small scoring layer around them.

It checks three things before your output moves forward: does it match the schema you expected, is it consistent across runs, and does it actually align with the context you provided?

Returns a confidence score and a risk level. That's mostly it.
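To make the idea concrete, here is a rough sketch of how three checks like these could be folded into a confidence score and a risk level. This is not hallx's actual code or API: every function name, the equal weighting, the word-overlap grounding measure, and the 0.75 / 0.5 thresholds are assumptions chosen purely for illustration.

```python
import json
import re
from difflib import SequenceMatcher
from itertools import combinations


def schema_score(output: str, required_keys: set) -> float:
    """Fraction of required keys present when the output parses as a JSON object."""
    try:
        data = json.loads(output)
    except json.JSONDecodeError:
        return 0.0
    if not isinstance(data, dict):
        return 0.0
    if not required_keys:
        return 1.0
    return len(required_keys & set(data)) / len(required_keys)


def consistency_score(samples: list) -> float:
    """Mean pairwise text similarity across repeated runs of the same prompt."""
    if len(samples) < 2:
        return 1.0
    ratios = [SequenceMatcher(None, a, b).ratio() for a, b in combinations(samples, 2)]
    return sum(ratios) / len(ratios)


def grounding_score(output: str, context: str) -> float:
    """Fraction of distinct output words that also appear in the context (crude proxy)."""
    out_words = set(re.findall(r"\w+", output.lower()))
    ctx_words = set(re.findall(r"\w+", context.lower()))
    if not out_words:
        return 0.0
    return len(out_words & ctx_words) / len(out_words)


def score(output: str, samples: list, context: str, required_keys: set):
    """Combine the three checks with equal weights (an assumption) into (confidence, risk)."""
    confidence = (
        schema_score(output, required_keys)
        + consistency_score(samples)
        + grounding_score(output, context)
    ) / 3
    # Hypothetical thresholds; a real tool would likely make these configurable.
    if confidence >= 0.75:
        risk = "low"
    elif confidence >= 0.5:
        risk = "medium"
    else:
        risk = "high"
    return confidence, risk


if __name__ == "__main__":
    answer = '{"answer": "Paris"}'
    print(score(answer, [answer, answer], "The capital of France is Paris", {"answer"}))
```

Word overlap is a weak stand-in for real grounding, which is the same caveat as in the post: if the context is poor, a score like this will be too.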

Works with OpenAI, Anthropic, Gemini, Ollama, and a few others. Sync and async are both supported. It's heuristic, not a guarantee: if your context is bad, the scores will be too. Leave a star if you find this useful.

Try now: pip install hallx

Show HN: Hallx – Hallucination risk scoring for LLM outputs
