How it works: (1) Paste any AI response. (2) Extractor identifies factual claims — names, dates, numbers, citations. (3) Each claim gets searched independently via HTTP. (4) Comparator checks search evidence against claims. (5) Reporter scores overall credibility.
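To give a sense of the shape, here is a minimal sketch of that flow. Every name below is an illustrative placeholder, not the project's actual modules or functions.

```python
# Sketch of the five-step flow; all names are placeholders, not the repo's API.
from dataclasses import dataclass

@dataclass
class Verdict:
    claim: str
    supported: bool
    note: str

def extract_claims(text: str) -> list[str]:
    # Placeholder: the real extractor asks Claude for names, dates, numbers, citations.
    return [line.strip() for line in text.splitlines() if line.strip()]

def search_claim(claim: str) -> str:
    # Placeholder: the real step runs an independent HTTP search per claim.
    return ""

def compare_evidence(claim: str, evidence: str) -> Verdict:
    # Placeholder: the real comparator checks whether the evidence supports the claim.
    note = "supported" if evidence else "no evidence found"
    return Verdict(claim, supported=bool(evidence), note=note)

def score_report(verdicts: list[Verdict]) -> float:
    # Placeholder credibility score: fraction of claims with supporting evidence.
    return sum(v.supported for v in verdicts) / max(len(verdicts), 1)

def check(ai_response: str) -> float:
    verdicts = [compare_evidence(c, search_claim(c)) for c in extract_claims(ai_response)]
    return score_report(verdicts)
```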
7 Python modules, ~27KB total. Uses Claude API for extraction/comparison and direct search for verification. Streamlit web UI with color-coded cards per claim.
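The extraction step presumably goes through the Anthropic Messages API. A hedged sketch of what that call could look like; the prompt, the JSON convention, and the parsing are my assumptions, not the repo's code.

```python
import json
import anthropic

client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment

def extract_claims(ai_response: str) -> list[str]:
    # Ask the model to list discrete factual claims as a JSON array of strings.
    msg = client.messages.create(
        model="claude-sonnet-4-20250514",
        max_tokens=1024,
        messages=[{
            "role": "user",
            "content": "List every factual claim (names, dates, numbers, citations) "
                       "in the following text as a JSON array of strings:\n\n" + ai_response,
        }],
    )
    return json.loads(msg.content[0].text)
```

The comparison step would be a similar call, just with the claim and its search evidence in the prompt.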
The thesis: Hallucination is an architecture problem, not a scale problem. LLMs compute argmax over P(next token | context), i.e. the most likely continuation, not P(true). More parameters refine the guess, but "most likely" ≠ "most true." So instead of making the guesser better, add an independent verification layer that runs on logic, not statistics.
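A toy illustration of that point, with made-up numbers:

```python
# Toy next-token distribution for "The treaty was signed in ____".
# Suppose the true year is 1915 but the training data skews toward 1912.
p_next = {"1912": 0.46, "1915": 0.38, "1920": 0.16}

guess = max(p_next, key=p_next.get)  # argmax P(token | context) -> "1912"
# A bigger model may sharpen these probabilities, but the decision rule stays
# argmax, so the output tracks "most likely in training data", not "true".
# That is why the verification layer sits outside the generator.
```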
The meta-irony: During code review, I had Claude write the code and Gemini review it. Gemini flagged claude-sonnet-4-20250514 as a "fictional model" and issued a critical blocking warning. The model is real; Gemini's training cutoff made it hallucinate about a model name while reviewing a hallucination detector. Then Claude summarized that "all three AIs approved" when only two were involved. A human caught both errors with one sentence each.
Built on a 32KB deductive reasoning engine (9 axioms, fractal-verified across 6 relationship scales). Also open source.
Detector: https://github.com/ZhangXiaowenOpen/hallucination-detector
All projects: https://github.com/ZhangXiaowenOpen
MIT + Heart Clause license. Solo dev + AI collaboration. Happy to answer questions about the architecture or why deductive verification will outlast RAG-based approaches.