This wasn't an OCR error. The model didn't confuse a "7" for a "1." It generated a plausible-looking receipt from scratch — different store, different items, different prices. If I hadn't been holding the original, I might not have caught it.
Same image, different model (same parameter count, same hardware), five seconds later: every item correct, store name right, total accurate to the penny.
The models: minicpm-v 8B (fabricated) vs qwen3-vl 8B (accurate). Both open source, both ~6GB VRAM, both running locally via Ollama on an RTX 5080.
What I learned:
1. Vision model hallucination is qualitatively different from text hallucination. A text model gives you a wrong answer to a real question. A vision model gives you a confident answer to an image it didn't process. The second is harder to detect.
2. Model selection matters more than prompt engineering for vision. Same prompt, same image — one model fabricated, one read accurately. No prompt optimization fixes a model that invents data.
3. Confidence scoring is mandatory. I added a reconciliation check: do the extracted items sum to roughly the stated total? This catches fabrication that looks plausible at the individual line-item level.
4. The fix wasn't more money or a bigger model. Same size (8B), same hardware, same cost ($0). Just a different architecture that actually reads pixels instead of generating plausible text about them.
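The reconciliation check in point 3 can be sketched in a few lines. This is my own illustration, not the author's pipeline code — the function name, data shape, and tolerance are assumptions; the idea is simply: sum the extracted line items and compare against the stated total, flagging anything that drifts beyond a small margin.

```python
def reconcile(items, stated_total, tolerance=0.02):
    """Sanity-check an extracted receipt.

    items: list of (description, price) tuples pulled from the image.
    stated_total: the total the model claims the receipt shows.
    tolerance: allowed relative drift (2% here, to absorb rounding
               and minor read errors without passing fabrications).

    Returns (ok, computed_sum).
    """
    computed = round(sum(price for _, price in items), 2)
    # Compare against a relative margin; floor at 1.0 so tiny
    # receipts don't get a near-zero tolerance window.
    ok = abs(computed - stated_total) <= tolerance * max(stated_total, 1.0)
    return ok, computed

# Plausible-looking fabricated items rarely happen to sum to the
# real total, so this cheap check catches whole-receipt invention.
print(reconcile([("milk", 3.50), ("bread", 2.25)], 5.75))   # sums match
print(reconcile([("milk", 3.50), ("bread", 2.25)], 12.40))  # flagged
```

It won't catch a fabrication whose invented items happen to sum correctly, but as a zero-cost gate it converts "trust the model" into "trust the model when its own numbers agree with each other."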
Full writeup with the pipeline architecture and code patterns: https://dev.to/rayne_robinson_e479bf0f26/my-ai-read-a-receipt-wrong-it-didnt-misread-it-it-made-one-up-4f5n