Most existing hallucination detectors rely on full LLM inference (expensive and slow) or struggle with long-context inputs.
I built LettuceDetect, an open-source, encoder-only framework that detects hallucinated spans in LLM-generated answers based on the retrieved context. No LLM calls needed, and it runs much more efficiently.
Highlights:
- Token-level hallucination detection (unsupported spans flagged based on retrieved evidence)
- Built on ModernBERT — handles up to 4K token contexts
- 79.22% F1 on the RAGTruth benchmark (beats previous encoder models, competitive with LLMs)
- MIT licensed
- Includes a Python package, pretrained models, and a Hugging Face demo (quick usage sketch below)
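Here's roughly what usage looks like with the Python package. This is a sketch from memory of the repo's README: the class name `HallucinationDetector`, the `predict` signature, and the model id are assumptions, so check the GitHub repo and the KRLabsOrg Hugging Face page for the current API.

```python
# Rough usage sketch; names and model id are from memory, verify against the README.
from lettucedetect.models.inference import HallucinationDetector

detector = HallucinationDetector(
    method="transformer",
    model_path="KRLabsOrg/lettucedect-base-modernbert-en-v1",  # model id may differ
)

contexts = [
    "France is a country in Europe. The capital of France is Paris. "
    "The population of France is 67 million."
]
question = "What is the capital of France? What is the population of France?"
answer = "The capital of France is Paris. The population of France is 69 million."

# Returns character spans of the answer that are unsupported by the retrieved context.
spans = detector.predict(
    context=contexts, question=question, answer=answer, output_format="spans"
)
print(spans)  # e.g. a span covering "The population of France is 69 million."
```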
GitHub: https://github.com/KRLabsOrg/LettuceDetect
Blog: https://huggingface.co/blog/adaamko/lettucedetect
Preprint: https://arxiv.org/abs/2502.17125
Models/Demo: https://huggingface.co/KRLabsOrg
Would love feedback from anyone working on RAG, hallucination detection, or efficient LLM evaluation. Also exploring real-time hallucination detection (while the answer is being generated, not just post-generation); open to thoughts/collab there.