Show HN: LLM Hallucination Detector – Works with GPT, Claude, and Local Models

https://github.com/Mattbusel/LLM-Hallucination-Detection-Script
2•Shmungus•1d ago
I built a lightweight hallucination detector that works with any LLM API.

It checks for signs of hallucinated or unreliable output using a multi-method approach (overconfidence patterns, factual density, coherence, contradictions, etc.).

What it does:

Works with GPT, Claude, local models (e.g., Mistral, DialoGPT)

Outputs a hallucination probability (0.0–1.0)

Flags overconfident or uncertain language

Scores factual density, coherence, and contradictions

Compares responses to context (if provided)

Fully framework-agnostic — no extra dependencies

Built for production + research workflows
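As a rough illustration of the "flags overconfident or uncertain language" item above, here is a minimal sketch of phrase-based flagging. The phrase lists and the `confidence_flags` function are hypothetical and chosen only for this example; they are not taken from the repository.

```python
import re

# Hypothetical phrase lists, for illustration only (not the project's actual patterns).
OVERCONFIDENT = ["definitely", "certainly", "without a doubt", "always", "never", "guaranteed"]
UNCERTAIN = ["might", "may", "possibly", "perhaps", "i think", "not sure"]


def _count(phrases, text):
    """Count whole-word, case-insensitive occurrences of any phrase in text."""
    pattern = r"\b(?:" + "|".join(re.escape(p) for p in phrases) + r")\b"
    return len(re.findall(pattern, text, flags=re.IGNORECASE))


def confidence_flags(text):
    """Return raw counts plus an overconfidence rate normalised by word count."""
    words = max(len(text.split()), 1)  # avoid division by zero on empty input
    over = _count(OVERCONFIDENT, text)
    unc = _count(UNCERTAIN, text)
    return {
        "overconfident": over,
        "uncertain": unc,
        "overconfidence_rate": over / words,
    }


flags = confidence_flags("This is definitely true and it will never fail.")
print(flags["overconfident"], flags["uncertain"])
```

Counts like these are a surface signal only: they say something about phrasing, not about whether the claim is true.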

Benchmarked on 1,000+ samples:

F1: 0.75

AUC-ROC: 0.81

Fast: ~0.2s per response
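For readers who want to reproduce metrics like these on their own labelled data, here is a minimal pure-Python sketch of F1 and AUC-ROC. The toy labels and scores are made up for the example and have nothing to do with the benchmark quoted above. The 0.5 decision threshold is also an assumption.

```python
def f1_score(labels, preds):
    """F1 = harmonic mean of precision and recall, for binary 0/1 labels."""
    tp = sum(1 for y, p in zip(labels, preds) if y == 1 and p == 1)
    fp = sum(1 for y, p in zip(labels, preds) if y == 0 and p == 1)
    fn = sum(1 for y, p in zip(labels, preds) if y == 1 and p == 0)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def auc_roc(labels, scores):
    """Probability that a random positive outscores a random negative (ties count half)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy data: 1 = hallucinated, 0 = fine; scores are detector outputs in [0, 1].
labels = [1, 1, 0, 0, 1, 0]
scores = [0.9, 0.6, 0.4, 0.7, 0.3, 0.2]
preds = [1 if s >= 0.5 else 0 for s in scores]  # assumed threshold of 0.5

print(f1_score(labels, preds), auc_roc(labels, scores))
```

F1 depends on the chosen threshold, while AUC-ROC summarises ranking quality across all thresholds, which is why the two are usually reported together.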

Comes with plug-and-play examples:

OpenAI, Anthropic, local models

Flask API

Custom scoring configs

I’m giving this away free under MIT. Would love feedback, issues, PRs — or just to know if it helps you build safer LLM apps.

GitHub: https://github.com/Mattbusel/LLM-Hallucination-Detection-Scr...

Comments

Shmungus•1d ago
Hi HN!

I’m excited to share this lightweight hallucination detector I built to help identify unreliable or “hallucinated” outputs from LLMs like GPT, Claude, and various local models.

It uses multiple methods — from spotting overconfidence and contradictions to scoring factual density and coherence — to give a hallucination probability score for any generated response.

It’s framework-agnostic, fast (~0.2s per response), and designed for both research and production use. Plus, it’s completely free under the MIT license.

I’d love to hear your thoughts, feedback, and if you find it useful for your projects. Happy to answer questions or discuss how it works under the hood!

Thanks for checking it out!

akoboldfrying•1d ago
I'm impressed that you give precision and recall metrics for this -- and amazed that they are non-terrible. I'm amazed because a fully general hallucination detector is obviously a truth oracle -- it can answer any question about anything in the world, by framing the question as a statement and then asking whether that statement is a hallucination.

From among the analyses the tool makes, it makes sense to me that contradictions can be detected, since that doesn't require knowledge of the real world. I'm very interested in how you do this detection ("Logical inconsistencies") in practice. Likewise for "Logical progression".

Two questions:

1. Since "overconfidence" is treated as a red flag, won't applying your tool as a filter cause LLM response precision to drop, often unnecessarily? The safest answer an LLM can give to "When was the Eiffel Tower built?" is surely along the lines of "The Eiffel Tower may or may not have been built at some time in the past."

2. I don't see how this tool can detect the kind of hallucination that (a) involves no contradiction and (b) requires knowledge of the world. These come up often. Examples: Citing plausible-sounding but nonexistent court cases, calling plausible-sounding but nonexistent methods in an API.

Shmungus•1d ago
Thanks, really appreciate the thoughtful questions and skepticism (and totally agree: a “perfect” hallucination detector would be a truth oracle).

To your points:

1. Overconfidence and precision: You're right that filtering on overconfidence alone could tank precision. That’s why the tool doesn’t treat it as a strict red flag; it’s one of several signals, and the final hallucination score is a weighted combination of multiple metrics (confidence, density, contradictions, progression, etc.). Overconfident phrasing tends to correlate with hallucinations in aggregate, but the idea is never to penalize all confident answers, just to flag the ones where that confidence is unjustified by the context or content.

2. Detecting hallucinations that require world knowledge: Absolutely, those are the hardest cases. This tool doesn’t solve that. Instead, it acts as a proxy evaluator:
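To make the "weighted combination" point concrete, here is a minimal sketch of how several per-signal scores could be blended into one number. The signal names and weights are hypothetical, since the thread does not state the real ones. The `hallucination_score` function is likewise an illustration, not the project's code.

```python
# Hypothetical weights; the project's actual weights are not given in the thread.
WEIGHTS = {
    "overconfidence": 0.3,
    "factual_density": 0.2,
    "contradictions": 0.3,
    "incoherence": 0.2,
}


def hallucination_score(signals, weights=WEIGHTS):
    """Weighted average of per-signal scores; each signal is clamped to [0, 1].

    Signals missing from the input are treated as 0.0 (no evidence of a problem).
    """
    total = sum(weights.values())
    blended = sum(
        w * min(max(signals.get(name, 0.0), 0.0), 1.0)
        for name, w in weights.items()
    )
    return blended / total


# A confident answer with no other red flags stays well below 0.5.
print(hallucination_score({"overconfidence": 1.0}))
```

Under these assumed weights, maximal overconfidence alone contributes only about 0.3, so a confident but otherwise clean answer such as a plain date for the Eiffel Tower would not be flagged by a 0.5 cutoff.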

Factual density gives a rough measure of “how many claims are being made”

Overconfidence vs. ambiguity highlights where a model might be bluffing

Logical coherence and contradiction checks flag when a model violates internal structure (not ground truth)

But it won’t catch the subtle world-knowledge misses (like fake court cases or made-up API calls) unless you pair it with a grounded context or use external validators.
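As one example of what an "external validator" for made-up API calls could look like, here is a minimal sketch that checks whether a dotted Python name actually resolves. The `api_name_exists` helper is hypothetical and not part of the tool. It also covers only Python names that can be imported in the current environment.

```python
import importlib


def api_name_exists(dotted_name):
    """Return True if a name such as 'json.dumps' resolves to a real attribute.

    Note: importing a module executes its top-level code, so only run this
    against packages you trust.
    """
    parts = dotted_name.split(".")
    # Try the longest importable module prefix first, then walk the attributes.
    for i in range(len(parts), 0, -1):
        try:
            obj = importlib.import_module(".".join(parts[:i]))
        except (ImportError, ValueError):
            continue
        for attr in parts[i:]:
            if not hasattr(obj, attr):
                return False
            obj = getattr(obj, attr)
        return True
    return False


print(api_name_exists("json.dumps"))        # real function
print(api_name_exists("json.dump_pretty"))  # plausible-sounding but nonexistent
```

A check like this supplies the world knowledge the text-only signals lack, at the cost of being specific to one domain.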

The long-term hope is: use this tool to raise suspicion, not declare judgment. It's a cheap sanity layer, a “weak oracle” that’s fast, pluggable, and good enough to catch the dumb stuff before you escalate to expensive validators or human review.
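The "raise suspicion, then escalate" idea can be sketched as a simple routing step. The thresholds and route names below are hypothetical, as is the `triage` function itself. The sketch only illustrates using a cheap score to decide which responses get the expensive checks.

```python
def triage(score, low=0.3, high=0.7):
    """Route a response by hallucination score in [0, 1].

    Thresholds are illustrative assumptions, not values from the project.
    """
    if not 0.0 <= score <= 1.0:
        raise ValueError("score must be in [0, 1]")
    if score < low:
        return "pass"      # cheap layer sees nothing suspicious
    if score < high:
        return "validate"  # send to an external validator or grounded check
    return "review"        # suspicious enough to queue for human review


for s in (0.1, 0.5, 0.9):
    print(s, triage(s))
```

Nothing is rejected outright here: the score only decides how much further scrutiny a response receives, which matches the "weak oracle" framing.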