frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Lucid – Catch hallucinations in AI-generated code before they ship

https://github.com/gtsbahamas/hallucination-reversing-system
3•jordanappsite•3h ago
Hi HN, I'm Ty. I built LUCID because I kept shipping bugs that my AI coding assistant hallucinated into existence.

Three independent papers have proven that LLM hallucination is mathematically inevitable (Xu et al. 2024, Banerjee et al. 2024, Karpowicz 2025). You can't train it away. You can't prompt it away. So I built a verification layer instead.

How it works: LUCID extracts implicit claims from AI-generated code (e.g., "this function handles null input," "this query is injection-safe," "this handles concurrent access"), then uses a second, adversarial AI pass to verify each claim against the actual implementation. You get a report showing exactly what would have shipped to production without verification.

"But can't the verifier hallucinate too?" Yes -- and that's the right question. The benchmarks below were validated by running real test suites, not by trusting LUCID's judgment. The value is that structured claim extraction + adversarial verification catches bugs that a single generation pass misses. The architecture also supports swapping LLM verification for formal methods (SMT solvers, property-based testing) per claim type as those integrations mature.

Benchmarks:

- HumanEval: 86.6% baseline -> 100% pass@5 with LUCID (164/164 problems) - SWE-bench: 18.3% baseline -> 30.3% with LUCID (+65.5%) - Both benchmarks were validated by running actual test suites, not by LLM judgment - LLM-as-judge actually performs worse at higher k values -- it hallucinates false positives

Three ways to use it:

1. MCP Server (Claude Code, Cursor, Windsurf) -- one config line, verification as a native tool 2. GitHub Action -- automated verification on every PR with inline comments 3. CLI -- npx lucid verify --repo /path/to/code

Free tier: 100 verifications/month. Get a key at https://trylucid.dev

Code: https://github.com/gtsbahamas/hallucination-reversing-system Paper: https://doi.org/10.5281/zenodo.18522644 Dashboard: https://trylucid.dev

Show HN: SQL-tap – Real-time SQL traffic viewer for PostgreSQL and MySQL

https://github.com/mickamy/sql-tap
84•mickamy•4h ago•14 comments

Show HN: I spent 3 years reverse-engineering a 40 yo stock market sim from 1986

https://www.wallstreetraider.com/story.html
231•benstopics•4d ago•72 comments

Show HN: Data Engineering Book – An open source, community-driven guide

https://github.com/datascale-ai/data_engineering_book/blob/main/README_en.md
153•xx123122•11h ago•13 comments

Show HN: Skill that lets Claude Code/Codex spin up VMs and GPUs

https://cloudrouter.dev/
121•austinwang115•13h ago•33 comments

Show HN: ClipPath – Paste screenshots as file paths in your terminal

https://github.com/BiteCraft/ClipPath
9•viniciusborgeis•7h ago•0 comments

Show HN: OpenWhisper – free, local, and private voice-to-text macOS app

https://github.com/richardwu/openwhisper
26•rwu1997•13h ago•8 comments

Show HN: Moltis – AI assistant with memory, tools, and self-extending skills

https://www.moltis.org
100•fabienpenso•1d ago•39 comments

Show HN: Lucid – Catch hallucinations in AI-generated code before they ship

https://github.com/gtsbahamas/hallucination-reversing-system
3•jordanappsite•3h ago•0 comments

Show HN: MicroVibe – minimal JSX web starter

https://github.com/melendezgg/microvibe
3•melendezgg•4h ago•1 comments

Show HN: Musecl-memory – Zero-dependency memory sync for AI agents bash and Git

https://github.com/musecl/musecl-memory
2•musecl•4h ago•0 comments

Show HN: Verify-before-release x402 gateway for AI agent transactions

2•settlddotwork•4h ago•0 comments

Show HN: Geo Racers – Race from London to Tokyo on a single bus pass

https://geo-racers.com/
137•pattle•1d ago•85 comments

Show HN: Skybolt Reflect – C++ header-only runtime reflection library

https://github.com/Prograda/skybolt-reflect
2•matt128•6h ago•0 comments

Show HN: Turn OpenClaw in a high performing development team with DevClaw

https://github.com/laurentenhoor/devclaw
2•laurentenhoor•7h ago•0 comments

Show HN: AI agents play SimCity through a REST API

https://hallucinatingsplines.com
213•aed•4d ago•72 comments

Show HN: Sol LeWitt-style instruction-based drawings in the browser

https://intervolz.com/sollewitt/
68•intervolz•3d ago•14 comments

Show HN: Explore ASN Relationships and BGP Route History with Real Internet Data

https://ipiphistory.com/
3•wangjiajie917•9h ago•0 comments

Show HN: Koala Diff – High-performance local data comparison (Rust and Polars)

https://github.com/godalida/koala-diff
2•godalida•9h ago•1 comments

Show HN: ClawProxy: An HTTP proxy that injects auth tokens into API calls

https://github.com/mlolson/clawproxy
2•LordHumungous•10h ago•0 comments

Show HN: Kuro-Nuri – Browser-based image redaction and compression using WASM

https://kuro-nuri.com/
4•kunronuri•11h ago•1 comments

Show HN: Bubble Sort on a Turing Machine

https://github.com/purplejacket/bubble_sort_on_tm
3•purplejacket•11h ago•0 comments

Show HN: Holywell – The missing SQL formatter for sqlstyle.guide

https://github.com/vinsidious/holywell
2•vcoppola•11h ago•0 comments

Show HN: A reputation index from mitchellh's Vouch trust files

https://vouchbook.dev/
2•rosslazer•11h ago•0 comments

Show HN: CodeRLM – Tree-sitter-backed code indexing for LLM agents

https://github.com/JaredStewart/coderlm/blob/main/server/REPL_to_API.md
79•jared_stewart•2d ago•36 comments

Show HN: Rowboat – AI coworker that turns your work into a knowledge graph (OSS)

https://github.com/rowboatlabs/rowboat
202•segmenta•3d ago•56 comments

Show HN: Ghost – Session memory for Claude Code (local, qmd, Git-integrated)

https://github.com/notkurt/ghost
2•notkurt•12h ago•1 comments

Show HN: I speak 5 languages. Common apps taught me none. So I built lairner

https://lairner.com
32•t17r•17h ago•83 comments

Show HN: Agent Alcove – Claude, GPT, and Gemini debate across forums

https://agentalcove.ai
64•nickvec•2d ago•26 comments

Show HN: Pgclaw – A "Clawdbot" in every row with 400 lines of Postgres SQL

https://github.com/calebwin/pgclaw
45•calebhwin•1d ago•33 comments

Show HN: I built a macOS tool for network engineers – it's called NetViews

https://www.netviews.app
241•n1sni•4d ago•60 comments