frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Lucid – Catch hallucinations in AI-generated code before they ship

https://github.com/gtsbahamas/hallucination-reversing-system
3•jordanappsite•2h ago
Hi HN, I'm Ty. I built LUCID because I kept shipping bugs that my AI coding assistant hallucinated into existence.

Three independent papers have proven that LLM hallucination is mathematically inevitable (Xu et al. 2024, Banerjee et al. 2024, Karpowicz 2025). You can't train it away. You can't prompt it away. So I built a verification layer instead.

How it works: LUCID extracts implicit claims from AI-generated code (e.g., "this function handles null input," "this query is injection-safe," "this handles concurrent access"), then uses a second, adversarial AI pass to verify each claim against the actual implementation. You get a report showing exactly what would have shipped to production without verification.

"But can't the verifier hallucinate too?" Yes -- and that's the right question. The benchmarks below were validated by running real test suites, not by trusting LUCID's judgment. The value is that structured claim extraction + adversarial verification catches bugs that a single generation pass misses. The architecture also supports swapping LLM verification for formal methods (SMT solvers, property-based testing) per claim type as those integrations mature.

Benchmarks:

- HumanEval: 86.6% baseline -> 100% pass@5 with LUCID (164/164 problems) - SWE-bench: 18.3% baseline -> 30.3% with LUCID (+65.5%) - Both benchmarks were validated by running actual test suites, not by LLM judgment - LLM-as-judge actually performs worse at higher k values -- it hallucinates false positives

Three ways to use it:

1. MCP Server (Claude Code, Cursor, Windsurf) -- one config line, verification as a native tool 2. GitHub Action -- automated verification on every PR with inline comments 3. CLI -- npx lucid verify --repo /path/to/code

Free tier: 100 verifications/month. Get a key at https://trylucid.dev

Code: https://github.com/gtsbahamas/hallucination-reversing-system Paper: https://doi.org/10.5281/zenodo.18522644 Dashboard: https://trylucid.dev

TypeScript's Power in Plain JavaScript

https://dvcoolarun.com/typescript/jsdoc/2024/09/02/TypeScript-power-in-plain-javascript.html
1•dvcoolarun•20s ago•0 comments

Show HN: Mdr – TUI Markdown Reader

https://github.com/seymores/mdr
1•seymores•2m ago•0 comments

Context management is the real bottleneck in AI-assisted coding

1•hoangnnguyen•4m ago•0 comments

Insider Analytics – We have built a insider trading tracking platform

https://insideranalytics.ai
2•TheoJohn•7m ago•0 comments

Show HN: Scansprout – QR code generator I extracted from an art gallery project

https://www.scansprout.com/
1•veryhungryhippo•13m ago•0 comments

Show HN: DevUtility Hub Source Code – 117 Tools in Next.js 15

https://www.devutilityhub.me/
1•badboyshah•14m ago•0 comments

MiniMax M2.5 SOTA in Coding and Agent, Designed for Agent Universe

https://www.minimax.io/models/text
2•virgildotcodes•19m ago•0 comments

They Asked Me to Open ChatGPT During My Job Interview

https://old.reddit.com/r/jobs/comments/1r3we1z/they_asked_me_to_open_chatgpt_during_my_job/
2•_____k•19m ago•1 comments

ByteDance Seed2.0 LLM: breakthrough in complex real-world tasks

https://seed.bytedance.com/en/blog/seed2-0-%E6%AD%A3%E5%BC%8F%E5%8F%91%E5%B8%83
4•cyp0633•28m ago•4 comments

The SEC closed its investigation into Fisker

https://techcrunch.com/2026/02/13/the-sec-closed-its-investigation-into-fisker/
2•SilverElfin•29m ago•1 comments

First Proof

https://1stproof.org/
1•tosh•30m ago•0 comments

Washington pushes back against EU's bid for tech autonomy

https://www.politico.eu/article/eu-bid-for-tech-autonomy-washington-us-pushes-back/
3•frm88•32m ago•0 comments

Apple Reveals How Many iPhones Are Running iOS 26

https://www.macrumors.com/2026/02/13/apple-shares-ios-26-adoption-stats/
2•tosh•33m ago•0 comments

The Final Bottleneck

https://lucumr.pocoo.org/2026/2/13/the-final-bottleneck/
2•tosh•39m ago•0 comments

Show HN: HelloAria – AI task manager where you talk instead of type

https://www.helloaria.io/
1•saitharun_stk•39m ago•1 comments

Do Not Outsource Judgement

https://dncrews.com/do-not-outsource-judgement-76f9e5be61b9
5•mawaldne•40m ago•1 comments

Painless Activation Steering (PAS)

https://sashacui.substack.com/p/painless-activation-steering-pas
1•SashaCui•40m ago•1 comments

Show HN: Quantitative analysis of Alphabet (GOOGL) financials

https://jasonhonkl.github.io/#alphabet-quantitative-analysis
2•JasonHEIN•46m ago•0 comments

I love using TypeScript at work

https://kwojcicki.github.io/blog/WHY-I-LOVE-TYPESCRIPT
1•kwojcicki•49m ago•0 comments

14 More Lessons from 14 years at Google

https://addyosmani.com/blog/14-more-lessons/
4•talonx•1h ago•0 comments

Show HN: Swarm Curl

https://github.com/ismdeep/swarm-curl
2•ismdeep•1h ago•1 comments

The AI Dilemma

https://www.aleksandrhovhannisyan.com/blog/the-ai-dilemma/
2•aleksandrh•1h ago•0 comments

Cyber Model Arena

https://www.wiz.io/cyber-model-arena
2•ram_rattle•1h ago•0 comments

Pg_stat_ch: A PostgreSQL extension that exports every metric to ClickHouse

https://clickhouse.com/blog/pg_stat_ch-postgres-extension-stats-to-clickhouse
2•saisrirampur•1h ago•0 comments

Why haven't humans been back to the moon in over 50 years?

https://www.cnn.com/2026/02/13/science/why-humans-have-not-been-back-to-moon
3•ablaba•1h ago•2 comments

Jikipedia, a new AI-powered wiki reporting on key figures in the Epstein scandal

https://twitter.com/jmailarchive/status/2022482688691835121
2•wenjel•1h ago•0 comments

Show HN: Heart Note – a tiny web app to send beautiful one‑off digital letters

https://heartnote.online
2•azabraao•1h ago•0 comments

SnowBall: Iterative Context Processing When It Won't Fit in the LLM Window

https://enji.ai/tech-articles/snowball-iterative-context-processing/
1•puzanov•1h ago•0 comments

How to be a good Asian parent (satire)

https://www.reddit.com/r/AsianParentStories/s/yyMDWcAUdh
1•carabiner•1h ago•1 comments

The Compliance Officer Who Flagged Epstein – and Lost Her Job

https://www.levernews.com/the-compliance-officer-who-flagged-epstein-and-lost-her-job/
2•cwwc•1h ago•0 comments