frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Formal Proof: LLM Hallucinations Are Structural, Not Statistical (Coq Verified)

https://philpapers.org/rec/SCHTIC-17
2•ICBTheory•41m ago

Comments

ICBTheory•35m ago
Author here.

This paper is Part III of a trilogy investigating the limits of algorithmic cognition. Given the recent industry signals regarding "scaling plateaus" (e.g., Sutskever etc.), I attempt to formalize why these limits appear structurally unavoidable.

The Thesis: We model modern AI as a Probabilistic Bounded Semantic System (P-BoSS). The paper demonstrates via the "Inference Trilemma" that hallucinations are not transient bugs to be fixed by more data, but mathematical necessities when a bounded system faces fat-tailed domains (alpha ≤ 1).

The Proof: While this paper focuses on the CS implications, the underlying mathematical theorems (Rice’s Theorem applied to Semantic Frames, Sheaf Theoretic Gluing Failures) are formally verified using Coq.

You can find the formal proofs and the Coq code in the companion paper (Part II) here:

https://philpapers.org/rec/SCHTIC-16

I’m happy to discuss the P-BOSS definition and why probabilistic mitigation fails in divergent entropy regimes.

wiz21c•17m ago
Since we can't avoid hallucinations, maybe we can live with them ?

I mean, I regularly use LLM's and although, sometimes, they go a bit mad, most of the time they're really helpful

Show HN: I built a 1.8MB native app with self-built UI, vision and AI libraries

https://github.com/Okery/Aivition
1•jaramy•16s ago•0 comments

How to Sell to Engineers

https://www.aadillpickle.com/blog/how-to-sell-to-engineers
1•strangerding•32s ago•0 comments

Nvidia Invests $2B in Synopsys

https://www.morningstar.com/news/dow-jones/202512013548/nvidia-invests-2-billion-in-synopsys
2•mgh2•51s ago•0 comments

Six billion reasons to cheer for Shopify

https://world.hey.com/dhh/six-billion-reasons-to-cheer-for-shopify-55720846
1•software_writer•2m ago•1 comments

GoConnect – A social network limited to 5-person dev squads

1•GoConnectDev•2m ago•0 comments

Lost.js – Local, Offline, Shareable Tools

https://github.com/grothkopp/lost.js
1•growt•3m ago•1 comments

Supercharge Your AI with the Right Context: Grounded Docs MCP Server Updated

https://old.reddit.com/r/CLine/comments/1pbf9bm/supercharge_cline_with_the_right_context_grounded/
2•arabold•5m ago•0 comments

Some musings on code generation: kintsugi

https://blog.engora.com/2025/12/some-musings-on-code-generation-kintsugi.html
1•Vermin2000•9m ago•0 comments

Harper Turns 1.0 Today

https://elijahpotter.dev/articles/harper-turns-1.0-today
1•chilipepperhott•9m ago•0 comments

DeepSeek-v3.2: Pushing the Frontier of Open Large Language Models [pdf]

https://huggingface.co/deepseek-ai/DeepSeek-V3.2/resolve/main/assets/paper.pdf
2•pretext•9m ago•0 comments

Hiring at Arya Health

https://wellfound.com/company/arya-health-ai
1•maliniwim•10m ago•1 comments

Multi-Horizon Delivery Framework

https://yusufaytas.com/multi-horizon-delivery-framework/
8•yusufaytas•10m ago•0 comments

Neurologists warn against controversial migraine surgery

https://english.elpais.com/health/2025-11-18/neurologists-warn-against-controversial-migraine-sur...
1•PaulHoule•11m ago•0 comments

Pacsea – new package manager for Arch Linux

https://github.com/Firstp1ck/Pacsea
1•ssummoner001•13m ago•0 comments

Show HN: Vect AI– The "Resonance Engine" for high-growth marketing

https://vect.pro/
1•asaws•13m ago•0 comments

The "Inhuman Centipede" and Identity

https://syntheticauth.ai/posts/synthetic-auth-report-issue-020#carbon-based-paradox
1•zerolayers•14m ago•1 comments

Saving Skylab

https://www.airandspace.si.edu/stories/editorial/saving-skylab
1•fanf2•15m ago•0 comments

AI-Assisted Coding Killed My Joy of Programming

https://meysam.io/blog/ai-assisted-coding-killed-programming-joy/
2•meysamazad•15m ago•0 comments

Learn with Ari

https://arihara-sudhan.github.io/learn-with-ari/
1•arihara-sudhan•19m ago•0 comments

What is scalability anyway? (2024)

https://brooker.co.za/blog/2024/01/18/scalability.html
1•linhns•24m ago•0 comments

Digital Omnibus: Analysis of GDPR and EPrivacy Proposals by the Commission

https://noyb.eu/en/digital-omnibus-first-analysis-select-gdpr-and-eprivacy-proposals-commission
2•buzer•24m ago•0 comments

The Oceans Are Going to Rise–But When?

https://www.wired.com/story/the-oceans-are-going-to-rise-but-when/
3•Brajeshwar•25m ago•0 comments

Web dev's crawler took down major online bookstore by buying too many books

https://www.theregister.com/2025/12/01/who_me/
1•Brajeshwar•25m ago•0 comments

Zipcar proposes to cease its UK operations

https://support.zipcar.co.uk/hc/en-gb/articles/46980698921875-Zipcar-proposes-to-cease-its-UK-ope...
5•seasicksteve•25m ago•0 comments

College Students Choosing A.I. Majors over Computer Science

https://www.nytimes.com/2025/12/01/technology/college-computer-science-ai-boom.html
1•fleahunter•27m ago•0 comments

Plato's Republic as an iMessage Thread

https://pmohun.github.io/therepublic-txt/
1•pmohun•28m ago•1 comments

Google *Unkills* JPEG XL?

https://tonisagrista.com/blog/2025/google-unkills-jpegxl/
2•speckx•28m ago•0 comments

Show HN: Walrus – a Kafka alternative written in Rust

https://github.com/nubskr/walrus
2•janicerk•29m ago•0 comments

Evo-Memory: Benchmarking LLM Agent Test-Time Learning with Self-Evolving Memory

https://arxiv.org/abs/2511.20857
1•simonpure•30m ago•0 comments

Impacts of Cyclonic Storm Senyar viewed through Sentinel satellite imagery data

https://rtnf.substack.com/p/impacts-of-cyclonic-storm-senyar
2•altilunium•31m ago•0 comments