frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

When AI Speaks, Who Can Prove What It Said?

https://zenodo.org/records/18212180
3•businessmate•7h ago

Comments

businessmate•7h ago
Artificial intelligence is becoming a public-facing actor. Banks use it to explain credit decisions. Health platforms deploy it to answer clinical questions. Retailers rely on it to frame product choices. In each case, AI no longer sits quietly in the back office. It communicates directly with customers, patients and investors. That shift exposes a weakness in many governance frameworks. When an AI system’s output is later disputed, organisations are often unable to show precisely what was communicated at the moment a decision was influenced. Accuracy benchmarks, training documentation and policy statements rarely answer that question. Re-running the system does not help either. The answer may change.

This is not a technical curiosity. It is an institutional vulnerability.

kundan_s__r•7h ago
This framing resonates a lot. The core issue you’re pointing at isn’t model accuracy, it’s epistemic accountability.

In most current deployments, an AI system’s output is treated as transient: generated, consumed, forgotten. When that output later becomes contested (“Why did the system say this?”), organizations fall back on proxies—training data, benchmarks, prompt templates—none of which actually describe what happened at decision time.

Re-running the system is especially misleading, as you note. You’re no longer observing the same system state, the same context, or even the same implicit distribution. You’re generating a new answer and pretending it’s evidence.

What seems missing in many governance frameworks is an intermediate layer that treats AI output as a decision artifact—something that must be validated, scoped, and logged before it is allowed to influence downstream actions. Without that, auditability is retroactive and largely fictional.

Once AI speaks directly to users, the question shifts from “Is the model good?” to “Can the institution prove what it allowed the model to say, and why?” That’s an organizational design problem as much as a technical one.

robin_reala•7h ago
This is why you need regulation to add transparency obligations to providers, and to remove algorithmic assessment from harmful situations. The EU Artificial Intelligence Act is a good first step: https://en.wikipedia.org/wiki/Artificial_Intelligence_Act
smurda•6h ago
“They do not reliably capture what a user was shown or told.”

This adds to the case for middleware providers like Vapi, LiveKit, and Layercode. If you’re building a voice AI application using one of these SST -> LLM -> TTS providers there will be definitive logs to capture what a user was told.

Choosing a tech stack in a world where LLMs write all the code

https://behan.substack.com/p/the-data-distribution
2•behan•4m ago•0 comments

Leads Acquisition

1•xxxxxikke•4m ago•0 comments

Ask HN: What Are You Working On? (January 2026)

1•david927•5m ago•0 comments

The AI revolution is here. Will the economy survive the transition?

https://post.substack.com/p/the-ai-revolution-is-here-will-the
1•sideway•5m ago•0 comments

Show HN: Scan vibe coded apps for security vulnerabilities

https://vibeappscanner.com/
1•silexdev•6m ago•0 comments

Show HN: Slim – 50% fewer tokens than JSON for LLM applications

https://github.com/matteuccimarco/slim-protocol-core
1•matteuccimarco•6m ago•0 comments

Ledga – A Budgeting Application to See Cash Flow

https://ledga.us
1•ChicagoDave•9m ago•1 comments

Ask HN: Is There Any open source company data?

1•ankit84•11m ago•0 comments

Show HN: I put economic rules in silicon that can't be changed by software

1•PrimalOrigins•12m ago•1 comments

The Poverty Trap's Most Popular Posts: 2025

https://povertytrap.substack.com/p/the-poverty-traps-most-popular-posts-370
1•jdemartin•15m ago•0 comments

Primecoin Primality Test

https://www.johndcook.com/blog/2026/01/10/primecoin-primality-test/
1•ibobev•17m ago•0 comments

Bi-Twin Prime Chains

https://www.johndcook.com/blog/2026/01/10/bi-twin-prime-chains/
1•ibobev•17m ago•0 comments

Primecoin and Cunningham Prime Chains

https://www.johndcook.com/blog/2026/01/10/prime-chains/
1•ibobev•17m ago•0 comments

Stories into podcasts, narrated in your voice by cloning your voice

https://storybee.app/podcast-monetization/start
1•niksmac•18m ago•0 comments

There's a full line of Trustworthy Technology available now

https://aol.codeberg.page/eci/status.html
1•babylon5•19m ago•0 comments

Password Hashing Competition

https://www.password-hashing.net/
2•mooreds•19m ago•0 comments

Garbage collected handles are lifetime-contravariant

https://trynova.dev/blog/garbage-collection-is-contrarian
2•birdculture•22m ago•0 comments

Detecting event loop blocking in asyncio

https://deepankarm.github.io/posts/detecting-event-loop-blocking-in-asyncio/
1•deepankarm44•25m ago•0 comments

Replace the Retiring Windows XP with Linux

https://www.linux.com/training-tutorials/replace-retiring-windows-xp-linux/
5•righthand•27m ago•1 comments

Thoughts on Claude Code

https://www.spakhm.com/claude-code
1•coffeemug•28m ago•0 comments

Deep reinforcement learning trading bot 90%-120% returns yearly

https://github.com/zero-was-here/tradingbot
2•solosquad•29m ago•0 comments

The Curious Cult of Aldi How a German discount chain became US's hottest grocer

https://www.bloomberg.com/news/features/2026-01-06/german-grocer-aldi-built-an-american-empire-on...
1•helsinkiandrew•29m ago•1 comments

How artificial intelligence is reshaping the future of war

https://thehill.com/policy/defense/5683359-artificial-intelligence-future-war/
1•c420•33m ago•0 comments

Show HN: 17 yo built my first app after 7 months of teaching myself development

https://www.domnest.app/
1•imad-101•33m ago•0 comments

Tech and retail back a new AI shopping standard at NRF 2026

https://www.axios.com/2026/01/11/google-shopify-ai-shopping-standard-nrf-2026
1•thm•37m ago•0 comments

SpaceX Lowering Orbits: 4,400 Satellites Moving Closer to Earth

https://nasaspacenews.com/2026/01/spacex-lowering-orbits/
2•slow_typist•37m ago•0 comments

Installerpedia: Install Anything Without Hassle

https://journal.hexmos.com/introducing-installerpedia/
6•lordwiz•43m ago•0 comments

30 Years Old

https://andys.blog/30/
1•andytratt•44m ago•0 comments

Show HN: Nexus Gateway – Open-Source AI Caching Layer in Go

https://www.nexus-gateway.org/
1•Sunnyanand_dev•45m ago•0 comments

Kontigo: Y Combinator's Venezuelan Sanctions Evasion Startup

https://fintechbusinessweekly.substack.com/p/kontigo-ycombinators-venezuela-sanctions
3•firekvz•46m ago•0 comments