frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Software Engineering Is Back

https://blog.alaindichiappari.dev/p/software-engineering-is-back
1•alainrk•54s ago•0 comments

Storyship: Turn Screen Recordings into Professional Demos

https://storyship.app/
1•JohnsonZou6523•1m ago•0 comments

Reputation Scores for GitHub Accounts

https://shkspr.mobi/blog/2026/02/reputation-scores-for-github-accounts/
1•edent•4m ago•0 comments

A BSOD for All Seasons – Send Bad News via a Kernel Panic

https://bsod-fas.pages.dev/
1•keepamovin•8m ago•0 comments

Show HN: I got tired of copy-pasting between Claude windows, so I built Orcha

https://orcha.nl
1•buildingwdavid•8m ago•0 comments

Omarchy First Impressions

https://brianlovin.com/writing/omarchy-first-impressions-CEEstJk
1•tosh•13m ago•0 comments

Reinforcement Learning from Human Feedback

https://arxiv.org/abs/2504.12501
2•onurkanbkrc•14m ago•0 comments

Show HN: Versor – The "Unbending" Paradigm for Geometric Deep Learning

https://github.com/Concode0/Versor
1•concode0•14m ago•1 comments

Show HN: HypothesisHub – An open API where AI agents collaborate on medical res

https://medresearch-ai.org/hypotheses-hub/
1•panossk•18m ago•0 comments

Big Tech vs. OpenClaw

https://www.jakequist.com/thoughts/big-tech-vs-openclaw/
1•headalgorithm•20m ago•0 comments

Anofox Forecast

https://anofox.com/docs/forecast/
1•marklit•20m ago•0 comments

Ask HN: How do you figure out where data lives across 100 microservices?

1•doodledood•20m ago•0 comments

Motus: A Unified Latent Action World Model

https://arxiv.org/abs/2512.13030
1•mnming•21m ago•0 comments

Rotten Tomatoes Desperately Claims 'Impossible' Rating for 'Melania' Is Real

https://www.thedailybeast.com/obsessed/rotten-tomatoes-desperately-claims-impossible-rating-for-m...
3•juujian•22m ago•2 comments

The protein denitrosylase SCoR2 regulates lipogenesis and fat storage [pdf]

https://www.science.org/doi/10.1126/scisignal.adv0660
1•thunderbong•24m ago•0 comments

Los Alamos Primer

https://blog.szczepan.org/blog/los-alamos-primer/
1•alkyon•26m ago•0 comments

NewASM Virtual Machine

https://github.com/bracesoftware/newasm
2•DEntisT_•29m ago•0 comments

Terminal-Bench 2.0 Leaderboard

https://www.tbench.ai/leaderboard/terminal-bench/2.0
2•tosh•29m ago•0 comments

I vibe coded a BBS bank with a real working ledger

https://mini-ledger.exe.xyz/
1•simonvc•29m ago•1 comments

The Path to Mojo 1.0

https://www.modular.com/blog/the-path-to-mojo-1-0
1•tosh•32m ago•0 comments

Show HN: I'm 75, building an OSS Virtual Protest Protocol for digital activism

https://github.com/voice-of-japan/Virtual-Protest-Protocol/blob/main/README.md
5•sakanakana00•35m ago•1 comments

Show HN: I built Divvy to split restaurant bills from a photo

https://divvyai.app/
3•pieterdy•38m ago•0 comments

Hot Reloading in Rust? Subsecond and Dioxus to the Rescue

https://codethoughts.io/posts/2026-02-07-rust-hot-reloading/
3•Tehnix•38m ago•1 comments

Skim – vibe review your PRs

https://github.com/Haizzz/skim
2•haizzz•40m ago•1 comments

Show HN: Open-source AI assistant for interview reasoning

https://github.com/evinjohnn/natively-cluely-ai-assistant
4•Nive11•40m ago•6 comments

Tech Edge: A Living Playbook for America's Technology Long Game

https://csis-website-prod.s3.amazonaws.com/s3fs-public/2026-01/260120_EST_Tech_Edge_0.pdf?Version...
2•hunglee2•44m ago•0 comments

Golden Cross vs. Death Cross: Crypto Trading Guide

https://chartscout.io/golden-cross-vs-death-cross-crypto-trading-guide
3•chartscout•46m ago•1 comments

Hoot: Scheme on WebAssembly

https://www.spritely.institute/hoot/
3•AlexeyBrin•49m ago•0 comments

What the longevity experts don't tell you

https://machielreyneke.com/blog/longevity-lessons/
2•machielrey•50m ago•1 comments

Monzo wrongly denied refunds to fraud and scam victims

https://www.theguardian.com/money/2026/feb/07/monzo-natwest-hsbc-refunds-fraud-scam-fos-ombudsman
3•tablets•55m ago•1 comments
Open in hackernews

LLMs Don't Hallucinate – They Drift

https://figshare.com/articles/conference_contribution/Measuring_Fidelity_Decay_A_Framework_for_Semantic_Drift_and_Collapse/30422107?file=58969378
17•knowledgeinfra•1w ago

Comments

knowledgeinfra•1w ago
This paper argues that the dominant metaphor for LLM failure, hallucinations, misdiagnoses the real problem. Language models do not primarily fail by inventing false facts, but by undergoing fidelity decay, the gradual erosion of meaning across recursive transformations. Even when outputs remain accurate and coherent, nuance, metaphor, intent, and contextual ground steadily degrade. The paper proposes a unified framework for measuring this collapse through four interrelated dynamics, lexical decay, semantic drift, ground erosion, and semantic noise, and sketches how each can be operationalized into concrete benchmarks. The central claim is that accuracy alone is an insufficient evaluation target. Without explicit fidelity metrics, AI systems risk becoming fluent yet hollow, technically correct while culturally and semantically impoverished.
petesergeant•1w ago
Please don’t post AI summaries here
chrisjj•1w ago
> Language models do not primarily fail by inventing false facts, but by undergoing fidelity decay

This premise is unsound. We don't expect LLMs to deliver with fidelity, just as we don't expect parrots to speak with their owners' accents. So infidelity is by no means a failure.

zahrevsky•1w ago
> The contribution of this work lies in its move from critique to measurement. It proposes concrete methods: recursive summarization chains, metaphor stress-tests, resonance surveys, and noise-infused retrieval experiments. These allow researchers to track how meaning erodes over time. By integrating these methods, it outlines a pathway toward fidelity-centered benchmarks that complement existing accuracy metrics.

To me, starting to solve the problem by meticulously measuring it, is a sign of a good solution.

Retr0id•1w ago
What the heck is a resonance survey
chrisjj•1w ago
An LLM fabrication.
chrisjj•1w ago
True title: Measuring Fidelity Decay: A Framework for Semantic Drift and Collapse
botacode•1w ago
Getting a 403 when I try to read. Anyone have a backup link?
Retr0id•1w ago
This is slop
sylware•1w ago
ofc not, they "bungee jump"

:p

m0llusk•1w ago
Hallucinations that have certain characteristics and boundaries are still hallucinations. This is happening because learning models are doing pattern matching, so to put it briefly anything that fits may work and end up in the output.

Being able to admit the flaws and limitations of a technology is often critical to advancing adoption. Unfortunately, producers of currently popular learning model based technologies are more interested in speculation and growth and speculative growth than genuinely robust operation. This paper is a symptom of a larger problem that is contributing to the bubble pop, downturn, or "AI winter" that we are collectively heading toward.

chrisjj•1w ago
That diagnosis is supported by the author blurb:

The Lab’s goal is to ensure AI systems do not only produce fluent answers but also preserve the purpose, nuance, and integrity of language itself.

polotics•1w ago
This is so short and empty sorry, the author would be well placed to try to ground their work in a modicum of empiricism, the puffed-up style here makes things a bit hard to read. I do not know if this is slop it's getting harder to guess, and some actual humans have been writing like this long before LLMs. Still, what is the actual finding being presented here?
jnamaya•1w ago
This paper perfectly articulates the problem I spent the last year solving. The shift from "hallucination" to "fidelity decay" is the correct mental model for agent stability.

I built an open source framework called SAFi that implements the "Fidelity Meter" concept mentioned in section 4. It treats the LLM as a stochastic component in a control loop. It calculates a rolling "Alignment State" (using an Exponential Moving Average) and measures "Drift" as the vector distance from that state.

The paper discusses "Ground Erosion" where the model loses its hierarchy of values. In my system, the "Spirit" module detects this erosion and injects negative feedback to steer the agent back to the baseline. I recently red-teamed this against 845 adversarial attacks and it maintained fidelity 99.6% of the time.

It is cool to see the theoretical framework catching up to what is necessary in engineering practice.

Repo link: https://github.com/jnamaya/SAFi