Hello HN, I’ve turned my Master’s research on stabilizing very deep Transformers into an open-source PyTorch library called AION-Torch. Instead of a fixed residual connection, it uses an adaptive residual that compares the energy of a block’s input and output (roughly, their norms) and dials the residual strength up or down to keep activations and gradients stable. On my small setup (an RTX 4060), it seemed to help very deep Transformer stacks keep gradients under control and reach lower loss without special tuning.
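To make the mechanism concrete, here is a minimal sketch of the idea, not the exact code in the repo; the mean-squared "energy", the learnable gain, and the eps stabilizer are all simplifications of my own:

    import torch
    import torch.nn as nn

    class AdaptiveResidual(nn.Module):
        """Sketch: scale the residual branch by a gate driven by the
        energy ratio of the block's input vs. its output."""
        def __init__(self, init_gain: float = 1.0, eps: float = 1e-6):
            super().__init__()
            # Learnable base gain; the energy ratio modulates it per forward pass.
            self.gain = nn.Parameter(torch.tensor(init_gain))
            self.eps = eps

        def forward(self, x: torch.Tensor, fx: torch.Tensor) -> torch.Tensor:
            # "Energy" here = mean squared activation over all non-batch dims.
            e_in = x.pow(2).mean(dim=tuple(range(1, x.dim())), keepdim=True)
            e_out = fx.pow(2).mean(dim=tuple(range(1, fx.dim())), keepdim=True)
            # Shrink the residual branch when the block output is much more
            # energetic than its input, so the sum stays well-scaled even
            # in very deep stacks.
            alpha = self.gain * torch.sqrt(e_in / (e_out + self.eps))
            return x + alpha * fx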
The repo has a drop-in AionResidual module, some basic tooling to log what’s happening inside the network during training, and small examples showing how to plug it into existing models. I’d love feedback on whether the idea makes sense beyond toy setups, how you would benchmark it against standard residuals or DeepNorm on real tasks, and whether the API feels natural to people who train larger models.
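To give a concrete sense of the "drop-in" part, usage looks roughly like this. This is a simplified sketch: the import path and call signature here are illustrative rather than the exact API, and the repo examples show the real thing:

    import torch
    import torch.nn as nn
    # Illustrative import; the actual path/signature in AION-Torch may differ.
    from aion_torch import AionResidual

    class Block(nn.Module):
        def __init__(self, d_model: int = 512, n_heads: int = 8):
            super().__init__()
            self.norm = nn.LayerNorm(d_model)
            self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
            self.residual = AionResidual()  # replaces the plain `x + sublayer(x)`

        def forward(self, x):
            h = self.norm(x)
            h, _ = self.attn(h, h, h, need_weights=False)
            # Adaptive residual in place of the usual fixed sum:
            return self.residual(x, h)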