DeepSeek's mHC: Stabilizing Training Divergence from 3,000x to 1.6x

2•Research_Brief•2h ago
While much of the attention on DeepSeek focuses on cost efficiency, the true engineering breakthrough lies in a single mechanism: Manifold-Constrained Hyper-Connections (mHC).

The core value of this research can be summarized by its impact on stability and its resulting prospects:

1. Stabilizing Training Divergence

    Unconstrained "Hyper-Connections" diversify connectivity but lose the identity-mapping property, so signal magnitudes can explode as they pass through many layers. mHC acts as a mathematical anchor by projecting the mixing matrices onto the Birkhoff polytope, the set of doubly stochastic matrices (non-negative entries, with every row and every column summing to 1). In practice, this suppresses the potential divergence factor from a catastrophic 3,000x down to a stable 1.6x. This stability is the prerequisite for everything else.

2. Two Major Prospects

    Breaking the Scaling Law Plateau: By eliminating the "instability wall," mHC allows Scaling Laws to continue progressing even as we increase model depth and complexity.

    Stable Scaling of Low-Bit Models: It provides the necessary foundation for scaling ternary-weight models like BitNet, which were previously considered too volatile to train at massive scale.

I view this mathematical stability not as a radical shift, but as a necessary prerequisite for exploring more efficient, low-precision architectures that were previously considered too unstable for large-scale training.
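
To make point 1 concrete, here is a toy sketch of why constraining mixing matrices to the Birkhoff polytope bounds signal growth. It is not DeepSeek's implementation and does not reproduce the 3,000x or 1.6x figures. It assumes Sinkhorn-Knopp normalization as one standard way to map a positive matrix to a doubly stochastic one (whether that matches the authors' exact projection is an assumption), and all names (`sinkhorn_knopp`, `compound_gain`, the 4x4 size, the depth of 60) are hypothetical choices for illustration.

```python
import random


def sinkhorn_knopp(matrix, iterations=50):
    """Alternately normalize rows and columns of a positive square matrix.

    The result is (approximately) doubly stochastic, i.e. a point in the
    Birkhoff polytope. This is an assumed stand-in for the projection step.
    """
    m = [row[:] for row in matrix]
    n = len(m)
    for _ in range(iterations):
        # Make every row sum to 1.
        for i in range(n):
            row_sum = sum(m[i])
            m[i] = [v / row_sum for v in m[i]]
        # Make every column sum to 1.
        for j in range(n):
            col_sum = sum(m[i][j] for i in range(n))
            for i in range(n):
                m[i][j] /= col_sum
    return m


def matmul(a, b):
    """Plain square matrix product."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def compound_gain(layers):
    """Max absolute row sum (infinity norm) of the product of all layers.

    This is the worst-case amplification of a bounded input signal after
    it has been mixed by every layer in turn.
    """
    n = len(layers[0])
    product = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    for layer in layers:
        product = matmul(layer, product)
    return max(sum(abs(v) for v in row) for row in product)


rng = random.Random(0)
n, depth = 4, 60

# Hypothetical unconstrained mixing matrices: entries in [0.5, 1.5], so
# every row sums to at least 2 and the gain compounds with depth.
raw_layers = [[[rng.uniform(0.5, 1.5) for _ in range(n)] for _ in range(n)]
              for _ in range(depth)]

# The same matrices after being pushed onto the Birkhoff polytope.
constrained_layers = [sinkhorn_knopp(m) for m in raw_layers]

unconstrained_gain = compound_gain(raw_layers)
constrained_gain = compound_gain(constrained_layers)

print(f"unconstrained gain after {depth} layers: {unconstrained_gain:.3e}")
print(f"constrained gain after {depth} layers:   {constrained_gain:.6f}")
```

Because a product of doubly stochastic matrices is itself doubly stochastic, the constrained gain stays at about 1 regardless of depth, while the unconstrained gain in this toy setup grows at least as fast as 2^depth. Real networks add nonlinearities and residual paths, so the numbers above only illustrate the mechanism.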
