Transformers know more than they can tell: Learning the Collatz sequence

https://www.arxiv.org/pdf/2511.10811
21•Xcelerate•5d ago

Comments

niek_pas•11m ago
Can someone ELI5 this for a non-mathematician?
poszlem•3m ago
A transformer can. Here's Gemini's summary:

The Experiment: Researchers trained AI models (Transformers) to solve a complex arithmetic problem called the "long Collatz step".

The "Language" Matters: The AI's ability to solve the problem depended entirely on how the numbers were written. Models using bases divisible by 8 (like 16 or 24) achieved nearly 100% accuracy, while those using odd bases struggled significantly.

Pattern Matching, Not Math: The AI did not learn the actual arithmetic rules. Instead, it learned to recognize specific patterns in the binary endings of numbers (zeros and ones) to predict the answer.

Principled Errors: When the AI failed, it didn't hallucinate random answers. It usually performed the correct calculation but misjudged the length of the sequence, defaulting to the longest pattern it had already memorized.

Conclusion: These models solve complex math by acting as pattern recognizers rather than calculators. They struggle with the "control structure" (loops) of algorithms unless the input format reveals the answer through shortcuts.
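To make the "binary endings" point concrete, here is a minimal sketch in Python. The definition of the long step used here is an assumption, not something confirmed in this thread: for an odd n, apply (3n + 1) / 2 until the value is even, then halve until it is odd again. The names `long_step` and `trailing_ones` are hypothetical helpers for illustration. Under that reading, the number of "up" steps always equals the number of trailing 1-bits of n, so the loop length can be read off the low bits of the input without doing any arithmetic.

```python
def trailing_ones(n: int) -> int:
    """Count the consecutive 1-bits at the low end of n's binary form."""
    count = 0
    while n & 1:
        n >>= 1
        count += 1
    return count


def long_step(n: int) -> tuple[int, int, int]:
    """Return (successor, up_steps, down_steps) for a positive odd n.

    Assumed definition (not taken from the thread): apply (3n + 1) / 2
    until the value is even, then halve until it is odd again.
    """
    if n <= 0 or n % 2 == 0:
        raise ValueError("n must be a positive odd integer")
    ups = 0
    while n % 2 == 1:
        n = (3 * n + 1) // 2
        ups += 1
    downs = 0
    while n % 2 == 0:
        n //= 2
        downs += 1
    return n, ups, downs


print(long_step(7))      # (13, 3, 1): 7 -> 11 -> 17 -> 26 -> 13
print(trailing_ones(7))  # 3, since 7 is 0b111
```

If this reading is right, it also fits the "principled errors" point: a model that gets the arithmetic right but guesses the wrong loop count lands on a different element of the same sequence rather than on a random number.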

LLM from scratch, part 28 – training a base model from scratch on an RTX 3090

https://www.gilesthomas.com/2025/12/llm-from-scratch-28-training-a-base-model-from-scratch
189•gpjt•6d ago•31 comments

Show HN: AlgoDrill – Interactive drills to stop forgetting LeetCode patterns

https://algodrill.io
52•henwfan•2h ago•35 comments

The Joy of Playing Grandia, on Sega Saturn

https://www.segasaturnshiro.com/2025/11/27/the-joy-of-playing-grandia-on-sega-saturn/
80•tosh•4h ago•39 comments

Transformers know more than they can tell: Learning the Collatz sequence

https://www.arxiv.org/pdf/2511.10811
24•Xcelerate•5d ago•2 comments

Why frozen test fixtures are a problem on large projects and how to avoid them

https://radanskoric.com/articles/frozen-test-fixtures
12•amalinovic•1h ago•6 comments

ZX Spectrum Next on the Internet: Xberry Pi ESP01 and Pi Zero Upgrades

https://retrogamecoders.com/zx-spectrum-next-on-the-internet-xberry-pi-esp01-and-pi-zero-upgrades/
18•ibobev•2h ago•0 comments

Icons in Menus Everywhere – Send Help

https://blog.jim-nielsen.com/2025/icons-in-menus/
629•ArmageddonIt•18h ago•263 comments

Epsilon: A WASM virtual machine written in Go

https://github.com/ziggy42/epsilon
72•ziggy42•1w ago•23 comments

A deep dive into QEMU: The Tiny Code Generator (TCG), part 1 (2021)

https://airbus-seclab.github.io/qemu_blog/tcg_p1.html
27•costco•1w ago•1 comment

The universal weight subspace hypothesis

https://arxiv.org/abs/2512.05117
309•lukeplato•13h ago•106 comments

Kroger acknowledges that its bet on robotics went too far

https://www.grocerydive.com/news/kroger-ocado-close-automated-fulfillment-centers-robotics-grocer...
195•JumpCrisscross•13h ago•186 comments

Manual: Spaces

https://type.today/en/journal/spaces
82•doener•13h ago•9 comments

No ARIA is better than bad ARIA

https://www.w3.org/WAI/ARIA/apg/practices/read-me-first/
90•robin_reala•6d ago•56 comments

Show HN: I built a system for active note-taking in regular meetings like 1-1s

https://withdocket.com
107•davnicwil•15h ago•91 comments

Jepsen: NATS 2.12.1

https://jepsen.io/analyses/nats-2.12.1
386•aphyr•19h ago•143 comments

Brent's Encapsulated C Programming Rules (2020)

https://retroscience.net/brents-c-programming-rules.html
11•p2detar•2h ago•6 comments

Constructing the World's First JPEG XL MD5 Hash Quine

https://stackchk.fail/blog/jxl_hashquine_writeup
5•luispa•1w ago•1 comment

Strong earthquake hits northern Japan, tsunami warning issued

https://www3.nhk.or.jp/nhkworld/en/news/20251209_02/
332•lattis•23h ago•152 comments

Microsoft increases Office 365 and Microsoft 365 license prices

https://office365itpros.com/2025/12/08/microsoft-365-pricing-increase/
411•taubek•1d ago•481 comments

Has the cost of building software dropped 90%?

https://martinalderson.com/posts/has-the-cost-of-software-just-dropped-90-percent/
320•martinald•18h ago•493 comments

AMD GPU Debugger

https://thegeeko.me/blog/amd-gpu-debugging/
263•ibobev•21h ago•49 comments

Let's put Tailscale on a jailbroken Kindle

https://tailscale.com/blog/tailscale-jailbroken-kindle
299•Quizzical4230•21h ago•73 comments

Launch HN: Nia (YC S25) – Give better context to coding agents

https://www.trynia.ai/
121•jellyotsiro•20h ago•77 comments

Trials avoid high risk patients and underestimate drug harms

https://www.nber.org/papers/w34534
140•bikenaga•18h ago•46 comments

IBM to acquire Confluent

https://www.confluent.io/blog/ibm-to-acquire-confluent/
413•abd12•1d ago•333 comments

Horses: AI progress is steady. Human equivalence is sudden

https://andyljones.com/posts/horses.html
464•pbui•13h ago•383 comments

Paramount launches hostile bid for Warner Bros

https://www.cnbc.com/2025/12/08/paramount-skydance-hostile-bid-wbd-netflix.html
342•gniting•23h ago•361 comments

Hunting for North Korean Fiber Optic Cables

https://nkinternet.com/2025/12/08/hunting-for-north-korean-fiber-optic-cables/
265•Bezod•21h ago•103 comments

Cassette tapes are making a comeback?

https://theconversation.com/cassette-tapes-are-making-a-comeback-yes-really-268108
114•devonnull•5d ago•202 comments

A thousand-year-long composition turns 25 (2024)

https://longplayer.org/news/2024/12/31/a-thousand-year-long-composition-turns-25/
28•1659447091•6h ago•5 comments