Low-entropy tokens tolerate substitution with ~0.1 PPL cost across models

2•Trinicode•2h ago
I investigated whether an explicit entropy signal in token embeddings—beyond implicit frequency effects—carries independent information in modern LLMs, and if it could enable inference optimizations in frozen models.

Using skip-gram trained 260-d vectors (256 semantic + 4 entropy dimensions) on FineWeb-Edu, I projected them into GPT-2 and Qwen3-14B embedding spaces and substituted low-entropy tail tokens (rare/predictable, function-like) vs high-entropy common tokens (frequent/polysemous).
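The post does not spell out how per-token entropy is defined. As a rough sketch, assuming it is the Shannon entropy of the distribution of neighbouring tokens inside a skip-gram style window, it could be computed as below. The function name `context_entropy` and the `window` parameter are hypothetical and not taken from the entropy2vec repo.

```python
import math
from collections import Counter, defaultdict


def context_entropy(tokens, window=2):
    """Shannon entropy (in bits) of each token's context distribution.

    ASSUMPTION: "entropy" here means the entropy of the neighbours seen
    within +/- `window` positions; the original post may define it differently.
    """
    contexts = defaultdict(Counter)
    for i, tok in enumerate(tokens):
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i:
                contexts[tok][tokens[j]] += 1

    entropy = {}
    for tok, counts in contexts.items():
        total = sum(counts.values())
        entropy[tok] = -sum(
            (c / total) * math.log2(c / total) for c in counts.values()
        )
    return entropy


if __name__ == "__main__":
    corpus = "the cat sat on the mat and the dog sat on the rug".split()
    for tok, h in sorted(context_entropy(corpus).items(), key=lambda kv: kv[1]):
        print(f"{tok:>4}  {h:.3f} bits")
```

Under this definition, tokens that always appear next to the same neighbours score near zero, while tokens that appear in many different contexts score high. That matches the low/high split the post describes only if the post uses a comparable definition.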

Surprising result: Low-entropy tail substitutions incur a near-constant perplexity increase of +0.101 to +0.102 on both models (agreement to within 0.001), despite major differences in architecture, dimension (768 vs 5120), tokenizer, and norms. This appears intrinsic to the token class.

High-entropy tokens are far more sensitive: +36 PPL on GPT-2 and +9 on Qwen3 from ~500–530 swaps. Relative to the low-entropy cost, the per-token ratio is 356× on GPT-2 and 91× on Qwen3.
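For readers who want to check this kind of figure on their own runs, here is a minimal sketch of how a perplexity delta is usually computed, assuming perplexity is the exponential of the mean per-token negative log-likelihood (in nats). The helper names are hypothetical, and the numbers in the usage lines are made up for illustration, not results from the post.

```python
import math


def perplexity(token_nlls):
    """Perplexity = exp(mean negative log-likelihood), NLLs in nats."""
    return math.exp(sum(token_nlls) / len(token_nlls))


def ppl_delta(baseline_nlls, substituted_nlls):
    """Perplexity increase caused by substituting embeddings.

    ASSUMPTION: both lists score the same evaluation text, once with the
    original embeddings and once with the substituted ones.
    """
    return perplexity(substituted_nlls) - perplexity(baseline_nlls)


if __name__ == "__main__":
    # Illustrative values only.
    baseline = [2.0, 2.5, 3.0]
    substituted = [2.0, 2.6, 3.0]
    print(f"baseline PPL: {perplexity(baseline):.3f}")
    print(f"delta PPL:    {ppl_delta(baseline, substituted):+.3f}")
```

Because perplexity is exponential in the mean loss, the same absolute PPL delta corresponds to different loss changes at different baselines, which is worth keeping in mind when comparing models.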

Residual stream analysis shows nearly identical mean convergence trajectories, but low-entropy tokens exhibit transient mid-layer variance spikes (heterogeneous paths), while high-entropy ones propagate perturbations monotonically.
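One way to look for this pattern is sketched below. It assumes you have already recorded, for each substituted token and each layer, the distance between the original and perturbed residual stream. The input layout and the helper names are hypothetical, and the post's actual analysis may differ.

```python
from statistics import mean, pvariance


def layer_stats(distances):
    """Per-layer mean and variance of perturbation size.

    ASSUMPTION: distances[t][l] is the L2 distance between the original and
    perturbed residual stream for substituted token t at layer l.
    """
    per_layer = list(zip(*distances))  # transpose to layer-major order
    means = [mean(col) for col in per_layer]
    variances = [pvariance(col) for col in per_layer]
    return means, variances


def is_monotonic(values):
    """True if the sequence never changes direction."""
    pairs = list(zip(values, values[1:]))
    return all(a <= b for a, b in pairs) or all(a >= b for a, b in pairs)


def has_interior_peak(values):
    """True if the maximum sits strictly between the first and last layer."""
    peak = max(range(len(values)), key=values.__getitem__)
    return 0 < peak < len(values) - 1


if __name__ == "__main__":
    # Illustrative values only: two tokens, three layers.
    toy = [[1.0, 3.0, 0.5], [1.0, 1.0, 0.5]]
    means, variances = layer_stats(toy)
    print("means:    ", means)
    print("variances:", variances)
    print("mid-layer variance spike:", has_interior_peak(variances))
```

A transient mid-layer spike would show up as `has_interior_peak(variances)` being true while the means stay similar across token classes. Monotonic propagation would show up as `is_monotonic` holding for the per-layer curve.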

Training the vectors natively on the Qwen3 subword vocabulary weakens the signal (narrower contexts) and reverses the pattern.

Repo with reproducible experiments: https://github.com/maykef/entropy2vec

Results summary (tables): https://github.com/maykef/entropy2vec/blob/main/results/summary_table.md

Conclusion: The low/high-entropy distinction is real and model-agnostic, but post-hoc exploitation for inference (speed/memory) is negligible on frozen models—lookup/forward pass remain full-cost. Meaningful gains would require conditional compute from pretraining.

Curious about similar observations in larger models or alternative uses (e.g., uncertainty detection, curriculum design).

Learning-Based Multi-Stage Strategy for Aircraft to Evade Missile

https://arxiv.org/abs/2511.05828
1•rbanffy•1m ago•0 comments

The Man Who Broke into Jail

https://www.newyorker.com/magazine/2026/03/09/alexander-friedmann-profile-prison-reform
1•fortran77•1m ago•1 comment

How to deploy your Lovable App in Brazil [video]

https://www.youtube.com/watch?v=LRr8Hycpt_E&lc=UgyvJ8m7t8qdGkkFhuF4AaABAg
1•acfilho•1m ago•0 comments

Reflections on Norway

https://minutes.substack.com/p/reflections-on-norway
1•jger15•1m ago•0 comments

The Zen of DevOps · TBNL

https://www.tibobeijen.nl/2026/02/23/introducing-the-zen-of-devops/
1•rbanffy•2m ago•0 comments

Earth Garden: Field Recordings Around the World

https://earth-garden.alen.ro/
1•alentodorov•3m ago•0 comments

Robert Tinney: 'Byte' Magazine and Beyond

https://70s-sci-fi-art.ghost.io/robert-tinney-byte-magazine-and-beyond/
1•sohkamyung•4m ago•0 comments

Show HN: Pane – Give your AI access to your financial data via MCP

https://pane.money
1•darnfish•7m ago•0 comments

Hit Your 1 Rep Max with AI

https://www.xiegerts.com/post/hit-your-1-rep-max-with-ai/
1•siegers•7m ago•0 comments

CBP Tapped into the Online Advertising Ecosystem to Track People's Movements

https://lwn.net/Articles/1061085/
1•DyslexicAtheist•8m ago•0 comments

MCP Servers Are Now Searchable

https://mcpmonitoring.com/
1•jspuri•9m ago•0 comments

Microsoft Expands Starlink Alliance to Grow Azure and AI in Kenya

https://finance.yahoo.com/news/microsoft-expands-starlink-alliance-grow-160902940.html
1•andsoitis•13m ago•0 comments

Slab tearing and segmented subduction termination driven by transform tectonics

https://www.science.org/doi/full/10.1126/sciadv.ady8347
1•luu•14m ago•0 comments

Rare Earths Norway says estimate of Europe's biggest deposit jumps 81%

https://www.reuters.com/business/energy/rare-earths-norway-says-estimate-deposit-biggest-europe-j...
1•littlexsparkee•14m ago•0 comments

Anthropic-backed super PAC spends $1.6M in primary race divided over datacenters

https://www.theguardian.com/us-news/2026/mar/03/datacenter-politics-north-carolina-primary
1•colinhb•15m ago•0 comments

First AI Agent on a Smartwatch

https://twitter.com/petruspennanen/status/2028946464119165140
1•petruspennanen•15m ago•1 comment

Killed by Mozilla

https://killedbymozilla.com/
1•TigerUniversity•16m ago•0 comments

PRX Part 3 – Training a Text-to-Image Model in 24h

https://huggingface.co/blog/Photoroom/prx-part3
1•ibobev•20m ago•0 comments

Helsinki just went a full year without a single traffic death

https://www.politico.eu/article/helsinki-no-traffic-death-roads-eu-accident-finland-driving-trans...
8•mooreds•20m ago•0 comments

Select your fruit (No JavaScript)

https://codepen.io/t_afif/pen/PwGPJOB
1•ChadNauseam•20m ago•1 comment

If You Like PICO-8, You'll Love Kaplay (Probably)

https://jslegenddev.substack.com/p/if-you-like-pico-8-youll-love-kaplay
1•ibobev•20m ago•0 comments

It's an Obscure Psychedelic Used to Treat Trauma. Could It Help Me?

https://www.nytimes.com/2026/03/01/magazine/ibogaine-psychedelic-treatment-trauma-mental-health.html
2•whack•21m ago•0 comments

MicroTimes Interviews Borland's Philippe Kahn Again (1995)

https://computeradsfromthepast.substack.com/p/microtimes-interviews-borlands-philippe-93a
1•rbanffy•22m ago•0 comments

Behold the Power of Meta:Substitute

https://brevzin.github.io/c++/2026/03/02/power-of-substitute/
1•ibobev•24m ago•0 comments

Pincer – Python AI agent framework, security-first

https://github.com/pincerhq/pincer
1•vpu2301•24m ago•1 comment

Compiling Prolog to Forth [pdf]

https://vfxforth.com/flag/jfar/vol4/no4/article4.pdf
2•PaulHoule•25m ago•0 comments

Maryland Senators Approve Bill to Let Off-Duty Firefighters, EMTs Use Cannabis

https://www.marijuanamoment.net/maryland-senators-approve-bill-to-let-firefighters-and-rescue-wor...
1•treatsmokenjury•26m ago•1 comment

Zed will require age identification for its services

https://zed.dev/terms#21-eligibility
24•delduca•26m ago•17 comments

Linux in Space: The aerospace industry's attitude for Space Architecture

https://www.windriver.com/blog/Linux-Flies-into-Space
2•huxleyFiddler•28m ago•2 comments

The magic of adding random noise to black and white images [video]

https://www.youtube.com/watch?v=kT4p1GXq4HY
1•ColinWright•28m ago•0 comments