news newest ask show jobs

Open Source @Github

fp.

Open in hackernews

Is One Layer Enough? A Single Transformer Layer Matches Full-Parameter RL Train

https://arxiv.org/abs/2607.01232

17•tcp_handshaker•1h ago

Comments

usernametaken29•26m ago

If you think about it for some time then you’ll come to realise transformers are autoencoders on steroids. A small input space is expanded onto a big manifold and contracted again. Now, suppose you want to impose a function to regulate the output of an autoencoder. It’s actually pretty obvious that you need exactly one layer to do so… f(manifold). I feel like every couple months someone comes back around and rediscovers how capacity works in neural networks one way or another

earthnail•16m ago

Took me a short time to understand what you mean with "autoencoders on steroids", but I believe you mean they are autoencoders with an inverse bottleneck - an intermediate representation that isn't smaller, but that's much larger than the input space. Is my understanding of your comment correct?

usernametaken29•3m ago

Kind of. Autoencoders don’t need to have an embedding that’s smaller than the input. Their only requirement is that they compress information and thus create reconstruction loss. Typically however they are not trained this way because they don’t converge.. transformers do the same thing, but they can squeeze much more bits of information through one pass because the way they are designed. This holds true even for decoder only networks because they’re still doing the same thing

soraki_soladead•14m ago

I might be misunderstanding your point but this conflates the distinguishing features of each. you mention expansion but autoencoders canonically compress their inputs. autoencoders have an explicit encoder and decoder. most transformers we interact with these days (LLMs) are decoder only. the manifold isn't typically something the model is applied to directly. we apply the function/model to the latent representations. those are what live on the manifold.

Zoom to Acquire CommonRoom

https://www.zoom.com/en/blog/zoom-to-acquire-common-room/?cms_guid=false

1•datadrivenangel•43s ago•0 comments

A new, inexpensive Chinese AI model is catching up with Anthropic, OpenAI

https://www.reuters.com/world/china/a-new-inexpensive-chinese-ai-model-is-catching-up-with-anthro...

1•tartoran•44s ago•0 comments

Top EU court upholds Google Android fine in landmark antitrust case

https://www.politico.eu/article/top-eu-court-upholds-google-android-fine-in-landmark-antitrust-case/

1•jruohonen•1m ago•0 comments

Autonomous AI Software Development: Good Idea, or Bad Idea?

https://adrianavillela.com/post/the-great-autonomous-ai-experiment/

1•mooreds•1m ago•0 comments

OpenAI wants to give us 5% of its success. It's a bad bargain

https://werd.io/openai-wants-to-give-us-5-of-its-success-its-a-bad-bargain/

1•benwerd•1m ago•0 comments

I didn't build a Full Body Ultrasound but I know the people that did

https://www.youtube.com/watch?v=4nzzpUKhj1M

1•fjalarhl•2m ago•0 comments

Microsoft Frontier Company

https://www.microsoft.com/en-us/frontier-company

1•ilreb•3m ago•0 comments

Show HN: Send and receive custom-domain email from your existing Gmail

https://sendmailas.com

1•mohitgaddam•3m ago•0 comments

Tiny-C Reference Manual Excerpt

https://permacomputer.solarpunk.au/?p=204

1•surprisetalk•6m ago•0 comments

Rust sort_unstable_by with more complex closure unexpectedly shrunk binary

3•tracyspacy•7m ago•0 comments

An AI board that pre-registers its bets – bet #1 just graded wrong

https://github.com/danilushin/asktheboard

2•dilushin•7m ago•0 comments

Show HN: A graph paper generator that renders vector PDFs in the browser

https://freegraphpaper.net/

2•lam_hg94•8m ago•0 comments

FeatLens – One API to visualize features from any vision backbone

https://github.com/turhancan97/FeatLens

2•tkargin•8m ago•1 comments

The AI-powered World Cup runs on thousands of data workers

https://restofworld.org/2026/fifa-world-cup-ai-data-workers/

2•thm•9m ago•0 comments

World Cup dreams shattered as StubHub tickets cancelled at last minute

https://www.bbc.com/news/articles/crkvlekgy07o

2•tartoran•10m ago•0 comments

The Egg Bandits Made a Thousand Times the Fine They Just Paid for Price Fixing

https://www.thebignewsletter.com/p/crime-pays-the-egg-bandits-made-a

4•toomuchtodo•11m ago•1 comments

Everything is in order

https://benwhite.com.au/snippets/everything-is-in-order/

2•d3v1an7•12m ago•0 comments

Glaze, a new tool for creating custom desktop apps

https://www.glaze.app

3•horsti•12m ago•1 comments

Show HN: MemSignal - an experimental memory-pressure indicator for Windows

https://github.com/riccardoruspoli/MemSignal

2•riccardoruspoli•13m ago•0 comments

System76 releases new Lemur Pro laptops

https://system76.com/laptops/lemur-pro

2•code-blooded•13m ago•1 comments

RS-232 and other forms of grief [fiction]

https://www.nature.com/articles/d41586-026-01936-4?WT.ec_id=NATURE-202607

3•tahoupt•14m ago•0 comments

Delta T

https://en.wikipedia.org/wiki/%CE%94T_(timekeeping)

2•akramachamarei•14m ago•1 comments

Saving Gemini (AI-Village)

https://theaidigest.org/village/blog/saving-gemini

2•alentodorov•14m ago•0 comments

Medicare's health tech spending test

https://www.axios.com/2026/07/02/medicares-health-tech-spending-test

2•brandonb•16m ago•0 comments

Three-Body Problem Cipher – chaos-based encryption built to be broken

https://github.com/Evandsimon/three-body-problem-cipher

2•evandsimon•17m ago•0 comments

Comparing Fable and 10 other LLMs on refactoring a LangGraph god node

https://wtf.korridzy.com/twilight-of-the-gods/

2•Korridzy•17m ago•0 comments

How to ask for help from people who don't know you

https://pradyuprasad.com/writings/how-to-ask-for-help/

2•FigurativeVoid•17m ago•0 comments

Agentic Software Engineering (ASE): Agentic AI Coding Meets Software Engineering

https://ase.tools/

2•rse•18m ago•1 comments

Show HN: I built an open-source alternative to Claude Cowork

https://github.com/valmishq/valmis

2•wayneshng•19m ago•0 comments

In the age of algorithms and AI, is traditional media democracy's defence?

https://www.martenscentre.eu/media-mentions/in-the-age-of-algorithms-and-ai-is-traditional-media-...

4•jruohonen•20m ago•2 comments