Ask HN: (Idea) Autoregressive joint embedding predictor model

1•LarsDu88•15h ago
Originally posted on reddit (https://www.reddit.com/r/deeplearning/comments/1q8yfgw/idea_feedback_using_joint_embeddings_lejepa_to/), but for lack of good forums for this kind of stuff, I am reposting on HN for some feedback.

I've been brainstorming ideas recently, and one paper that caught my attention was Yann LeCun's LeJEPA paper. It claims to solve a whole host of problems with joint embedding model training, and it got me thinking...

What if you simply replace the discrete tokenizer used by LLMs with joint embeddings, and make your autoregressive language model a "predict the next latent embedding" model?

For example:

- Write some software to convert text to images where every 8x8 block (or maybe 16x16?) contains a character or whitespace. This can incorporate augmentations like jitter and font changes.

- Train a LeJEPA ViT model on the generated text "images" using SSL to create embeddings from these "images".

- Freeze the LeJEPA-trained ViT embedding model, and use it as a frozen embedding layer for an autoregressive transformer-based model that "predicts the next embedding".

- With the embedding model and the autoregressive latent predictor frozen, train a decoder that translates embeddings into discrete tokenized text.
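To make the first step concrete, here is a minimal sketch of the data layout, assuming one character per 8x8 cell so that each cell lines up with one ViT patch. Everything here is hypothetical scaffolding: `placeholder_glyph` is a stand-in bit pattern rather than real font rasterization, jitter and font augmentations are left out, and `next_embedding_pairs` only shows how "predict the next embedding" targets would be formed from whatever a frozen encoder outputs.

```python
CELL = 8  # pixels per side of one character cell (the post suggests 8x8 or 16x16)


def placeholder_glyph(ch):
    """Stand-in for font rasterization (hypothetical, not a real font).

    Row r holds the character's code point rotated left by r bits, so
    different characters give different bit patterns. Whitespace is empty.
    """
    if ch.isspace():
        return [[0] * CELL for _ in range(CELL)]
    mask = (1 << CELL) - 1
    code = ord(ch) & mask
    rows = []
    for r in range(CELL):
        rotated = ((code << r) | (code >> (CELL - r))) & mask
        rows.append([(rotated >> (CELL - 1 - b)) & 1 for b in range(CELL)])
    return rows


def render_text(text, cols):
    """Lay out text on a grid with `cols` characters per row, one cell each."""
    padded = text.ljust(-(-len(text) // cols) * cols)  # pad the last row with spaces
    image = []
    for start in range(0, len(padded), cols):
        glyphs = [placeholder_glyph(ch) for ch in padded[start:start + cols]]
        for r in range(CELL):
            image.append([px for g in glyphs for px in g[r]])
    return image


def to_patches(image):
    """Split the image into CELL x CELL patches in reading order (ViT-style)."""
    patches = []
    for top in range(0, len(image), CELL):
        for left in range(0, len(image[0]), CELL):
            patches.append([row[left:left + CELL] for row in image[top:top + CELL]])
    return patches


def next_embedding_pairs(embeddings):
    """Pair each embedding with its successor (input, target) for the predictor."""
    return list(zip(embeddings[:-1], embeddings[1:]))


image = render_text("hello world", cols=4)
patches = to_patches(image)  # one patch per character, in reading order
```

With this layout, patch `i` corresponds to character `i` of the padded text. Adding jitter would break that exact alignment, which is presumably part of what the SSL-trained encoder would have to become robust to.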

I can see the following benefits:

- No discrete tokenizer for input

- The autoregressive latent predictor quickly outputs full image-scale concepts rather than individual discrete tokens, and it can run asynchronously and much faster than the embedding -> discrete text model

- Cohesive multimodality built in... text-free images are still images that can result in latents, perhaps with finetuning on pure image datasets.
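The decoupling in the second point can be sketched roughly as follows: the predictor rolls forward entirely in latent space, and decoding to text happens as a separate pass afterwards. All names here are hypothetical. `toy_predictor` is a stand-in for the trained autoregressive transformer, and the nearest-neighbour lookup against a small `CODEBOOK` is a simplification swapped in for the trained decoder the post describes.

```python
import math


def rollout(predict_next, context, steps):
    """Autoregressively extend a list of embeddings by `steps` predicted latents."""
    seq = list(context)
    for _ in range(steps):
        seq.append(predict_next(seq))
    return seq[len(context):]  # only the newly predicted latents


def nearest_token(embedding, codebook):
    """Toy decoder: return the token whose reference embedding is closest."""
    return min(codebook, key=lambda tok: math.dist(embedding, codebook[tok]))


def decode(latents, codebook):
    """Translate a list of latents into discrete tokens, one latent at a time."""
    return [nearest_token(z, codebook) for z in latents]


def toy_predictor(seq):
    """Hypothetical stand-in for the predictor: rotate the last latent by 90 degrees."""
    x, y = seq[-1]
    return (-y, x)


# Hypothetical 2-D reference embeddings for four tokens.
CODEBOOK = {"a": (1, 0), "b": (0, 1), "c": (-1, 0), "d": (0, -1)}

latents = rollout(toy_predictor, [(1, 0)], steps=3)  # latent-only pass
tokens = decode(latents, CODEBOOK)                   # separate decoding pass
```

Because `rollout` never calls `decode`, the two stages could in principle run on different schedules, which is the asynchrony the post is pointing at. Whether a real predictor stays accurate without being grounded by decoded tokens at each step is an open question this sketch does not address.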

In my mind this would be more akin to how humans think: with image recall far superior to text sequence recall, and with abstract thinking before speaking or typing language.
