frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

How much attention do you need, really? Experiments in O(1) latent reasoning

https://www.notion.so/Direct-Semantic-Reasoning-Unit-The-O-1-AI-Primitive-That-Reasons-In-Latent-Space-22fc65dfc8738069aa62e8b563b8e6b4?source=copy_link
2•orderone_ai•9h ago

Comments

orderone_ai•9h ago
Hello, fellow kids!

I want to share what I've been working on the last few weeks: O(1) inference across whole tasks through direct vector transformation. A few facts upfront to give you an idea of how it goes:

1. Implemented as part of a PoC of what I call the Promptable General Classifier (a classifier which can be prompted for general tasks, including (some, limited) reasoning tasks, and has inference-time hot swappable vocabulary/classes), and the 1.09B implementation:

    1. Runs 93x faster than Zephyr 7B (and this is being generous to Zephyr, as I had to add post-processing to extract labels from malformed LLM output, and I didn't count the time necessary to complete this post processing in the Zephyr's benchmarks

    2. Matches Zephyr 7B's batched accuracy across 13 tasks at 77.7% (the unbatched run with Zephyr gets one more correct, so it's 80%. The DSRU is much more deterministic, and it receives no accuracy boost from running unbatched). Note that I did prompt engineering on 2-3 of these to help the DSRU. The prompt engineering seemed to have no impact on Zephyr’s performance, which I’m assuming is due to its robustness as a professionally built LLM rather than a PoC of a new architecture made by a lone amateur researcher

    3. ~19x faster latency than Zephyr 7B
2. Separately trained on entailment tasks, and scored 80% (~2.66x better than chance) on a 3-label text entailment task (entails, contradicts, neutral), and 50% on a 3-label multiple choices entailment task ('1', '2', '3') - notes in the white paper on why the difference

3. The core model has an inference time at 1.09B of around 1ms per batch, but this is purely in post-attention latent space. This model has generalization capabilities, but lacks the full flexibility of an LLM. In exchange for giving that up, it gains extreme inference speeds, determinism, and extremely straightforward training with smooth loss landscapes. I was a bit hesitant to put this out so early, kept thinking about edge cases, ways I could add just a bit more rigor, etc, but I decided the perfect was the enemy of the good, and put together this white paper over the course of a couple of weekends with some midweek refinements.

I'll be releasing a full reference implementation of the training pipeline that can run on midrange consumer hardware with default settings on github in…I’m thinking 4 weeks, probably, depending on how busy I end up being - doing this with a day job has been...a lot, to say the least.

I’d release it now, but frankly, it’s an embarrassing ball of mud that I hacked my way do haphazardly while chasing positive signal. Now that I’ve gotten this far, I can implement it more thoughtfully - and try a new specific model architecture that I think will work a lot better for a lot of comparative reasoning tasks.

It is patent pending, but I'm permitting personal experimentation and thesis work without restriction. This includes grad students using it for their degrees! You can share results and discuss your work, but distribution of trained models or derivatives is not permitted. For funded research, institutional use, or anything commercial, usage is not permitted for now.

I hope you all find it interesting!

Sea snot: The noxious plague troubling Istanbul's coast

https://www.bbc.com/future/article/20250710-the-summer-slime-threatening-turkish-beaches
1•littlexsparkee•3m ago•0 comments

Ask HN: What million dollar questions do you want answers for?

1•sandwichsphinx•7m ago•0 comments

Stellantis declares bankruptcy in China, with $1B in debts

https://www.italpassion.fr/en/stellantis/stellantis-declares-bankruptcy-in-china-with-1-billion-in-debts/
2•teleforce•8m ago•0 comments

As an app developer, how can you generate passive income?

1•ppkkK•11m ago•0 comments

Asmjit

https://asmjit.com/
1•andsoitis•13m ago•0 comments

Ambit

1•andsoitis•13m ago•0 comments

James Webb, Hubble space telescopes prepare to reduce operations

https://www.astronomy.com/science/james-webb-hubble-space-telescopes-face-reduction-in-operations-over-funding-shortfalls/
2•geox•14m ago•0 comments

IndexTTS2: Emotional duration-controlled autoregressive zero-shot text-to-speech

https://index-tts.github.io/index-tts2.github.io/
1•satvikpendem•14m ago•1 comments

US demands to know what allies would do in event of war over Taiwan

https://www.ft.com/content/41e272e4-5b25-47ee-807c-2b57c1316fe4
1•mhga•17m ago•0 comments

Store Tags After Payloads

https://www.scattered-thoughts.net/writing/store-tags-after-payloads/
1•todsacerdoti•19m ago•0 comments

India's richest man wants to turn every TV into a PC

https://techcrunch.com/2025/07/11/indias-richest-man-wants-to-turn-every-tv-into-a-pc/
2•droideqa•20m ago•1 comments

Revolutionizing Athletic Recruitment

https://usport.ai/
1•mariarezhylo•21m ago•1 comments

Ecdsa Nonces, Lattice, RNG Et Patterns: Decrypting a Bitcoin Exploit

https://www.cyphertux.net/articles/en/research/ecdsa-nonces-lattice-attacks-bitcoin-exploit
1•TechDebtDevin•23m ago•1 comments

The Sacrifices We Choose to Make

https://michaelnotebook.com/sacrifice/index.html
2•akkartik•31m ago•0 comments

Algorithms for making interesting organic simulations

https://bleuje.com/physarum-explanation/
1•SerCe•35m ago•0 comments

Denmark Aims to Use Copyright Law to Protect People from Deepfakes

https://www.nytimes.com/2025/07/10/world/europe/denmark-deepfake-copyright-ai-law.html
1•bookofjoe•43m ago•1 comments

Rtrvr

https://www.rtrvr.ai/
1•handfuloflight•48m ago•0 comments

Show HN: A Chrome extension that detects malicious websites

https://chromewebstore.google.com/detail/cheztrap/nekhbkmakcoobbhgckdjakflhcdhpjjk
1•SuperLordPanda•49m ago•0 comments

Browser AI Agents Are the New "Weakest Link"

https://labs.sqrx.com/browser-ai-agents-the-new-weakest-link-22a38a552d7f
1•botanicals6•51m ago•0 comments

Agentic Doc: Agentic Data Extraction from Visually Complex Documents

https://github.com/landing-ai/agentic-doc
2•yanng404•53m ago•0 comments

Itty-AWS: 34KB AWS SDK for Effect

https://github.com/sam-goodwin/itty-aws
2•nateb2022•57m ago•0 comments

A Simple IP Geolocation API (free for non-commercial)

https://ip-api.com/
1•tony-allan•57m ago•3 comments

Interview with Alan Kay

https://web.archive.org/web/20120716230547/http://www.drdobbs.com/article/print?articleId=240003442&siteSectionName=architecture-and-design
3•b-man•58m ago•1 comments

You Need 'Productive Friction'

https://every.to/context-window/why-you-need-productive-friction
1•handfuloflight•1h ago•0 comments

3D printing method turns biodegradable polymers into conductive components

https://techxplore.com/news/2025-07-3d-method-biodegradable-polymers-electronic.html
1•PaulHoule•1h ago•0 comments

Discovery of ancient riverbeds suggests Mars once wetter than thought

https://www.theguardian.com/science/2025/jul/10/mars-once-wetter-than-thought-surprise-discovery-10000-miles-ancient-riverbeds
1•Bluestein•1h ago•0 comments

The Tyranny of the Marginal User

https://nothinghuman.substack.com/p/the-tyranny-of-the-marginal-user
1•cropcirclbureau•1h ago•0 comments

Perl 5.42.0 Released

https://medium.com/@Re-News/perl-5-42-0-released-performance-gains-feature-refinements-and-key-security-fixes-1976628bc763
2•DASD•1h ago•0 comments

Show HN: I wrote backend editor that adds AI agents and database to Lovable UIs

https://www.youtube.com/watch?v=3AlscfiAJmY
3•alessiapacca•1h ago•1 comments

What Happened to All the Human Bird Flu Cases?

https://undark.org/2025/07/10/opinion-bird-flu-emergency-end/
4•Gaishan•1h ago•1 comments