frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Show HN: A reasoning model that infers over whole tasks in 1ms in latent space

https://github.com/OrderOneAI/dsru_whitepaper
2•orderone_ai•4h ago
I've spent the last few weeks working on a novel model architecture. It is not a transformer - it lacks attention, tokens, and softmax.

However: Batch Processing: Average batch size: 10 Time per batch: 13.03ms Time per example in batch: 1.30ms

TASK SUMMARY WITH TIMING

===================================================

Task Correct Total Accuracy Med Time (ms)

---------------------------------------------------

Emotion Classification 10 10 100.0 % 1.30

Toxicity Classification 9 10 90.0 % 1.29

Sentiment Classification 10 10 100.0 % 1.34

Domain Classification 8 10 80.0 % 1.30

Sarcasm Detection 6 10 60.0 % 1.34

Scam Detection 7 10 70.0 % 1.31

Age Appropriateness Classification 4 10 40.0 % 1.28

Urgency Level Classification 4 10 40.0 % 1.25

Privacy Policy Classification 9 10 90.0 % 1.32

Dialogue Speaker Classification 8 10 80.0 % 1.29

Book Review Sentiment 10 10 100.0 % 1.25

Empathetic Direction Classification 10 10 100.0 % 1.29

Virtual Assistant Action Classification 6 10 60.0 % 1.37

---------------------------------------------------

OVERALL 101 130 77.7 %

===================================================

It can do interesting things.

This has a lot of caveats and limitations. However, the model is available for download via a script in the repo, and the exact benchmarks I used are available. The white paper gets into theory and application, as well as reveals a lot of limitations and interesting differences from transformers in terms of training and prompting behavior. It also produces extensive appendices (over 100 pages) on training datasets used, and performance on the ~260 (I think?) NIV2 tasks in its validation dataset.

Running inference for the DSRU model + BGE embedding model together takes a bit shy of 10GB of VRAM, and the reference comparison model -- Zephyr 7B -- takes about 15GB of VRAM.

Comments

throwawayffffas•4h ago
Can I ask? why do you have a single model for all these tasks?

Wouldn't it be easier and more ergonomic to users to have dedicated models for each of this tasks?

orderone_ai•3h ago
Thank you for the question!

I would say that ease of use and deployment is actually a good reason to have a single model.

We don't train 20 LLMs for different purposes - we train one (or, I guess 3-4 in practice, each with their own broad specialization), and then prompt it for different tasks.

This simplifies deployment, integration, upgrading, etc.

This model is basically the same - instead of having a restriction to doing single-task classification. This means that a user can complete new tasks using a new prompt, not a new model.

throwawayffffas•3h ago
While I agree with the general reasoning, isn't it harder for the user to prompt the model correctly as opposed to selecting a specialized model that they wish to use?

That's the feeling I have when I try to use LLMs for more general language processing.

Have you run in cases where the model "forgets" the task at hand and switches to another mid text stream?

Regardless of all of the above. It looks to me that your choice of reasoning and problem solving in the latent space is a great one and where we should be collectively focusing our efforts, keep up the good work.

tripplyons•1h ago
How does this model compare to just using a linear classifier trained on BGE embeddings?

ZX Spectrum – Introduction To Programming (1983) [video]

https://www.youtube.com/watch?v=jPUaOS-TXfI
1•austinallegro•1m ago•0 comments

Commodore 64 Ultimate: Basic Beige

https://www.commodore.net/product-page/commodore-64-ultimate-basic-beige-batch1
1•doener•2m ago•0 comments

ETT: Expanding the Long Context Understanding Capability of LLMs at Test-Time

https://arxiv.org/abs/2507.06313
1•PaulHoule•3m ago•0 comments

C++ Library

https://mcyoung.xyz/2025/07/14/best/#fnref:terrible-people
1•todsacerdoti•4m ago•0 comments

Giant map details nerves across a mouse's body: see stunning pics

https://www.nature.com/articles/d41586-025-02156-y
1•bookofjoe•6m ago•1 comments

Being Boring app: relax and meditate for a short while on Apple devices

https://www.peterborgapps.com/beingboring/
1•sea-gold•7m ago•0 comments

Ice cream producers to phase out artificial food dyes

https://www.cnbc.com/2025/07/14/many-us-ice-cream-producers-to-phase-out-artificial-food-dyes-by-2028.html
1•Bluestein•8m ago•0 comments

As AI advances, the best interfaces will be the ones we don't see

https://airesidency.substack.com/p/a-screenless-future
1•carlyayres•8m ago•0 comments

AI's Goldilocks Problem: Powell, Huang, and Amodei Can't Agree

https://fortune.com/2025/07/14/will-ai-destroy-white-collar-jobs-entry-level-gen-z-amodei/
1•Bluestein•9m ago•0 comments

Sell Yourself Well – What Soham Parekh Can Teach Us

https://www.fldr.zip/blog/sell-yourself
1•wyxuan•10m ago•0 comments

Undiscovered galaxies orbiting the Milky Way, supercomputer simulations hint

https://www.livescience.com/space/cosmology/100-undiscovered-galaxies-may-be-orbiting-the-milky-way-supercomputer-simulations-hint
1•Bluestein•11m ago•0 comments

Collatz's Tape

https://gbragafibra.github.io/2025/07/12/collatz_ant8.html
1•Fibra•11m ago•0 comments

My Cybersecurity Research on Red Lion G3 Web Server Vulnerabilities

1•hacker_might•15m ago•0 comments

C-: A Portable Assembly Language (1997)

https://www.microsoft.com/en-us/research/publication/c-a-portable-assembly-language/
1•thunderbong•17m ago•0 comments

I Answer 18 Questions

https://www.honest-broker.com/p/i-answer-18-questions
1•paulpauper•21m ago•0 comments

Show HN

https://www.hexar.ai/
2•prajwalgote•24m ago•0 comments

LittleHorse Kernel: A Platform for Distributed Event-Driven Applications

https://github.com/littlehorse-enterprises/littlehorse
1•mooreds•24m ago•0 comments

Show HN: I Made a Product Image and Ad Cloner

https://extension.xsocialai.com/
1•pvisilias•24m ago•0 comments

Practical Design Patterns for Modern AI Systems

https://www.infoq.com/articles/practical-design-patterns-modern-ai-systems/
1•mooreds•24m ago•0 comments

Guinea Worm Eradication Program

https://www.cartercenter.org/health/guinea_worm/index.html
1•mooreds•25m ago•0 comments

Show HN: Build your app's backend with just 1 prompt

https://sitegui.app
1•ciaovietnam•28m ago•0 comments

Perplexity's Comet AI browser, I like where it's going (but it's not there yet)

https://www.zdnet.com/article/i-tried-perplexitys-comet-ai-browser-and-i-like-where-its-going-but-its-not-there-yet/
1•CrankyBear•29m ago•0 comments

Row Polymorphic Programming

https://www.stranger.systems/posts/by-slug/row-polymorphic-programming.html
3•todsacerdoti•30m ago•0 comments

Canada steals the spotlight at Europe's biggest tech event

https://betakit.com/canada-steals-the-spotlight-at-europes-biggest-tech-event/
1•saubeidl•31m ago•0 comments

Is there a cost to try catch blocks?

https://brandewinder.com//2025/07/09/performance-cost-of-try-catch-blocks/
1•gsky•31m ago•0 comments

Spotted in Prod – Mobile animation examples

https://www.spottedinprod.com/
1•pipase•31m ago•0 comments

UnoCSS: The instant on-demand Atomic CSS engine

https://unocss.dev/
1•pipase•33m ago•0 comments

Brain drug: The deadliest "addiction" isn't a drug. It's something much worse

https://slate.com/life/2025/07/drug-brain-addiction-revenge-public-health-death.html
3•DocFeind•33m ago•0 comments

The CIA Reveals More of Its Connections to Lee Harvey Oswald

https://www.washingtonpost.com/national-security/2025/07/14/cia-oswald-jfk-assassination-joannides/
2•ricksunny•35m ago•1 comments

Updated default robots.txt on Shopify storefronts

https://twitter.com/igrigorik/status/1944828600194359804
1•mfiguiere•36m ago•0 comments