frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ternary Bonsai: Top Intelligence at 1.58 Bits

https://prismml.com/news/ternary-bonsai
41•nnx•2d ago

Comments

wmf•1h ago
Yet again they're comparing against unquantized versions of other models. They would probably still win but by a much smaller size margin.
Dumbledumb•54m ago
Wouldnt the margin be higher? All other models being moved from unquantized to quantized would lower their performance, while bonsai stays. I get what you see if it was in regards to score/modelsize, but not for absolute performance
mchusma•1h ago
Ever since I saw the first one of these one-bit models made by Microsoft, I thought this was a fascinating route. I assume that in practice, this is less helpful than it seems, just because there's every economic incentive in the world for the big AI labs to produce small, powerful, fast models. None of them seem to be using this technique, so it's interesting, but I suspect it's not quite working.

I also have yet to see any of these at a larger scale. For example, can you try one of these at 100 billion parameters?

yodon•1h ago
So excited to see this - the big advantage of 1.58 bits is there are no multiplications at inference time, so you can run them on radically simpler and cheaper hardware.
Animats•44m ago
At 4 bits, you could just have a hard-wired table lookup. Two 4 bit values in, 256 entry table. You can have saturating arithmetic and a post-processing function for free. Somebody must be building hardware like that.
Animats•46m ago
This makes sense. The 1-bit model implies needing 2x as many neurons, because you need an extra level to invert. But the ternary model still has a sign, just really low resolution.

(I've been reading the MMLU-Redux questions for electrical engineering. They're very funny. Fifty years ago they might have been relevant. The references to the Intel 8085 date this to the mid-1970s. Moving coil meters were still a big thing back then. Ward-Leonard drives still drove some elevators and naval guns. This is supposed to be the hand-curated version of the questions. Where do they get this stuff? Old exams?)

[1] https://github.com/aryopg/mmlu-redux/blob/main/outputs/multi...

ericb•40m ago
This is pretty cool! I would love to see an even larger models shrunk down.

If you got that into a couple gigs--what could you stuff into 20 gigs?

armanj•13m ago
I did a quick benchmark & compared it with Qwen3.5: https://github.com/ArmanJR/PrismML-Bonsai-vs-Qwen3.5-Benchma...

in my results, accuracy-wise Ternary-Bonsai-8B is on par with Qwen3.5-4B. But in accuracy-per-byte, bonsai is the clear winner:

=> Ternary-Bonsai-1.7B achieved 65.1% from 462 MiB, beating Qwen3.5-0.8B by 12 points while being ~5% smaller on disk. => Ternary-Bonsai-4B is the accuracy-per-byte winner above 1 GiB. 83.0% from only 1.1 GiB, within 2 points of Qwen3.5-4B at 40% of the weight size.

they show strong promise on edge devices and where disk space is limited. I think this lab is worth watching.

John Ternus to become Apple CEO

https://www.apple.com/newsroom/2026/04/tim-cook-to-become-apple-executive-chairman-john-ternus-to...
1173•schappim•5h ago•601 comments

How to Make a Fast Dynamic Language Interpreter

https://zef-lang.dev/implementation
24•pizlonator•57m ago•0 comments

Jujutsu megamerges for fun and profit

https://isaaccorbrey.com/notes/jujutsu-megamerges-for-fun-and-profit
124•icorbrey•4h ago•31 comments

Qwen3.6-Max-Preview: Smarter, Sharper, Still Evolving

https://qwen.ai/blog?id=qwen3.6-max-preview
538•mfiguiere•11h ago•282 comments

Kimi vendor verifier – verify accuracy of inference providers

https://www.kimi.com/blog/kimi-vendor-verifier
161•Alifatisk•7h ago•17 comments

Soul Player C64 – A real transformer running on a 1 MHz Commodore 64

https://github.com/gizmo64k/soulplayer-c64
75•adunk•5h ago•19 comments

Ternary Bonsai: Top Intelligence at 1.58 Bits

https://prismml.com/news/ternary-bonsai
42•nnx•2d ago•8 comments

ggsql: A Grammar of Graphics for SQL

https://opensource.posit.co/blog/2026-04-20_ggsql_alpha_release/
351•thomasp85•12h ago•72 comments

Quantum Computers Are Not a Threat to 128-Bit Symmetric Keys

https://words.filippo.io/128-bits/
131•hasheddan•9h ago•56 comments

OpenAI ad partner now selling ChatGPT ad placements based on “prompt relevance”

https://www.adweek.com/media/exclusive-leaked-deck-reveals-stackadapts-playbook-for-chatgpt-ads/
167•jlark77777•4h ago•78 comments

All phones sold in the EU to have replaceable batteries from 2027

https://www.theolivepress.es/spain-news/2026/04/20/eu-to-force-replaceable-batteries-in-phones-an...
961•ramonga•12h ago•790 comments

Deezer says 44% of songs uploaded to its platform daily are AI-generated

https://techcrunch.com/2026/04/20/deezer-says-44-of-songs-uploaded-to-its-platform-daily-are-ai-g...
292•FiddlerClamp•10h ago•272 comments

Kefir C17/C23 Compiler

https://sr.ht/~jprotopopov/kefir/
124•conductor•3d ago•5 comments

Modern Rendering Culling Techniques

https://krupitskas.com/posts/modern_culling_techniques/
77•krupitskas•1d ago•16 comments

Brussels launched an age checking app. Hackers took 2 minutes to break it

https://www.politico.eu/article/eu-brussels-launched-age-checking-app-hackers-say-took-them-2-min...
142•axbyte•16h ago•71 comments

Even 'uncensored' models can't say what they want

https://morgin.ai/articles/even-uncensored-models-cant-say-what-they-want.html
95•llmmadness•3h ago•75 comments

Japan's Cherry Blossom Database, 1,200 Years Old, Has a New Keeper

https://www.nytimes.com/2026/04/17/climate/japan-cherry-blossom-database-scientist.html
16•caycep•3d ago•0 comments

Monero Community Crowdfunding System

https://ccs.getmonero.org/ideas/
57•OsrsNeedsf2P•4h ago•33 comments

Zero-Copy Pages in Rust: Or How I Learned to Stop Worrying and Love Lifetimes

https://redixhumayun.github.io/databases/2026/04/14/zero-copy-pages-in-rust.html
39•ingve•4d ago•4 comments

WebUSB Extension for Firefox

https://github.com/ArcaneNibble/awawausb
197•tuananh•13h ago•180 comments

M 7.4 earthquake – 100 km ENE of Miyako, Japan

https://earthquake.usgs.gov/earthquakes/eventpage/us6000sri7/
262•Someone•15h ago•121 comments

Bloom (YC P26) Is Hiring

https://www.ycombinator.com/companies/trybloom/jobs
1•RayFitzgerald•8h ago

F-35 is built for the wrong war

https://warontherocks.com/cogs-of-war/the-f-35-is-a-masterpiece-built-for-the-wrong-war/
190•anjel•5h ago•368 comments

Year of the IPv6 Overlay Network

https://www.defined.net/blog/year-of-the-ipv6-overlay-network/
8•stock_toaster•3d ago•1 comments

10 years ago, someone wrote a test for Servo that included an expiry in 2026

https://mastodon.social/@jdm_/116429380667467307
188•luu•1d ago•103 comments

Atlassian enables default data collection to train AI

https://letsdatascience.com/news/atlassian-enables-default-data-collection-to-train-ai-f71343d8
501•kevcampb•13h ago•114 comments

Sauna effect on heart rate

https://tryterra.co/research/sauna-effect-on-heart-rate
366•kyriakosel•12h ago•206 comments

Kimi K2.6: Advancing open-source coding

https://www.kimi.com/blog/kimi-k2-6
570•meetpateltech•10h ago•288 comments

Writing string.h functions using string instructions in asm x86-64 (2025)

https://pmasschelier.github.io/x86_64_strings/
44•thaisstein•3d ago•4 comments

I learned Unity the wrong way

https://darkounity.com/blog/how-i-learned-unity-the-wrong-way
133•lelanthran•4d ago•80 comments