Show HN: OpenGraviton – Run 500B+ parameter models on a consumer Mac Mini

https://opengraviton.github.io
3•fatihturker•1h ago
Hi HN,

I built OpenGraviton, an open-source AI inference engine designed to push the limits of running extremely large models on consumer hardware.

The system combines several techniques to drastically reduce memory and compute requirements:

• 1.58-bit ternary quantization ({-1, 0, +1}) for ~10x compression
• dynamic sparsity with Top-K pruning and MoE routing
• mmap-based layer streaming to load weights directly from NVMe SSDs
• speculative decoding to improve generation throughput
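To make the first bullet concrete, here is a minimal Python sketch of ternary quantization. It is not taken from the OpenGraviton source. It assumes a single per-tensor scale computed with an absmean rule and a simple packing of four weights per byte (2 bits each); the function names are hypothetical.

```python
from typing import List, Tuple


def ternary_quantize(weights: List[float]) -> Tuple[List[int], float]:
    """Map floats to codes in {-1, 0, +1} plus one shared scale (assumed absmean rule)."""
    # Fall back to 1.0 so an all-zero tensor does not divide by zero.
    scale = sum(abs(w) for w in weights) / len(weights) or 1.0
    codes = [max(-1, min(1, round(w / scale))) for w in weights]
    return codes, scale


def pack_ternary(codes: List[int]) -> bytes:
    """Pack four ternary codes into each byte, 2 bits per code (assumed layout)."""
    out = bytearray((len(codes) + 3) // 4)
    for i, code in enumerate(codes):
        # Shift {-1, 0, +1} to {0, 1, 2} so each code fits in two unsigned bits.
        out[i // 4] |= (code + 1) << (2 * (i % 4))
    return bytes(out)


def unpack_ternary(packed: bytes, count: int) -> List[int]:
    """Inverse of pack_ternary for the first `count` codes."""
    return [((packed[i // 4] >> (2 * (i % 4))) & 0b11) - 1 for i in range(count)]


codes, scale = ternary_quantize([0.9, -0.05, -1.2, 0.4, 0.0, 2.0])
packed = pack_ternary(codes)
print(codes, round(scale, 4), len(packed), "bytes")
```

A dequantized weight is approximately `code * scale`. Packing at 2 bits per weight is simpler to unpack than a denser base-3 encoding, at the cost of some unused bit patterns.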

These allow models far larger than system RAM to run locally.
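The idea of running a model larger than RAM through mmap can be sketched as follows. This illustrates the general technique using Python's standard `mmap` module, not OpenGraviton's Metal + C++ implementation. The fixed `LAYER_BYTES` layout, the dummy file, and the function names are assumptions made for the example.

```python
import mmap
import os
import tempfile
from typing import Callable, List

LAYER_BYTES = 4096  # hypothetical fixed size of one layer's packed weights


def write_dummy_model(path: str, num_layers: int) -> None:
    """Write a stand-in weight file: layer i is filled with the byte value i % 256."""
    with open(path, "wb") as f:
        for i in range(num_layers):
            f.write(bytes([i % 256]) * LAYER_BYTES)


def stream_layers(path: str, consume: Callable[[memoryview], object]) -> List[object]:
    """Map the file read-only and hand one layer at a time to `consume`.

    Nothing is read up front; the OS pages data in from disk when a layer
    is touched and is free to evict pages of layers already processed.
    """
    results = []
    with open(path, "rb") as f, mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ) as mm:
        with memoryview(mm) as whole:
            for offset in range(0, len(mm), LAYER_BYTES):
                # The slice is a zero-copy view; it is released before the next layer.
                with whole[offset:offset + LAYER_BYTES] as layer:
                    results.append(consume(layer))
    return results


def demo() -> List[object]:
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "model.bin")
        write_dummy_model(path, num_layers=3)
        # `consume` must return a derived value, not the view itself.
        return stream_layers(path, lambda layer: (len(layer), layer[0]))


print(demo())
```

In a real engine the `consume` step would unpack the layer and run its matrix multiplications, so peak resident memory is bounded by the layers currently in use instead of the whole file.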

In early benchmarks, OpenGraviton reduced TinyLlama-1.1B from ~2.05GB (FP16) to ~0.24GB using ternary quantization. Synthetic stress tests at the 140B scale show that models which would normally require ~280GB FP16 can fit within ~35GB when packed with the ternary format.
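The sizes quoted above can be sanity-checked with back-of-the-envelope arithmetic. The sketch below counts raw weight storage only and assumes parameter counts of 1.1e9 and 140e9. Which bit width the packed format actually uses on disk is an inference from the numbers, not something stated in the post.

```python
def model_size_bytes(num_params: float, bits_per_weight: float) -> float:
    """Raw weight storage only; ignores scales, embeddings, and runtime buffers."""
    return num_params * bits_per_weight / 8


GB = 1e9       # decimal gigabyte
GIB = 2 ** 30  # binary gibibyte

for name, params in [("TinyLlama-1.1B", 1.1e9), ("140B synthetic", 140e9)]:
    for bits in (16, 2, 1.58):
        size = model_size_bytes(params, bits)
        print(f"{name:>16} @ {bits:>5} bits: {size / GB:8.2f} GB ({size / GIB:8.2f} GiB)")
```

Under these assumptions, 140B parameters take 280 GB at 16 bits and 35 GB at 2 bits per weight, which matches the quoted figures; an ideal 1.58 bits per weight would be about 27.65 GB. For TinyLlama, FP16 comes to 2.2 GB (about 2.05 GiB), and the reported ~0.24GB falls between the 2-bit estimate (about 0.26 GiB) and the 1.58-bit estimate (about 0.20 GiB). The "~10x" in the feature list corresponds to 16 / 1.58 ≈ 10.1, while the 140B example works out to 8x.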

The project is optimized for Apple Silicon and currently uses custom Metal + C++ tensor unpacking.

Benchmarks, architecture, and details: https://opengraviton.github.io

GitHub: https://github.com/opengraviton

Comments

fatihturker•1h ago
Author here.

The architecture page explains how ternary quantization, dynamic sparsity, and mmap layer streaming work together to push models far beyond normal RAM limits.

Happy to answer questions about the implementation or benchmarks.

MrLey•1h ago
This is a cool project.
