frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: OpenGraviton – Run 500B+ parameter models on a consumer Mac Mini

https://opengraviton.github.io
3•fatihturker•2h ago
Hi HN,

I built OpenGraviton, an open-source AI inference engine designed to push the limits of running extremely large models on consumer hardware.

The system combines several techniques to drastically reduce memory and compute requirements:

• 1.58-bit ternary quantization ({-1, 0, +1}) for ~10x compression • dynamic sparsity with Top-K pruning and MoE routing • mmap-based layer streaming to load weights directly from NVMe SSDs • speculative decoding to improve generation throughput

These allow models far larger than system RAM to run locally.

In early benchmarks, OpenGraviton reduced TinyLlama-1.1B from ~2.05GB (FP16) to ~0.24GB using ternary quantization. Synthetic stress tests at the 140B scale show that models which would normally require ~280GB FP16 can fit within ~35GB when packed with the ternary format.

The project is optimized for Apple Silicon and currently uses custom Metal + C++ tensor unpacking.

Benchmarks, architecture, and details: https://opengraviton.github.io

GitHub: https://github.com/opengraviton

Comments

fatihturker•2h ago
Author here.

The architecture page explains how ternary quantization, dynamic sparsity, and mmap layer streaming work together to push models far beyond normal RAM limits.

Happy to answer questions about the implementation or benchmarks.

MrLey•1h ago
This is cool project

Show HN: ANSI-Saver – A macOS Screensaver

https://github.com/lardissone/ansi-saver
54•lardissone•4h ago•19 comments

Show HN: Leonardo – FFmpeg Video Converter for Linux Creators

https://github.com/RossContino1/Leonardo
3•RossC17331•41m ago•1 comments

Show HN: Paster – A keyboard-first clipboard manager for Vim users

https://pasterapp.com
2•luanderock•56m ago•0 comments

Show HN: Tessera – MCP server that gives Claude persistent memory and local RAG

https://github.com/besslframework-stack/project-tessera
3•jasonjeong•59m ago•0 comments

Show HN: µJS, a 5KB alternative to Htmx and Turbo with zero dependencies

https://mujs.org
46•amaury_bouchard•9h ago•15 comments

Show HN: Prompt Armour – Real-time PII detection for AI chatbots, 100% local

https://prompt-armour.vercel.app/
3•TheAlexRider•1h ago•1 comments

Show HN: Argus – VSCode debugger for Claude Code sessions

https://github.com/yessGlory17/argus
46•lydionfinance•3h ago•21 comments

Show HN: OpenGraviton – Run 500B+ parameter models on a consumer Mac Mini

https://opengraviton.github.io
3•fatihturker•2h ago•2 comments

Show HN: Kula – Lightweight, self-contained Linux server monitoring tool

https://github.com/c0m4r/kula
73•c0m4r•18h ago•49 comments

Show HN: I open-sourced my Steam game, 100% written in Lua, engine is also open

https://github.com/willtobyte/reprobate
35•delduca•19h ago•17 comments

Show HN: Moongate – Ultima Online server emulator in .NET 10 with Lua scripting

https://github.com/moongate-community/moongatev2
275•squidleon•1d ago•157 comments

Show HN: PKGSmith

https://pkgsmith.app/
2•Fogh•4h ago•0 comments

Show HN: JotSpot – a super fast Markdown note tool with instant shareable pages

https://jotspot.io/
2•Rageypeep•4h ago•1 comments

Show HN: Claude-replay – A video-like player for Claude Code sessions

https://github.com/es617/claude-replay
92•es617•1d ago•31 comments

Show HN: OculOS – Any desktop app as a JSON API via OS accessibility tree

https://github.com/huseyinstif/oculos
10•stif1337•10h ago•1 comments

Show HN: Somnia – a dream journal that locks 2 minutes after your alarm fires

https://www.somniavault.me/
2•SushanKKsdfsdf•4h ago•0 comments

Show HN: Bulk Image Generator – Create AI variations and remove bg in batch

https://bulkimagegenerator.app/
4•fairyFayra•4h ago•0 comments

Show HN: 1v1 coding game that LLMs struggle with

https://yare.io
24•levmiseri•1d ago•7 comments

Show HN: OSle – A 510 bytes OS in x86 assembly, now with a C API

https://github.com/shikaan/osle/releases/tag/16800a5
2•shikaan•5h ago•0 comments

Show HN: Smelt – Extract structured data from PDFs and HTML using LLM

https://github.com/akdavidsson/smelt
2•smeltcli•5h ago•0 comments

Show HN: Recruiter Analytics for Developer Portfolios

https://portlumeai.com/blog/recruiter-analytics-developer-portfolio-tracking
4•portlumeai•5h ago•0 comments

Show HN: Reconstruct any image using primitive shapes, runs in-browser via WASM

https://github.com/taiseiue/primitive-playground
40•taiseiue•4d ago•8 comments

Show HN: Diamond – an interactive CLI for editing trees

https://github.com/justindmassey/diamond
2•justindmassey•5h ago•0 comments

Show HN: Aegis – Open-source pre-execution firewall for AI agents

https://github.com/Justin0504/Aegis
2•AEGIS_JB•1h ago•0 comments

Show HN: Nirvana – A TUI YouTube Music Player with a Physics-Based Visualizer

https://github.com/iamekabir-web/Nirvana
4•ekabir•7h ago•0 comments

Show HN: A trainable, modular electronic nose for industrial use

https://sniphi.com/
34•kwitczak•4d ago•24 comments

Show HN: Swarm – Program a colony of 200 ants using a custom assembly language

https://dev.moment.com/
187•armandhammer10•1d ago•61 comments

Show HN: Making Braindance from Cyberpunk 2077 a reality

https://www.braindance.dance/
4•shibo•10h ago•0 comments

Show HN: Git-lanes – Parallel isolation for AI coding agents using Git worktrees

https://github.com/bugrax/git-lanes
5•bugrax•10h ago•3 comments

Show HN: Interactive 3D globe of EU shipping emissions

https://seafloor.pages.dev
20•marcohaber•1d ago•7 comments