frontpage.

Hi HN, I built OpenGraviton, an open-source AI inference engine that pushes the limits of running extremely large LLMs on consumer hardware. By combining 1.58-bit ternary quantization, dynamic sparsity with Top-K pruning and MoE routing, and mmap-based layer streaming, OpenGraviton can run models far larger than your system RAM—even on a Mac Mini. Early benchmarks: TinyLlama-1.1B drops from ~2GB (FP16) to ~0.24GB with ternary quantization. At 140B scale, models that normally require ~280GB fit within ~35GB packed. Optimized for Apple Silicon with Metal + C++ tensor unpacking, plus speculative decoding for faster generation. Check benchmarks, architecture, and details here: https://opengraviton.github.io GitHub: https://github.com/opengraviton This project isn’t just about squeezing massive models onto tiny hardware—it’s about democratizing access to giant LLMs without cloud costs. Feedback, forks, and ideas are very welcome!

Show HN: Starter-structure-CLI – scaffold apps from stack combinations

ComicsAI

Show HN: arxiv-digest: Daily robotics paper scouting for OpenClaw and Zotero

SSH-Based Mail for Agents

Knuth Claude's Cycles note update: problem now fully solved, by LLMs

Programmers will document for Claude, but not for each other

Emit Emails – Personalized email sending without the fluff

AI Crash the Physics of the Collapse [video]

3AM Coding:cracking persistent open-source memory for agents

Crow Watch: A Hacker News Alternative

Analysis of Ninth Circuit Allows TOS Amendment by Email–Ireland-Gordy vs. Tile

Terence Tao: Formalizing a proof in Lean using Claude Code [video]

Apple: The first 50 years, CBS Sunday Morning [video]

NSF National Deep Inference Fabric

CorridorKey – Perfect Green Screen Keys

Shockwave Player Reimplemented in Rust and WASM

How to win slots and influence people

Ask HN: Where do all the laid off devs hang out?

Set-OutlookSignatures v4.26.0 support for M365 sovereign clouds

Show HN: TrustScan – Simplify privacy policies and audit GDPR compliance

Every business will have AGI by 2027

Show HN: Marketing Content Generator AI-powered multi-channel content platform

Show HN: I built a mini PowerBI for tech comps with no dev experience with Codex

Fontcrafter: Turn Your Handwriting into a Real Font

Show HN: cursor-tg – Run Cursor Cloud Agents from Telegram

MoltBrowser MCP

Show HN: FretBench – I tested 14 LLMs on reading guitar tabs. Most failed

Show HN: NirvaCrop – Offline Python tool for batch video cropping

A sneak preview behind an embedded software factory. I suspect "rad" is back

Sumi – Open-source voice-to-text with local AI polishing

Show HN: Run 500B+ Parameter LLMs Locally on a Mac Mini