Standard exact cross-entropy instantly OOMs on a 16GB GPU at that vocabulary size.
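To make the blow-up concrete, here is the rough arithmetic for the logit tensor alone at a 262,144-token vocabulary (the batch size and sequence length are hypothetical numbers picked for illustration, not from the repo's benchmarks):

```python
# Back-of-the-envelope memory for the full logit matrix in fp32.
# batch and seq_len are hypothetical; vocab matches the 262k vocabulary.
batch, seq_len, vocab = 8, 4096, 262_144
logits_bytes = batch * seq_len * vocab * 4   # 4 bytes per fp32 element
print(logits_bytes / 2**30)                  # → 32.0 (GiB), double a 16GB card
```

And that is just the forward logits, before softmax intermediates and gradient buffers, so exact cross-entropy cannot fit.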
To get around this, I implemented MAXIS Loss. Rather than materializing the full 262k-wide logit matrix, it uses a "Ghost Logit" to mathematically simulate the probability mass of the tokens left out of the sample.
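For readers unfamiliar with the idea, here is a minimal sketch of the general technique: a sampled cross-entropy where one extra "ghost" logit stands in for the unsampled vocabulary mass. This is an illustrative importance-sampling version, not the repo's actual MAXIS implementation; the function name, the uniform negative sampling, and the `(V - 1) / num_neg` scaling are all my assumptions.

```python
import math
import torch

def ghost_logit_ce(hidden, weight, target, num_neg=4096, generator=None):
    """Illustrative sampled cross-entropy (NOT the repo's MAXIS Loss).

    Computes logits only for the target token plus `num_neg` uniformly
    sampled negatives, then adds one "ghost" logit estimating
    log sum_{j not sampled} exp(z_j), so the (N, V) matrix is never built.
    """
    N, d = hidden.shape
    V = weight.shape[0]
    # Uniform negatives, sampled with replacement for simplicity.
    neg = torch.randint(0, V, (num_neg,), generator=generator,
                        device=hidden.device)
    pos_logit = (hidden * weight[target]).sum(-1, keepdim=True)   # (N, 1)
    neg_logits = hidden @ weight[neg].T                           # (N, num_neg)
    # Ghost logit: scale the sampled negatives' mass up to the full
    # vocabulary (importance-sampling estimate of the missing mass).
    ghost = torch.logsumexp(neg_logits, -1, keepdim=True) \
        + math.log((V - 1) / num_neg)                             # (N, 1)
    z = torch.cat([pos_logit, ghost], dim=-1)                     # (N, 2)
    # Cross-entropy with the target fixed in column 0.
    return (torch.logsumexp(z, -1) - z[:, 0]).mean()
```

Peak activation memory in the loss is then O(N x num_neg) instead of O(N x V), which is where the VRAM savings come from.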
Benchmarks on a 16GB VRAM card (T4):

- 17.5x faster in the loss layer than the Triton-optimized Liger Kernel.
- ~39% VRAM reduction in the objective calculation.

The repo also includes RandNLA Attention, which uses Causal Kronecker Sketching to keep attention memory flat as sequence length grows.
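To illustrate the memory-flat idea behind sketched attention in general, here is a toy randomized-sketch attention. It is deliberately simplified and is NOT the repo's RandNLA Attention: it uses a dense Gaussian sketch rather than a Kronecker-structured one, and it ignores causal masking, which the repo's Causal Kronecker Sketching is specifically designed to preserve.

```python
import torch

def sketched_attention(q, k, v, m=64, generator=None):
    """Toy sketched attention (NOT the repo's RandNLA Attention).

    Compresses keys and values along the sequence axis with a random
    sketch matrix, so the score matrix is (n, m) instead of (n, n) and
    memory stays flat in m as the sequence length n grows. Non-causal
    and unstructured Gaussian sketch, purely for illustration.
    """
    n, d = k.shape
    # Random sketch matrix, scaled so the projection is roughly
    # norm-preserving in expectation.
    S = torch.randn(m, n, generator=generator, device=q.device) / m ** 0.5
    k_s, v_s = S @ k, S @ v              # compressed keys/values: (m, d)
    scores = q @ k_s.T / d ** 0.5        # (n, m), never (n, n)
    return torch.softmax(scores, -1) @ v_s
```

With a fixed sketch size m, the score matrix no longer grows quadratically with sequence length, which is the property the benchmark above is exercising; the hard part (and the repo's contribution) is doing this without breaking causality.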
I’ve included technical reports with the formal math in the repository. I would love any technical feedback on the partition function simulation or the sketching approach.