frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: MaximusLLM – Train 262k-vocab LLMs on a single 16GB GPU

https://github.com/yousef-rafat/MaximusLLM/blob/main/README.md
2•yousef_g•1h ago
Hi HN, I built this because I wanted to see if I could pre-train large-vocabulary LLMs (like Gemma with 262k tokens) on hardware accessible to independent researchers.

Standard exact Cross-Entropy instantly OOMs on 16GB GPUs at that scale.

To bypass this, I implemented MAXIS Loss. It uses a "Ghost Logit" to mathematically simulate the missing probability mass of unsampled tokens, rather than materializing the full 262k-wide matrix.

Benchmarks on a 16GB VRAM card (T4):

17.5x faster in the loss layer compared to the Triton-optimized Liger Kernel.

~39% VRAM reduction in the objective calculation. Includes RandNLA Attention, which uses Causal Kronecker Sketching to keep memory flat as sequence length grows.

I’ve included technical reports with the formal math in the repository. I would love any technical feedback on the partition function simulation or the sketching approach.

Orb.Farm

https://orb.farm/
1•onestay42•50s ago•0 comments

My custom agent used 87% fewer tokens when I gave it Skills for its MCP tools

https://seroter.com/2026/03/16/my-custom-agent-used-87-fewer-tokens-when-i-gave-it-skills-for-its...
1•richards•2m ago•0 comments

Ending the Sugar Rush

https://civic.io/2026/03/16/ending-the-sugar-rush/
1•cdrnsf•4m ago•0 comments

Show HN: ThresholdIQ – Browser-based anomaly detection Engine

https://thresholdiq.app
1•vigneshj•4m ago•1 comments

The Freedom Stack

https://www.ianbetteridge.com/the-freedom-stack/
1•cdrnsf•5m ago•0 comments

Hydropower Line from Quebec to Queens Could Power a Million NYC Homes

https://www.nytimes.com/2026/03/16/nyregion/hydro-power-nyc.html
2•JumpCrisscross•9m ago•0 comments

Redpanda pushes the envelope on Nvidia Vera

https://www.redpanda.com/blog/nvidia-vera-cpu-performance-benchmark
1•PeterCorless•9m ago•0 comments

Solving Problems by Writing Out Questions and Answers

https://nguyenhuythanh.com/posts/problem-solving-qnas/
1•thanhnguyen2187•12m ago•0 comments

The day Point Loma launched a ship made of concrete

https://timesofsandiego.com/military/2026/03/14/the-day-point-loma-launched-a-ship-made-of-concrete/
1•gscott•13m ago•0 comments

Appt Helper – Skip the Global Entry Interview Backlog

https://appthelper.com/en
1•Roberto_guido•14m ago•0 comments

Iranians Use an App to Map Military Bases and Missile Sites – and So Does Israel

https://www.thefp.com/p/iranians-use-an-app-to-map-military
2•mhb•18m ago•0 comments

Ford Now Sells a Supercharger Kit to Make the F-150 Lobo a Real Street Truck

https://www.thedrive.com/news/ford-now-sells-a-supercharger-kit-to-make-the-f-150-lobo-a-real-str...
1•PaulHoule•19m ago•0 comments

AI agents framework for TypeScript and Deno

https://github.com/a7ul/vibes
1•atulanand94•20m ago•0 comments

Humanities in the Machine

https://blainsmith.com/essays/humanities-in-the-machine/
1•birdculture•20m ago•0 comments

Benjamin Netanyahu is struggling to prove he's not an AI clone

https://www.theverge.com/tech/895453/ai-deepfake-netanyahu-claims-conspiracy
5•amrrs•20m ago•0 comments

Cognitive Security

https://ghuntley.com/cogsec/
2•ghuntley•21m ago•0 comments

AI as Economic Warfare

https://ghuntley.com/warfare/
1•ghuntley•22m ago•0 comments

Theorem_ledger.md

https://github.com/affectively-ai/aeon/blob/main/docs/ebooks/145-log-rolling-pipelined-prefill/co...
1•taylorbuley•23m ago•0 comments

Show HN: LynString – Translate Android Strings.xml with AI

https://www.lynstring.dev/
1•jharteg•24m ago•1 comments

AI is helping choose targets in Iran war – now it's a target too

https://www.abc.net.au/news/2026-03-15/iran-war-ai-technology-data-centres/106443004
3•breve•26m ago•0 comments

Show HN: Live-Editable Svelte Pages

https://svedit.dev
2•_mql•27m ago•0 comments

JetBrains is shutting down "Code With Me" in all its IDEs

https://www.neowin.net/news/jetbrains-is-shutting-down-this-neat-little-feature-in-its-ides/
6•bundie•28m ago•1 comments

Build Everything

https://duggan.ie/posts/build-everything
1•duggan•28m ago•0 comments

Housing Costs, Now vs. 1939

https://chrisdillow.substack.com/p/housing-costs-now-vs-1939
1•rwmj•29m ago•0 comments

Shameless Guesses, Not Hallucinations

https://www.astralcodexten.com/p/shameless-guesses-not-hallucinations
1•toomuchtodo•29m ago•0 comments

Show HN: YouTube video discovery engine for language learning

https://lingolingo.app/discovery
1•yunusabd•29m ago•0 comments

Tesla's Terafab chip fab ambitions ignore its lack of semiconductor experience

https://electrek.co/2026/03/16/teslas-terafab-chip-fab-ambitions-ignore-its-total-lack-of-semicon...
5•breve•32m ago•0 comments

I built a hydraulic pedal system that ships standard with every SIM rig we make

1•simcoaches•33m ago•0 comments

OneWeeb: Local JPG Compression for 20KB Government Form Photos

https://oneweeb.com/compress-jpg-20kb.html
1•Zepubo•33m ago•0 comments

Show HN: macOS ElevenLabs Scribe v2 app

https://github.com/leopiney/elevenscribe
1•leopiney•34m ago•0 comments