We just ran the exact expert FFN slice from Qwen2.5-72B-Instruct (8192×28672, batch 512) on a single NVIDIA B200.
Results (ROLV vs vendor-best cuBLAS):
- Speedup : 50.5× (4953% faster)
- Energy Savings : 91.4%
- Tokens/s : 6.42M vs 127k
- TFLOPS : 3,018 vs 59.7
- Energy : 64 J vs 742 J
- Per-iter : 0.000080 s vs 0.004027 s
A_hash and V_hash are identical for both runs (full reproducibility).
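The reported figures are mutually consistent, which is easy to verify from the GEMM shape alone. A minimal arithmetic check (shapes and timings taken straight from the numbers above; the small residual gap vs the reported 3,018 TFLOPS just reflects rounding in the per-iter time):

```python
# Sanity-check the reported numbers for the 8192x28672 FFN GEMM at batch 512.
M, K, N = 512, 8192, 28672   # batch x hidden -> intermediate (one FFN projection)
flops = 2 * M * K * N        # one multiply-accumulate counted as 2 FLOPs

rolv_iter = 0.000080         # seconds per iteration, as reported
cublas_iter = 0.004027

print(f"GEMM FLOPs    : {flops:.3e}")                     # ~2.4e11 per iteration
print(f"ROLV TFLOPS   : {flops / rolv_iter / 1e12:.0f}")  # ~3000, near the reported 3,018
print(f"cuBLAS TFLOPS : {flops / cublas_iter / 1e12:.1f}")# matches the reported 59.7
print(f"ROLV tokens/s : {M / rolv_iter / 1e6:.2f}M")      # matches the reported 6.42M
print(f"Speedup       : {cublas_iter / rolv_iter:.1f}x")
```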
heggenhougen•1h ago
ROLV_norm_hash: 8dbe5f139fd946d4cd84e8cc612cd9f68cbc87e394457884acc0c5dad56dd8dd
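For readers unfamiliar with the A_hash/V_hash convention: the idea is to hash the raw bytes of the input and output tensors so two runs (or two kernels) can be compared bit-for-bit. A minimal sketch of that kind of check, with tiny illustrative shapes and a hypothetical `tensor_hash` helper (not the actual ROLV tooling):

```python
import hashlib
import numpy as np

def tensor_hash(t: np.ndarray) -> str:
    # SHA-256 over the contiguous byte representation: identical bytes,
    # identical hash, so any numerical divergence between runs is visible.
    return hashlib.sha256(np.ascontiguousarray(t).tobytes()).hexdigest()

rng = np.random.default_rng(0)                        # fixed seed -> identical inputs
A = rng.standard_normal((4, 8)).astype(np.float32)    # real run: 512 x 8192
W = rng.standard_normal((8, 16)).astype(np.float32)   # real run: 8192 x 28672
V = A @ W

print("A_hash:", tensor_hash(A))
print("V_hash:", tensor_hash(V))
# Re-running the same computation reproduces both hashes exactly.
```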
This is the real hot path for MoE inference. No synthetic matrices.
Comments and questions welcome.