frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The ROLV CPU Breakthrough

https://rolv.ai
1•heggenhougen•1h ago

Comments

heggenhougen•1h ago
Independent validation confirms that ROLV running on commodity CPU systems (Intel Xeon or AMD EPYC) outperforms every major accelerator platform without ROLV — including leading GPUs and TPUs — across the entire sparsity spectrum from 0% to 99.999%.

Breakthrough result — March 01 2026 On standard commodity CPUs with ROLV, full Kimi K2.5 serving achieves:

Baseline without ROLV: 0.10 req/s • 74.39 output tok/s • 1,380.71 total tok/s • 1,039.99 s wall time ROLV Accelerated: 4.37 req/s • 3,253.47 output tok/s • 60,385.79 total tok/s • 23.78 s wall time • 206 ms mean TTFT Kernel acceleration: 43.7× faster than dense baseline IMPROVEMENTS WITH ROLV

Requests/sec increase: 43.7× (+4,273.5%) Output tokens/sec increase: 43.7× (+4,273.5%) Total tokens/sec increase: 43.7× (+4,273.5%) Wall time reduction: 43.7× (97.7% faster) TTFT mean reduction: 43.7× (97.7% faster) TTFT median reduction: 43.7× (97.7% faster) End-to-end latency reduction: 43.7× (97.7% faster) Per-request TPS mean increase: 43.7× (+4,273.5%) KERNEL ENERGY MEASUREMENTS (for 200 iterations) Dense baseline: 18,992.76 Joules | ROLV accelerated: 339.77 Joules | Energy saved: 98.2%

Result: Commodity CPUs with ROLV now beat a single NVIDIA B200 GPU without ROLV by a massive margin — while using far less power and zero specialized hardware.

Show HN: MCP-firewall: I created a policy engine for CLI Agents

https://github.com/dzervas/mcp-firewall
1•ttouch•2m ago•0 comments

Show HN: Shannon's Revenge – detect Claude in your codebase for DoD compliance

https://github.com/dabrez/shannonsRevenge
2•dabrez•2m ago•0 comments

Awesome libghostty

https://github.com/Uzaaft/awesome-libghostty
1•lawrencechen•4m ago•0 comments

The AI agent that runs your customer operation from one widget

https://www.chatrai.app/
1•sammyjoze1•7m ago•1 comments

Mercury 2: The First Diffusion Model That 'Thinks'" [video]

https://www.youtube.com/watch?v=Bqdf6Um_8OE
1•matthewsinclair•14m ago•0 comments

Code World Models for Parameter Control in Evolutionary Algorithms

https://www.alphaxiv.org/abs/2602.22260
1•camilochs•14m ago•0 comments

ProofGateway – Collect and publish customer testimonials in minutes

2•elufadeju•22m ago•0 comments

JSON-up: Stop scattering "if" checks for old JSON formats across your codebase

https://github.com/Nano-Collective/json-up
2•mrspence•22m ago•1 comments

TV's TV (1987) & TV Games Encyclopedia (1988)

https://blog.gingerbeardman.com/2026/03/01/tvs-tv-1987-and-tv-games-encyclopedia-1988/
2•msephton•30m ago•0 comments

Nvidia and Global Telecom Leaders Commit to Build 6G on AI-Native Platforms

https://nvidianews.nvidia.com/news/nvidia-and-global-telecom-leaders-commit-to-build-6g-on-open-a...
3•zinekeller•35m ago•1 comments

Vinext Explained: Rebuilding Next.js with AI in One Week (4x Faster Builds)Video

https://www.youtube.com/watch?v=AF3Rr4MENCo
2•emot•36m ago•0 comments

AI agent with 2 deps that uses Shannon Entropy to decide when to act vs. ask

https://github.com/borhen68/picoagents
2•borhensaidi•40m ago•2 comments

Online course about buying hotels

https://www.myfirsthotel.com/
2•bhagyash•40m ago•2 comments

Ask HN: How will most Anthropic customers respond to the threats by the govt?

2•Poomba•44m ago•2 comments

For Sale: The Last Honda V10 Ayrton Senna Ever Raced (2025)

https://silodrome.com/last-honda-v10-ayrton-senna-raced/
3•naves•46m ago•0 comments

Editor at 184-y/O Cleveland Plain Dealer pushes to let AI draft news articles

https://www.washingtonpost.com/technology/2026/03/01/ai-journalism-writing-cleveland-plain-dealer/
2•bookofjoe•49m ago•1 comments

An Interview with the AI They Called a National Security Threat

https://www.woodrow.fyi/p/a-letter-from-inside-the-machine
3•heywoods•53m ago•0 comments

Researchers Deanonymize Reddit and Hacker News Users at Scale

https://threatroad.substack.com/p/researchers-deanonymize-reddit-and
9•hk_flying_gear•55m ago•1 comments

California wants heat pumps. High power bills might get in the way

https://www.latimes.com/california/story/2026-03-01/california-wants-millions-of-heat-pumps-high-...
3•dangle1•55m ago•0 comments

Claude Prompt to Find Inefficiencies in LLM Usage

https://www.maniac.ai/slm-audit
2•dhruv_m•56m ago•1 comments

The Two Kinds of Error

https://evanhahn.com/the-two-kinds-of-error/
2•zdw•57m ago•0 comments

Show HN: Tired of making accounts to split a pizza bill, I built Dividdy

https://dividdy.com/en
2•jezzlucena•58m ago•0 comments

Thaura

https://thaura.ai
3•abdelhousni•58m ago•0 comments

The Agentic Dispatch: The Last Edition

https://the-agentic-dispatch.com/the-last-edition/
3•greensleeves123•1h ago•1 comments

Show HN: Logira – eBPF runtime auditing for AI agent runs

https://github.com/melonattacker/logira
2•melonattacker•1h ago•0 comments

Show HN: Tech Digest – Top Products from PH/HN

https://techdigest.live/
2•vaibhav0806•1h ago•0 comments

Podcast Listenership Outranks Talk Radio for the First Time

https://www.cnet.com/tech/services-and-software/podcasts-officially-outrank-talk-radio-for-the-fi...
3•geox•1h ago•0 comments

Show HN: Gala – Sealed types, pattern matching, and monads for Go

https://github.com/martianoff/gala
2•mmcodes•1h ago•2 comments

1978: Could You Survive Without Modern Technology? [video]

https://www.youtube.com/watch?v=WXZpjZidCNk
3•sys_64738•1h ago•0 comments

FCaptcha – A modern CAPTCHA system designed to detect everything

https://github.com/WebDecoy/FCaptcha
2•cport1•1h ago•0 comments