jbarrow•15m ago
For instance, it goes into (nano)vLLM internals and doesn’t mention PagedAttention once (one of the core ideas that vLLM is based on)[1].
It also says that Part 2 will cover dense vs. MoE models, which is odd given that nanovllm hardcodes a dense Qwen3 into the source.
Here are better (imo) explainers about how vLLM works:
- https://hamzaelshafie.bearblog.dev/paged-attention-from-firs...
- https://www.aleksagordic.com/blog/vllm
- https://huggingface.co/blog/continuous_batching
Aleksa’s blog is a bit in the weeds for my taste but it’s really worth working through.
A lot of the magic of vLLM happens in the PagedAttention kernels, which are really succinctly implemented in nanovllm. And the codebase is great and readable by itself!
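If you just want a rough feel for what PagedAttention does, here's a minimal numpy sketch of the block-table bookkeeping (hypothetical names, not vLLM's or nanovllm's actual code): the KV cache lives in a pool of fixed-size physical blocks, each sequence keeps a block table mapping its logical positions to physical blocks, and attention gathers keys/values through that table, so sequences grow without contiguous preallocation.

    # Minimal sketch of PagedAttention bookkeeping (hypothetical names).
    import numpy as np

    BLOCK_SIZE = 16   # tokens per KV block
    NUM_BLOCKS = 64   # physical blocks in the pool
    HEAD_DIM = 8      # toy single-head dimension

    # Physical pools: [num_blocks, block_size, head_dim]
    key_pool = np.zeros((NUM_BLOCKS, BLOCK_SIZE, HEAD_DIM), dtype=np.float32)
    value_pool = np.zeros((NUM_BLOCKS, BLOCK_SIZE, HEAD_DIM), dtype=np.float32)
    free_blocks = list(range(NUM_BLOCKS))

    class Sequence:
        def __init__(self):
            self.length = 0        # tokens written so far
            self.block_table = []  # logical block index -> physical block index

        def append_kv(self, k, v):
            """Write one token's key/value into the paged cache."""
            slot = self.length % BLOCK_SIZE
            if slot == 0:  # current block is full (or first token): grab a new one
                self.block_table.append(free_blocks.pop())
            block = self.block_table[-1]
            key_pool[block, slot] = k
            value_pool[block, slot] = v
            self.length += 1

        def gather_kv(self):
            """Reassemble this sequence's keys/values via its block table."""
            ks = key_pool[self.block_table].reshape(-1, HEAD_DIM)[: self.length]
            vs = value_pool[self.block_table].reshape(-1, HEAD_DIM)[: self.length]
            return ks, vs

    def attend(q, seq):
        """Single-head attention of query q over a paged sequence."""
        ks, vs = seq.gather_kv()
        scores = ks @ q / np.sqrt(HEAD_DIM)
        probs = np.exp(scores - scores.max())
        probs /= probs.sum()
        return probs @ vs

    # Two sequences share one physical pool; seq_a spans two blocks.
    seq_a, seq_b = Sequence(), Sequence()
    for _ in range(20):
        seq_a.append_kv(np.random.randn(HEAD_DIM), np.random.randn(HEAD_DIM))
    seq_b.append_kv(np.random.randn(HEAD_DIM), np.random.randn(HEAD_DIM))
    print(attend(np.random.randn(HEAD_DIM), seq_a).shape)  # (8,)

The real kernels fuse the gather and the attention math on the GPU; the point of the sketch is just the indirection through the block table, which is what lets vLLM pack many sequences into one cache without fragmentation.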
—
1. https://arxiv.org/abs/2309.06180