frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

LLM Inference Handbook

https://bentoml.com/llm/
128•djhu9•9h ago

Comments

sherlockxu•4h ago
Hi everyone. I'm one of the maintainers of this project. We're both excited and humbled to see it on Hacker News!

We created this handbook to make LLM inference concepts more accessible, especially for developers building real-world LLM applications. The goal is to pull together scattered knowledge into something clear, practical, and easy to build on.

We’re continuing to improve it, so feedback is very welcome!

GitHub repo: https://github.com/bentoml/llm-inference-in-production

armcat•2h ago
Amazing work on this, beautifully put together and very useful!
aligundogdu•3h ago
It's a really beautiful project, and I’d like to ask something purely out of curiosity and with the best intentions. What’s the name of the design trend you used for your website? I really loved the website too.
holografix•15m ago
Very good reference thanks for collating this!

Bill Atkinson's Psychedelic User Interface

https://patternproject.substack.com/p/from-the-mac-to-the-mystical-bill
22•cainxinth•1h ago•0 comments

OpenFront: Realtime Risk-like multiplayer game in the browser

https://openfront.io/
90•thombles•5h ago•29 comments

FP8 is ~100 tflops faster when the kernel name has "cutlass" in it

https://twitter.com/cis_female/status/1943069934332055912
79•limoce•1h ago•28 comments

Apple vs the Law

https://formularsumo.co.uk/blog/2025/apple-vs-the-law/
241•tempodox•5h ago•206 comments

Show HN: Pangolin – Open source alternative to Cloudflare Tunnels

https://github.com/fosrl/pangolin
296•miloschwartz•14h ago•61 comments

Postgres LISTEN/NOTIFY does not scale

https://www.recall.ai/blog/postgres-listen-notify-does-not-scale
468•davidgu•3d ago•200 comments

The day someone created 184 billion Bitcoin (2020)

https://decrypt.co/39750/184-billion-bitcoin-anonymous-creator
21•lawrenceyan•7h ago•17 comments

LLM Inference Handbook

https://bentoml.com/llm/
130•djhu9•9h ago•4 comments

Batch Mode in the Gemini API: Process More for Less

https://developers.googleblog.com/en/scale-your-ai-workloads-batch-mode-gemini-api/
115•xnx•3d ago•42 comments

The ChompSaw: A Benchtop Power Tool That's Safe for Kids to Use

https://www.core77.com/posts/137602/The-ChompSaw-A-Benchtop-Power-Tool-Thats-Safe-for-Kids-to-Use
198•surprisetalk•3d ago•117 comments

At Least 13 People Died by Suicide Amid U.K. Post Office Scandal, Report Says

https://www.nytimes.com/2025/07/10/world/europe/uk-post-office-scandal-report.html
6•xbryanx•8m ago•1 comments

At Amazon's Biggest Data Center, Everything Is Supersized for A.I

https://www.nytimes.com/2025/06/24/technology/amazon-ai-data-centers.html
31•pseudolus•2h ago•14 comments

Show HN: Interactive pinout for the Raspberry Pi Pico 2

https://pico2.pinout.xyz
65•gadgetoid•3d ago•9 comments

What is Realtalk’s relationship to AI? (2024)

https://dynamicland.org/2024/FAQ/#What_is_Realtalks_relationship_to_AI
261•prathyvsh•20h ago•84 comments

Flix – A powerful effect-oriented programming language

https://flix.dev/
289•freilanzer•22h ago•144 comments

Series of posts on HTTP status codes (2018)

https://evertpot.com/http/
52•antonalekseev•2d ago•7 comments

Show HN: Cactus – Ollama for Smartphones

https://github.com/cactus-compute/cactus
173•HenryNdubuaku•16h ago•67 comments

FOKS: Federated Open Key Service

https://foks.pub/
247•ubj•23h ago•54 comments

Underwater turbine spinning for 6 years off Scotland's coast is a breakthrough

https://apnews.com/article/tidal-energy-turbine-marine-meygen-scotland-ffff3a7082205b33b612a1417e1ec6d6
203•djoldman•21h ago•176 comments

Graphical Linear Algebra

https://graphicallinearalgebra.net/
261•hyperbrainer•20h ago•20 comments

Btrfs Allocator Hints

https://lwn.net/ml/all/cover.1747070147.git.anand.jain@oracle.com/
19•forza_user•2d ago•6 comments

The Wet History of Media in the Bathroom

https://thereader.mitpress.mit.edu/the-wet-history-of-media-in-the-bathroom/
12•zdw•3d ago•3 comments

Red Hat Technical Writing Style Guide

https://stylepedia.net/style/
226•jumpocelot•21h ago•110 comments

Operational Apple-1 Computer for sale [video]

https://www.youtube.com/watch?v=XdBKuBhdZwg
57•guiambros•2d ago•23 comments

Show HN: I built a playground to showcase what Flux Kontext is good at

https://fluxkontextlab.com
66•Zephyrion•1d ago•15 comments

Show HN: Open source alternative to Perplexity Comet

https://www.browseros.com/
241•felarof•18h ago•92 comments

Grok: Searching X for "From:Elonmusk (Israel or Palestine or Hamas or Gaza)"

https://simonwillison.net/2025/Jul/11/grok-musk/
431•simonw•11h ago•294 comments

Orwell Diaries 1938-1942

https://orwelldiaries.wordpress.com/page/2/
120•bookofjoe•18h ago•69 comments

Analyzing database trends through 1.8M Hacker News headlines

https://camelai.com/blog/hn-database-hype/
159•vercantez•3d ago•80 comments

Diffsitter – A Tree-sitter based AST difftool to get meaningful semantic diffs

https://github.com/afnanenayet/diffsitter
133•mihau•23h ago•34 comments