Show HN: Tokenflood – simulate arbitrary loads on instruction-tuned LLMs

https://github.com/twerkmeister/tokenflood
10•twerkmeister•6d ago
Hi everyone, I just released an open source load testing tool for LLMs:

https://github.com/twerkmeister/tokenflood

=== What is it and what problems does it solve? ===

Tokenflood is a load testing tool for instruction-tuned LLMs that can simulate arbitrary LLM loads in terms of prompt, prefix, and output lengths and requests per second. Instead of first collecting prompt data for different load types, you configure the desired parameters for your load test and you are good to go. It also lets you assess the latency effects of potential prompt parameter changes before spending the time and effort to implement them.

I believe it's really useful for developing latency-sensitive LLM applications and for:

* Load testing self-hosted LLM setups
* Assessing the latency benefit of changes to prompt parameters before implementing those changes
* Assessing latency and its intraday variation on hosted LLM services before sending your traffic there
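To make the idea of a load defined purely by parameters more concrete, here is a minimal Python sketch. It is not Tokenflood's API or config format: `LoadProfile`, `arrival_times`, and `token_budget` are hypothetical names invented for illustration, and the Poisson arrival model is an assumption about how requests per second could be turned into a schedule.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class LoadProfile:
    """Hypothetical load description (NOT Tokenflood's actual schema)."""
    prompt_tokens: int          # total input tokens per request
    prefix_tokens: int          # leading tokens shared by all requests
    output_tokens: int          # tokens the model is asked to generate
    requests_per_second: float  # average request rate
    duration_seconds: float     # length of the test run


def arrival_times(profile: LoadProfile, seed: int = 0) -> list[float]:
    """Return request start times in seconds, assuming Poisson arrivals.

    Gaps between requests are drawn from an exponential distribution
    with mean 1 / requests_per_second.
    """
    rng = random.Random(seed)
    times: list[float] = []
    t = rng.expovariate(profile.requests_per_second)
    while t < profile.duration_seconds:
        times.append(t)
        t += rng.expovariate(profile.requests_per_second)
    return times


def token_budget(profile: LoadProfile) -> dict[str, int]:
    """Split each request into shared prefix, unique prompt part, and output."""
    if profile.prefix_tokens > profile.prompt_tokens:
        raise ValueError("prefix_tokens cannot exceed prompt_tokens")
    return {
        "shared_prefix": profile.prefix_tokens,
        "unique_prompt": profile.prompt_tokens - profile.prefix_tokens,
        "output": profile.output_tokens,
    }
```

With a profile like this, trying out a prompt change (say, a longer shared prefix or a shorter output) means editing a number and rerunning the test, with no need to collect new prompt data first.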

=== Why did I build it? ===

Over the course of the past year, part of my work has been helping my clients meet their latency, throughput, and cost targets for LLMs (PTUs, anyone?). That process involved making numerous choices about cloud providers, hardware, inference software, models, configurations, and prompt changes. During that time I found myself running similar tests over and over with a collection of ad hoc scripts. I finally had some time on my hands and wanted to put it all together properly in one tool.

=== What am I looking for? ===

I am sharing this for three reasons: hoping it can make others' work on latency-sensitive LLM applications simpler, learning and improving from feedback, and finding new projects to work on.

So please check it out on GitHub (https://github.com/twerkmeister/tokenflood), comment, and reach out at thomas@werkmeister.me or on LinkedIn (https://www.linkedin.com/in/twerkmeister/) for professional inquiries.

=== Pics ===

image of cli interface: https://github.com/twerkmeister/tokenflood/blob/main/images/...

result image: https://github.com/twerkmeister/tokenflood/blob/main/images/...
