Show HN: LLM Simulation – Experience TTFT and tokens/SEC before investing

https://llmsimulation.ht-x.com
1•hertzdog•33m ago
I built a small tool to simulate the user experience of LLM response speeds, focusing on TTFT (time to first token) and tokens/second.

Instead of reading benchmark numbers, you can feel how fast or slow different configurations are by adjusting TTFT, token generation rate, and output length. It streams tokens the way an LLM would, but without generating real content.
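For readers who want to try the idea in a terminal, here is a minimal sketch of that kind of simulation. It is not the project's code: the function names, the placeholder tokens, and the injectable `sleep` parameter are assumptions made for illustration. It waits for the TTFT, then emits dummy tokens at a fixed rate.

```python
import time
from typing import Callable, Iterator


def simulate_stream(
    ttft_s: float,
    tokens_per_s: float,
    n_tokens: int,
    sleep: Callable[[float], None] = time.sleep,
) -> Iterator[str]:
    """Yield placeholder tokens with LLM-like timing (hypothetical helper)."""
    if tokens_per_s <= 0:
        raise ValueError("tokens_per_s must be positive")
    sleep(ttft_s)  # time to first token: nothing is shown until this elapses
    interval = 1.0 / tokens_per_s  # delay between consecutive tokens
    for i in range(n_tokens):
        if i > 0:
            sleep(interval)
        yield f"tok{i} "  # dummy content, no real model involved


def total_time(ttft_s: float, tokens_per_s: float, n_tokens: int) -> float:
    """Wall-clock time the simulated response takes, in seconds."""
    return ttft_s + max(n_tokens - 1, 0) / tokens_per_s


if __name__ == "__main__":
    # Example settings (made up): 0.8 s TTFT, 20 tokens/s, 60 tokens.
    for token in simulate_stream(0.8, 20.0, 60):
        print(token, end="", flush=True)
    print()
```

Passing a different `sleep` function makes the timing easy to inspect without actually waiting.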

I was wondering which Apple machine I should buy, so I built this over a weekend to get a better feel for what running a model locally is actually like.

The project/toy is public on GitHub too: https://github.com/htxsrl/localllmsimulation

Thanks to the sources (cited) for the real benchmarks, which let me fit a small ML model that can extrapolate even to futuristic hardware (like an imaginary M9 with 2048 GB of RAM and 3000 GB/s of memory bandwidth).
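As a rough sanity check on such extrapolations, a common rule of thumb (not the fitted model described above, and stated here as an assumption) is that token generation on a dense model is limited by memory bandwidth, because every generated token requires reading all the weights once. That gives an optimistic upper bound on tokens/second. The function name and the example numbers below are hypothetical, and the bandwidth figure is read as GB/s.

```python
def estimate_decode_tps(bandwidth_gb_s: float, model_size_gb: float) -> float:
    """Optimistic upper bound on decode speed for a dense model.

    Assumes generation is purely memory-bandwidth bound: each token
    reads the full set of weights once. Real throughput is lower.
    """
    if model_size_gb <= 0:
        raise ValueError("model_size_gb must be positive")
    return bandwidth_gb_s / model_size_gb


if __name__ == "__main__":
    # Imaginary 3000 GB/s machine running a model with 40 GB of weights.
    print(estimate_decode_tps(3000.0, 40.0))
```

This ignores compute limits, KV-cache reads, and prompt processing, so it says nothing about TTFT.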

Comments

ndgold•13m ago
Lovely, yoinked.

Also, I’m seeing check marks next to all quants, which confused me a little when trying to select one.

hertzdog•11m ago
Thanks! I added the check marks because when I was testing different quantizations, I often picked a model and only afterwards discovered that it couldn’t even load — it just didn’t fit into RAM or VRAM.

So the check mark simply indicates that the model can actually run under those constraints (fits in memory), not that it’s selected.
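A fits-in-memory check like the one described could be sketched as follows. This is an illustration under stated assumptions, not the tool's actual logic: weight size is approximated as parameter count times bits per weight, and `overhead_gb` is a made-up allowance for the KV cache and runtime.

```python
def weights_size_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the weights in GB (1 GB = 1e9 bytes)."""
    # params_billion * 1e9 params * (bits / 8) bytes, divided by 1e9 bytes per GB
    return params_billion * bits_per_weight / 8


def fits_in_memory(
    params_billion: float,
    bits_per_weight: float,
    memory_gb: float,
    overhead_gb: float = 2.0,  # hypothetical allowance, not a measured value
) -> bool:
    """True if the quantized model plausibly loads in the given RAM/VRAM."""
    return weights_size_gb(params_billion, bits_per_weight) + overhead_gb <= memory_gb


if __name__ == "__main__":
    # A 70B model on a 48 GB machine at 4-bit versus 16-bit weights.
    print(fits_in_memory(70, 4, 48))
    print(fits_in_memory(70, 16, 48))
```

Real quantization formats store extra scale data per block, so effective bits per weight are usually a bit higher than the nominal figure.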

Make Money Not War: Trump's Real Plan for Peace in Ukraine

https://www.wsj.com/world/russia/russia-u-s-peace-business-ties-4db9b290
1•richardatlarge•14s ago•0 comments

RingChime – 65,000 restored phone ringtones from 170 brands

https://lockchime.com/ringchime/
1•gogyjay•1m ago•1 comment

Firesign Theatre: greatest satirists of 20th century techno-romanticism

https://magazine.mindplex.ai/post/firesign-theatre-the-greatest-satirist-of-20th-century-media-cu...
1•MilnerRoute•2m ago•0 comments

In Northern Scotland, the Neolithic Age Never Ended

https://www.newyorker.com/magazine/2025/12/01/in-northern-scotland-the-neolithic-age-never-ended
1•samizdis•3m ago•0 comments

Student Perceptions of AI Coding Assistants in Learning

https://arxiv.org/abs/2507.22900
2•victorbuilds•3m ago•0 comments

Design-a-Protein.com

https://design-a-protein.com
1•tjala•4m ago•0 comments

Ask HN: Is there a HN but more business/startup oriented?

1•vasilzhigilei•9m ago•0 comments

Self-driving cars will transform urban economies

https://www.economist.com/finance-and-economics/2025/11/27/self-driving-cars-will-transform-urban...
1•adidoit•10m ago•0 comments

Show HN: Web Checker – Browser extension for cycling through website lists

https://chromewebstore.google.com/detail/web-checker/cbcnciigmdlengjcbieeolembcagmoba
1•NickeaTea•10m ago•0 comments

Finding Flowers in Chaos

https://pollrobots.com/blog/2025-11-28-finding-flowers/
1•pacaro•11m ago•0 comments

MetaFun: Compile Haskell-like code to C++ template metaprograms

https://gergo.erdi.hu/projects/metafun/
2•Philpax•12m ago•0 comments

An approach to playing backing chords for Irish Traditional Music

https://irishchords.com/
2•upamj•12m ago•0 comments

Teleprompt Overlay

1•edihasaj•14m ago•0 comments

Create Repeatable Success

https://www.frankblecha.com/blog/create-repeatable-success.md/
1•sweetgiorni•14m ago•0 comments

The Forgotten Roman Ruins of the ‘Pompeii of the Middle East’

https://news.artnet.com/art-world/huge-jerash-jordan-pompeii-middle-easy-2708480
1•pseudolus•15m ago•0 comments

Ukraine hits two Russian 'shadow fleet' tankers with drones

https://www.reuters.com/business/aerospace-defense/ukraine-hit-two-shadow-fleet-tankers-with-dron...
1•geox•17m ago•0 comments

Heavy metal, a new tune for Taiwan diplomacy

https://www.taipeitimes.com/News/taiwan/archives/2025/11/30/2003848074
1•giuliomagnifico•19m ago•0 comments

Zero Knowledge Proof of Compositeness

https://www.johndcook.com/blog/2025/11/29/zkp-composite/
3•ColinWright•24m ago•0 comments

Show HN: Push local LLMs to max speed without overheating

https://github.com/laithrw/llm-threader
1•nate_rw•24m ago•0 comments

Size Matters

https://matklad.github.io/2025/11/28/size-matters.html
1•ibobev•26m ago•0 comments

Duplication Isn't Always an Anti-Pattern

https://medium.com/@HobokenDays/rethinking-duplication-c1f85f1c0102
2•HideInNews•26m ago•0 comments

MUM-Based Hash Functions

https://vnmakarov.github.io/performance/optimization/2025/11/25/mum-based-hash-functions.html
1•ibobev•26m ago•0 comments

Compiled ZX Spectrum Basic and Z88DK Added to Online Retro IDE

https://retrogamecoders.com/zx-spectrum-basic-z88dk/
2•ibobev•29m ago•0 comments

Show HN: Chess on a Donut/Torus and Deep-Dive

https://mchess.io/donut
1•mannymakes•30m ago•0 comments

Ask HN: Recreate Ghost of Tsushima's tales animation?

1•shlip•33m ago•0 comments

Ask HN: Is it possible to get 1000 users in 10 days?

2•Mikecraft•33m ago•0 comments

OCaml maintainers reject massive AI-generated pull request

https://devclass.com/2025/11/27/ocaml-maintainers-reject-massive-ai-generated-pull-request/
3•Qem•35m ago•0 comments

Japan Unveils Human Washing Machine, Now You Can Get Washed Like Laundry

https://www.ndtv.com/offbeat/japan-launches-human-washing-machine-for-public-use-after-expo-succe...
2•Terretta•42m ago•0 comments

I built a powerful tool for YouTube Creators

https://commentscope.co/
2•sanky369•47m ago•0 comments