frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Show HN: A tool to benchmark LLM APIs (OpenAI, Claude, local/self-hosted)

https://llmapitest.com/
34•mrqjr•9h ago
I recently built a small open-source tool to benchmark different LLM API endpoints — including OpenAI, Claude, and self-hosted models (like llama.cpp).

It runs a configurable number of test requests and reports two key metrics: • First-token latency (ms): How long it takes for the first token to appear • Output speed (tokens/sec): Overall output fluency

Demo: https://llmapitest.com/ Code: https://github.com/qjr87/llm-api-test

The goal is to provide a simple, visual, and reproducible way to evaluate performance across different LLM providers, including the growing number of third-party “proxy” or “cheap LLM API” services.

It supports: • OpenAI-compatible APIs (official + proxies) • Claude (via Anthropic) • Local endpoints (custom/self-hosted)

You can also self-host it with docker-compose. Config is clean, adding a new provider only requires a simple plugin-style addition.

Would love feedback, PRs, or even test reports from APIs you’re using. Especially interested in how some lesser-known services compare.

Comments

mdhb•7h ago
In what universe is a post created by a new account with zero comments and a grand total of 2 votes over the course of 2 hours doing on the front page?
iRomain•6h ago
LLM
vntok•2h ago
It's an informative post about new tech, that fits pretty well here of all places.

Why would you want the author to write about something else to validate the post? That would be an appeal to authority, which is the complete opposite of what the Hacker Manifesto has always been about in terms of ethos, goals, etc.

bdangubic•2h ago
I am polishing up my blog about some FORTRAN code I wrote last week in hopes of the same :)
swyx•4h ago
idk what it is but buying that domain made it seem more commercial and therefore less trustworthy. also most people prob want to just use artificialanalysis' numbers rather than self run benchmarks (but this is ok if want to run your own)

I made my VM think it has a CPU fan

https://wbenny.github.io/2025/06/29/i-made-my-vm-think-it-has-a-cpu-fan.html
414•todsacerdoti•11h ago•96 comments

Bitcoin's Security Budget Issue: Problems, Solutions and Myths Debunked

https://budget.day/
51•marcbarbosa•3h ago•27 comments

Ask HN: What Are You Working On? (June 2025)

87•david927•5h ago•297 comments

The Book of Shaders

https://thebookofshaders.com/
22•max_•3d ago•2 comments

Cell Towers Can Double as Cheap Radar Systems for Ports and Harbors (2014)

https://spectrum.ieee.org/cell-tower-signals-can-improve-port-security
25•transpute•3h ago•18 comments

Revisiting Knuth's "Premature Optimization" Paper

https://probablydance.com/2025/06/19/revisiting-knuths-premature-optimization-paper/
63•signa11•3d ago•28 comments

Modelling API rate limits as diophantine inequalities

https://vivekn.dev/blog/rate-limit-diophantine
37•viveknathani_•2d ago•4 comments

Finding a former Australian prime minister’s passport number on Instagram (2020)

https://mango.pdf.zone/finding-former-australian-prime-minister-tony-abbotts-passport-number-on-instagram/
53•guiambros•3h ago•6 comments

Show HN: Octelium – FOSS Alternative to Teleport, Cloudflare, Tailscale, Ngrok

https://github.com/octelium/octelium
282•geoctl•13h ago•111 comments

Nearly 20% of cancer drugs defective in 4 African nations

https://www.dw.com/en/nearly-20-of-cancer-drugs-defective-in-4-african-nations/a-73062221
49•woldemariam•1h ago•12 comments

Error handling in Rust

https://felix-knorr.net/posts/2025-06-29-rust-error-handling.html
93•emschwartz•4h ago•75 comments

4-10x faster in-process pub/sub for Go

https://github.com/kelindar/event
102•kelindar•10h ago•26 comments

Bloom Filters by Example

https://llimllib.github.io/bloomfilter-tutorial/
183•ibobev•13h ago•28 comments

We accidentally solved robotics by watching 1M hours of YouTube

https://ksagar.bearblog.dev/vjepa/
48•alexcos•9h ago•47 comments

Many ransomware strains will abort if they detect a Russian keyboard installed (2021)

https://krebsonsecurity.com/2021/05/try-this-one-weird-trick-russian-hackers-hate/
191•air7•6h ago•118 comments

Using the Internet without IPv4 connectivity

https://jamesmcm.github.io/blog/no-ipv4/
253•jmillikin•16h ago•111 comments

The Medley Interlisp Project: Reviving a Historical Software System [pdf]

https://interlisp.org/documentation/young-ccece2025.pdf
78•pamoroso•10h ago•6 comments

China Dominates 44% of Visible Fishing Activity Worldwide

https://oceana.org/press-releases/china-dominates-44-of-visible-fishing-activity-worldwide/
65•scubakid•3h ago•32 comments

Commodore acquired for a 'low seven figure' price – CEO from retro community

https://www.tomshardware.com/video-games/retro-gaming/commodore-acquired-for-a-low-seven-figure-price-new-acting-ceo-comes-from-the-retro-community
27•amichail•2h ago•3 comments

Several Anker power bank models recalled: Supplier's batteries pose fire hazard

https://www.heise.de/en/news/Fire-hazard-Anker-recalls-several-power-bank-models-also-in-Germany-10463193.html
17•Eduard•1h ago•7 comments

Loss of key US satellite data could send hurricane forecasting back 'decades'

https://www.theguardian.com/us-news/2025/jun/28/noaa-cuts-hurricane-forecasting-climate
235•trauco•7h ago•108 comments

Show HN: Rust -> WASM, K-Means Color Quantization Crate for Image-to-Pixel-Art

https://github.com/gametorch/image_to_pixel_art_wasm
26•gametorch•3d ago•4 comments

Oldest boomerang doesn't come back

https://www.bbc.co.uk/news/articles/cren818q5x1o
5•ljf•3d ago•1 comments

The $25k car is going extinct?

https://media.hubspot.com/why-the-25000-car-is-going-extinct
39•pseudolus•9h ago•49 comments

The Unsustainability of Moore's Law

https://bzolang.blog/p/the-unsustainability-of-moores-law
125•shadyboi•18h ago•101 comments

Raymond Laflamme (1960-2025)

https://scottaaronson.blog/?p=8949
11•stmw•2d ago•2 comments

Reverse Engineering the Microchip CLB

http://mcp-clb.markomo.me/
19•_Microft•3h ago•3 comments

Show HN: A tool to benchmark LLM APIs (OpenAI, Claude, local/self-hosted)

https://llmapitest.com/
34•mrqjr•9h ago•5 comments

More on Apple's Trust-Eroding 'F1 the Movie' Wallet Ad

https://daringfireball.net/2025/06/more_on_apples_trust-eroding_f1_the_movie_wallet_ad
846•dotcoma•17h ago•530 comments

Anticheat Update Tracking

https://not-matthias.github.io/posts/anticheat-update-tracking/
5•not-matthias•4h ago•0 comments