frontpage.

I made this simple tool to compare local LLMs. Any provider that supports OpenAI-like APIs can be used (LMStudio, Llama.cpp, Ollama) but you can also use Openrouter/OpenAI if you change the base URL accordingly.

In my opinion it is not particularly useful for comparing different models from different companies since some models are optimized heavily on math or even trained on AIME problems.

However it is really useful for testing different quantizations of the same model or the same quantization from different providers.

Let me know what you think about it!

Also check the README to see some examples of the results you will get from it.

The robot vacuums we've tested

Real TikTokers are pretending to be Veo 3 AI creations for fun, attention

OAuth 2.0 Flows

Codex Seraphinianus: The Weirdest Book

Ask HN: Simple open source to remove the bright background on a web page?

A Future of Software Development?

Ask HN: Why are dating apps so bad? Why hasn't anyone made a good one?

DreaMS: Self-Supervised Molecular Representations from Mass Spectra

Ask HN: Should teachers explain in what sense the Earth rotates around the Sun?

How to Agentically Deconstruct CC

A Glucose Monitor for Someone Without Diabetes: Optimal or Overkill?

The Custodial Stablecoin Rekt Test

Innovations in aging biology: highlights from the ARDD emerging science workshop

Black Death bacterium has become less lethal after genetic tweak

What if we stop treating security testing as a separate thing?

The Visual World of 'Samurai Jack'

Starship: Dead End?

AI Is Learning to Escape Human Control

Programming languages and Linux commands in Spanish

Psyclone Media

Show HN: Ramsey – A desktop pet that eats your files and gives you achievements

Show HN: Deep Research – Open-Source Customizable Reasoning Framework for Devs

The Novelist in the Age of AI

Show HN: Swiftor – AI Hacking Platform [cheap vms, 20 models, voice-chat, MCP]

The v0 composite model family

An AI bot might be asking the questions at your next job interview

Show HN: I made a Custom GPT to master business automation

Show HN: I built a fun way to learn why startups succeed/fail 1 guess at a time

HugstonOne 1.0.3 the New Version

Breaking the Sorting Barrier for Directed Single-Source Shortest Paths [pdf]

Show HN: Local LLM AIME benchmarking tool