I'm a software engineer working on C++/Python/robotics by day, dabbling in web apps by night. I built The Frontier (https://the-frontier.app) because the LLM market is moving so fast it's hard to tell if you're overpaying for performance.
Pricing is easy to find, but it's hard to tell if you're missing a similarly priced or even cheaper model with better performance. So I built a visualization that maps LM Arena’s Elo scores against OpenRouter’s pricing.
The main thing it does is calculate the Pareto frontier. It highlights the optimal models at each price point, so you can easily spot when a model is technically a "bad deal" compared to its peers.
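The frontier computation itself is simple: a model is on the frontier if no other model is both cheaper and higher-rated. Here's a minimal sketch of the idea (the function name and the (name, price, elo) tuple shape are my illustration, not the site's actual code):

```python
def pareto_frontier(models):
    """Return names of non-dominated models: nothing else is both
    cheaper and higher-Elo.  models: list of (name, price, elo)."""
    frontier = []
    best_elo = float("-inf")
    # Sort by price ascending (ties: higher Elo first); a model is on the
    # frontier iff it beats the best Elo seen so far at lower prices.
    for name, price, elo in sorted(models, key=lambda m: (m[1], -m[2])):
        if elo > best_elo:
            frontier.append(name)
            best_elo = elo
    return frontier
```

A single sorted pass like this runs in O(n log n), which is plenty for a few hundred models.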
The hard part: the real headache wasn't the UI, it was the messy data. LM Arena names models one way (e.g. "qwen3-coder-480b-a35b-instruct"), OpenRouter another ("qwen/qwen3-coder"), and you have to deal with a mess of variants like "thinking", "instruct", "fast", or "v1.0" vs "v1". I ended up building an automated scoring system to match these models so the chart stays clean without manual mapping.
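To give a flavor of the matching, here's a toy version of the kind of scoring that handles the examples above. This is a simplified sketch, not the site's actual matcher; the token list and thresholds are illustrative:

```python
import re

# Variant suffixes that shouldn't affect identity matching (illustrative set).
VARIANT_TOKENS = {"instruct", "thinking", "fast", "chat", "preview"}

def tokens(name):
    """Normalize a model name into comparable tokens."""
    name = name.split("/")[-1].lower()              # drop "vendor/" prefix
    name = re.sub(r"\bv(\d+)\.0\b", r"v\1", name)   # "v1.0" -> "v1"
    parts = re.split(r"[-_.\s]+", name)
    return [t for t in parts if t and t not in VARIANT_TOKENS]

def match_score(a, b):
    """Jaccard overlap of normalized tokens, in [0, 1]."""
    ta, tb = set(tokens(a)), set(tokens(b))
    return len(ta & tb) / len(ta | tb)
```

With a score like this you can auto-accept matches above some threshold and flag the rest for review, which is roughly how you keep the chart clean without hand-maintaining a mapping table.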
I'm pretty happy with the result; I find myself surfing the frontier (literally), going up and down it to find the best model for my use case and budget.
The Tech:
- React + Vite
- ECharts for the visualization
- A daily sync to keep the chart up-to-date with new releases
I also just added latency and throughput metrics, because sometimes speed is just as important as intelligence.
I'd love to hear what you think, especially if you spot any weird model matches (unfortunately, they still happen) or have ideas for what to add next! I have a few ideas, like combining latency and throughput into one metric, or even intelligence, latency, and throughput together. I'll call it Wisdom :)
URL: https://the-frontier.app/
Thanks!