frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Inference Arena: Compare LLM performance across hardware, engines, and platforms

https://dria.co/inference-arena
2•driaforall•2h ago

Comments

driaforall•2h ago
We’ve been frustrated by how scattered LLM benchmarking has become. Endless Reddit threads, conflicting posts, and inconsistent metrics across GPUs and inference engines.

So we built Inference Arena, an open benchmarking hub where you can:

- Discover and compare inference results for open models across vLLM, SGLang, Ollama, MLX, and LM Studio

- See performance trade-offs for quantized versions

- Analyze throughput, latency, and cost side by side across hardware setups

- Explore benchmark data interactively or access it programmatically via MCP (Model Context Protocol)

You can also use it in Agent Mode where an agent can search, analyze, and compare results (and even fetch new ones from the web or subreddits).

We’d love feedback from you on:

- Which metrics matter most for your workflows (TTFT, TPS, memory, cost?)

- Other engines or quantization methods you’d like to see

- How we can make the data more useful for real-world inference tuning

MCP url: https://mcp-api-production-44d1.up.railway.app/ GitHub: https://github.com/firstbatchxyz/inference-arena

Why Circular AI Deals Among OpenAI, Nvidia, AMD Are Raising Eyebrows

https://www.bloomberg.com/news/articles/2025-10-08/the-circular-openai-nvidia-and-amd-deals-raisi...
1•1vuio0pswjnm7•4m ago•0 comments

Portland is not burning. Here's live context and sourced fact checks

https://isportlandburning.com/
2•dangle1•5m ago•0 comments

WebSockets vs. HTTP: Stop Choosing the Wrong Protocol

https://medium.com/@shivangsharma6789/websockets-vs-http-stop-choosing-the-wrong-protocol-fd0e92b...
1•thunderbong•11m ago•1 comments

All in on MatMul? Don’t Put All Your Tensors in One Basket!

https://www.sigarch.org/dont-put-all-your-tensors-in-one-basket-hardware-lottery/
1•matt_d•11m ago•0 comments

Collectives: Nextcloud App for projects to organize together

https://github.com/nextcloud/collectives
1•vpt•11m ago•0 comments

The current status of 8K movies

https://www.flatpanelshd.com/news.php?subaction=showfull&id=1759823393
1•indigodaddy•12m ago•0 comments

New study sheds light on how exercise helps lose weight

https://medicalxpress.com/news/2025-09-weight.html
2•PaulHoule•13m ago•1 comments

14 years later, Siri is again the key to Apple's future

https://www.macworld.com/article/2935321/siri-is-key-to-apples-future.html
1•CharlesW•13m ago•0 comments

Human Error Is the Point: On Teaching College During the Rise of AI

https://therumpus.net/2025/10/02/human-error-is-the-point-on-teaching-college-during-the-rise-of-ai/
1•CharlesW•16m ago•0 comments

AI at Play – Lessons from a silly benchmark

https://andreasthinks.me/posts/ai-at-play/
1•fragmede•16m ago•0 comments

Joan Kennedy, Who Married into a Dynasty, Dies at 89

https://www.nytimes.com/2025/10/08/us/joan-kennedy-dead.html
1•whack•17m ago•0 comments

Trump says he 'took the freedom of speech away' on flag burning

https://www.usatoday.com/story/news/nation/2025/10/08/trump-flag-burning-first-amendment-portland...
10•saubeidl•26m ago•3 comments

CryptoBanc, BancCrypto, Cryptofficium Consortium

https://www.instagram.com/banccrypto/
1•CryptoBanc•26m ago•0 comments

Show HN: Identiqwe – Collect Deterministic pixel art avatars from any text

https://identiqwe.maxcomperatore.com/
1•maxcomperatore•30m ago•0 comments

Show HN: A new platform for devs, hackers, and crypto folks

https://hashmate.app
1•DeveloperOne•30m ago•1 comments

Show HN: We turned browser screen recordings into customizable AI agents

https://gabrieloperator.com
1•vipin-tanna•32m ago•0 comments

AI and Deep Learning Accelerators Beyond GPUs in 2025

https://www.bestgpusforai.com/blog/ai-accelerators
1•javaeeeee•33m ago•1 comments

Visibility Engine for AI Models

https://twitter.com/alexmdees/status/1975607272123605484
1•sert_121•40m ago•0 comments

Show HN: A better way to run Bazel in Docker

https://github.com/ouillie/bazel-docker
1•bloppe•45m ago•0 comments

Scientists develop first accurate blood test to detect chronic fatigue syndrome

https://www.theguardian.com/society/2025/oct/08/scientists-say-they-have-first-blood-test-to-diag...
2•ryangibb•47m ago•1 comments

The Orphan Tsunami of 1700 [pdf]

https://pubs.usgs.gov/pp/pp1707/pp1707.pdf
1•oliverkwebb•49m ago•0 comments

Show HN: I Made Strava for Habits

https://www.trackwme.com/
1•jvmeshan•53m ago•0 comments

Internet Archive Ordered to Block Books in Belgium

https://torrentfreak.com/internet-archive-ordered-to-block-books-in-belgium-after-talks-with-publ...
2•gslin•53m ago•0 comments

Exploring a Self-Hosted Community Edition of Athenic AI (BYO-LLM)

1•AthenicDataOps•53m ago•0 comments

Show HN: We built an open source dev tool for OpenAI Apps SDK

https://www.mcpjam.com/blog/apps-sdk
3•matt8p•54m ago•0 comments

Show HN: LLM-use – Routing, caching and A/B testing for LLMs

https://github.com/llm-use/llm-use
1•justvugg•55m ago•0 comments

Apple Made ICE Agents a Protected Class

https://migrantinsider.com/p/scoop-apple-quietly-made-ice-agents
9•lukev•55m ago•2 comments

AI's grip on the S&P is total and Morgan Stanley's top analyst lays out case

https://finance.yahoo.com/news/top-analyst-very-concerned-nvidia-211320554.html
1•zerosizedweasle•56m ago•2 comments

Why no-code always fails and how AI tools like Frog just might change that

https://frogi.cc
1•reieicucv•56m ago•1 comments

Neutral

https://blog.webb.page/2025-10-08-neutral.txt
1•NetOpWibby•56m ago•0 comments