frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

LMArena Is a Cancer on AI

https://surgehq.ai/blog/lmarena-is-a-plague-on-ai?r=greg
6•gk1•1d ago

Comments

halbgut•1d ago
Like any LLM benchmark, LMArena is highly flawed. I do think it has a right to exist. For me anecdotally it has been indicative of which LLMs style I like best, not necessarily its factual accuracy. It hasn't however been a very useful tool to find the best LLM for a given job.

To the article's point though, it's treated as the gold standard, which it isn't. We should have learned that with the sycophancy-gate.

I'm not sure if the methodology here really is sound for the question at hand. It's a bit like saying, oh prediction markets don't work because 40% of people that voted were wrong.

You can't really get around running your own benchmarks for the job at hand, if you really want to get 95th-percentile performance on a task.

What questions should smart people ask about videos of police/military killings?

https://www.facebook.com/DavidGLarson/posts/how-should-smart-people-interpret-policemilitary-acti...
2•QuantumGood•1m ago•1 comments

ICE kills woman in MN on Jan 7, 2025 [video]

https://old.reddit.com/r/law/comments/1q6o4d0/another_angle_of_ice_shooting_woman_in_mn_172025/
2•martythemaniak•1m ago•0 comments

Show HN: Corli – Major update focused on stability and new features

https://www.corli.app/
1•zipqt•3m ago•0 comments

Snoringpunch

https://snoringpunch.vercel.app/
2•M0HD197•4m ago•1 comments

A 200-year-old book distributor is closing

https://www.npr.org/2026/01/07/nx-s1-5668426/libraries-books-distributor-closing
2•andsoitis•6m ago•0 comments

Will AI-powered humanoid robots someday work alongside us? [60 Minutes] [video]

https://www.youtube.com/watch?v=CbHeh7qwils
1•indigodaddy•7m ago•0 comments

Show HN: The kissing number theorem predicts particle masses from sphere packing

https://colab.research.google.com/drive/1_zDIOONfs4WvnpG7GDEH6hzSM25Fsu93?usp=sharing
2•AlekseN•11m ago•1 comments

Pink Ranger–Dressed Hacker Takes Down White Supremacist Websites Live Onstage

https://gizmodo.com/hacker-dressed-as-the-pink-ranger-takes-down-white-supremacist-websites-live-...
2•mrzool•11m ago•1 comments

The Personal Panopticon

https://twitter.com/mollycantillon/status/2008918474006122936
1•delichon•12m ago•1 comments

Fresh Onion Directory – Whereis.it.com

https://whereis.it.com
1•TheServitor•13m ago•0 comments

ICE agent fatally shoots woman in Minneapolis

https://www.reuters.com/world/us/us-federal-agent-involved-minneapolis-shooting-during-immigratio...
9•mraniki•13m ago•2 comments

Show HN: PAlignPrims – C++ library for sequence alignment beyond bioinformatics

https://github.com/offbynull/palignprims
1•offbynull•15m ago•0 comments

Campaigns Are Knowledge Workers and the Tools Just Caught Up

https://matthodges.com/posts/2026-01-07-ai-agents-campaigns/
1•m-hodges•16m ago•0 comments

Lack of Sweet-Receptor Gene Accounts for Cats' Indifference Toward Sugar (2005)

https://web.archive.org/web/20060423082857/http://genetics.plosjournals.org/perlserv/?request=get...
1•bookofjoe•16m ago•0 comments

Show HN: An offline first, with state-in-URL, workout planning and tracking app

https://mateuszitelli.github.io/trainlink/#2nZfbbts4EIbfhdcuwPPBd0n2VKCLFk3vFoWg2GosrCNnJbntIsi77...
1•mzitelli•17m ago•0 comments

Tool UI: Component library for tool calls

https://www.tool-ui.com
1•petekp•18m ago•0 comments

Show HN: NewsMap – local news on a map (like Zillow but for news)

https://newsmap.me/
1•ajones05•18m ago•1 comments

Space Agency Confirms Breach – Hackers Claim 200 GB of Data Stolen

https://www.forbes.com/sites/daveywinder/2026/01/04/space-agency-confirms-breach---hackers-claim-...
1•vodou•19m ago•0 comments

Show HN: PostureGuard – Free posture monitoring using webcam

https://posture-guard-theta.vercel.app/
1•fanel•20m ago•0 comments

So you wanna de-bog yourself

https://www.experimental-history.com/p/so-you-wanna-de-bog-yourself
1•calvinfo•20m ago•0 comments

Show HN: Anyware – Remote Control for Claude Code

https://anyware.run/
1•igorzij•21m ago•0 comments

The application of AI tools to Erdos problems passes a milestone

https://mathstodon.xyz/@tao/115855840223258103
1•ColinWright•21m ago•0 comments

MIT 15.773 Hands-On Deep Learning Spring 2024 [video]

https://www.youtube.com/watch?v=kyQ0CRkYhy4
1•mdp2021•22m ago•0 comments

Water Heater Mines Bitcoin. It Could Help Solve AI's Energy Problem

https://www.cnet.com/home/energy-and-utilities/superheat-bitcoin-water-heater-ces-2026/
1•rmason•26m ago•0 comments

Tips to Read More This Coming Year

https://www.millersbookreview.com/p/10-tips-to-read-more-this-coming-year
2•ingve•28m ago•0 comments

ChatGPT is losing market share as Google Gemini gains ground

https://www.bleepingcomputer.com/news/artificial-intelligence/chatgpt-is-losing-market-share-as-g...
1•speckx•28m ago•0 comments

Study examines carbon footprint of wearable health tech

https://news.cornell.edu/stories/2026/01/study-examines-carbon-footprint-wearable-health-tech
1•JeanKage•29m ago•0 comments

Why sports stars who head the ball are more likely to die of Alzheimer's

https://www.bbc.com/future/article/20260106-the-health-dangers-of-heading-the-ball-in-sport
1•breve•30m ago•0 comments

Search your past ChatGPT, Claude and perplexity chats with context

https://github.com/siv-io/Index-AI-Chat-Search
2•siv_io_•30m ago•0 comments

Operation Absolute Resolve: How the US Captured Nicolas Maduro

https://www.dailymail.co.uk/news/article-15435381/Nicolas-Maduro-captured-reconstruction-Trump-Op...
1•febed•31m ago•0 comments