frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: State of the Art of Coding Models, According to Hacker News Commenters

https://hnup.date/hn-sota
15•yunusabd•1h ago
Hello HN,

I was away from my computer for two weeks, and after coming back and reading the latest discussions on HN about coding assistants (models, harnesses), I felt very out of the loop. My normal process would have been to keep reading and figure out the latest and greatest from people's comments, but I wanted to try and automate this process.

Basically the goal is to get a quick overview over which coding models are popular on HN. A next iteration could also scan for harnesses that people use, or info on self-hosting or hardware setups.

I wrote a short intro on the page about the pipeline that collects and analyzes the data, but feel free to ask for more details or check the Google Sheet for more info.

https://hnup.date/hn-sota

Comments

jdw64•1h ago
Interpreting these metrics is quite interesting.

One thing for sure is that while Claude is currently taking the #1 spot in mentions, it carries a lot of negative sentiment due to API pricing policies and frequent server downtime. On the other hand, the runner-up, GPT-5.5, actually seems to have more positive feedback.

Personally, my experience with Codex wasn't as good as with Claude Code (Codex freezes on Windows more often than you'd expect), so this is a bit surprising. That said, the more defensive GPT is definitely better in terms of sheer code-writing capability. However, GPT actually has quite a few issues with text corruption when generating in Korean or Chinese—something English-speaking users probably don't notice. In terms of model capabilities, when given the same agent.md (CLAUDE.md) file, I think GPT is better at writing code, while Claude is better at writing text during code reviews.

Looking at the bottom right, Qwen and DeepSeek are open-source, so they are largely mentioned in the context of guarding against vendor lock-in, which drives positive sentiment. Considering that Hacker News occasionally shows negative sentiment toward China, the fact that they are viewed this positively—unlike US models—shows that being open-source is a massive advantage in itself.

Anyway, one thing for sure is that Gemini is pretty much unusable.

Jabbles•57m ago
Please fix your graph so the names of the models are readable
marcuskaz•39m ago
Also, the stacked graph only allows you to quickly see total mentions, really hard to compare negative or positive sentiment across models at a glance.
yakkomajuri•57m ago
"Prompts an LLM" -> which LLM?

I saw you're using Gemini for the sentiment rating (which I guess you picked because it's not often mentioned and thus "neutral"? lol)

But would be interesting to get more details overall

ranger_danger•37m ago
Just FYI this article seems to define "start of the art" as "popular", as measured by "total mentions and user sentiment", without any bearing on the technical abilities or actual usage of the model.
mellosouls•25m ago
That's pretty much exactly what the title says.

The technical abilities and usage are derived from the commenters usage reflections.

How the Legal Opium Market Shaped Global Trade–and Led to an Opioid Crisis

https://www.bu.edu/articles/2026/how-the-legal-opium-market-led-to-an-opioid-crisis/
1•hhs•24s ago•0 comments

Former head of 'Pentagon's think tank' joins Anthropic

https://www.defenseone.com/technology/2026/05/former-head-pentagons-think-tank-joins-anthropic/41...
1•Jimmc414•2m ago•0 comments

Tesla owner won $10k in court for Tesla's FSD lies. Tesla is still fighting him

https://electrek.co/2026/05/02/this-tesla-owner-won-10k-in-court-for-teslas-fsd-lies-tesla-is-sti...
1•breve•4m ago•0 comments

Show HN: Language app with spaced repetition and comprehensible input

1•ChadNauseam•4m ago•0 comments

The Claude Delusion: Richard Dawkins believes his AI chatbot is conscious

https://www.dailygrail.com/2026/05/the-claude-delusion-richard-dawkins-believes-his-female-ai-cha...
1•SwellJoe•5m ago•0 comments

Google Summer of Code 2026 selected projects

https://blog.rust-lang.org/2026/04/30/gsoc-2026-selected-projects/
1•kazu11max17•7m ago•0 comments

AI agents are briefly overhyped

https://stevekrouse.com/agent-hype
1•stevekrouse•17m ago•0 comments

To Make Orchestras More Diverse, End Blind Auditions

https://www.nytimes.com/2020/07/16/arts/music/blind-auditions-orchestras-race.html
1•bilsbie•19m ago•0 comments

Meta faces New Mexico trial that could force change to Facebook, other platforms

https://www.reuters.com/legal/government/meta-faces-new-mexico-trial-that-could-force-changes-fac...
3•1659447091•28m ago•0 comments

The Race Is on to Find the Treasure Buried in San Francisco

https://www.nytimes.com/2026/05/02/us/san-francisco-buried-treasure-chest.html
1•mistersquid•31m ago•0 comments

AWS Lightsail's $0.09/GB Bandwidth Overage Is a Trap for Small Projects

https://galaxycloudsolutions.com/blog/aws-lightsail-vs-galaxy-cloud-solutions/
2•rougereaper420•32m ago•0 comments

With $1 Cyberattacks on the Rise, Durable Defenses Pay Off

https://spectrum.ieee.org/ai-cyberattacks-memory-safe-code
1•rbanffy•40m ago•0 comments

Coatue has a plan to buy up land for data centers, possibly for Anthropic

https://techcrunch.com/2026/05/01/coatue-has-a-plan-to-buy-up-land-for-data-centers-possibly-for-...
1•Brajeshwar•40m ago•0 comments

The Computer Programme Episode 1, 1982 [video]

https://archive.org/details/the_computer_programme_ep01
2•petethomas•41m ago•0 comments

Voice-AI-for-Beginners – A curated learning path for developers

https://github.com/mahimairaja/voiceai
2•mahimai•46m ago•0 comments

Restorative Yoga and the Biology of Belonging

https://parrik.com/puzzles/the-partition-problem/
1•parrik•46m ago•0 comments

Facepunch launches s&box, the highly anticipated successor to Garry's Mod

https://www.gamingonlinux.com/2026/04/facepunch-launches-s-box-the-highly-anticipated-successor-t...
5•embedding-shape•48m ago•1 comments

Dynamic Traefik configuration with multiple Docker hosts

https://blog.vasi.li/automating-mantrae-traefik-management-with-mantrae-agent/
2•vsviridov•49m ago•0 comments

Grinta – Local-first coding agent, 7 months solo, open source today

https://github.com/josephsenior/Grinta-Coding-Agent
1•YoussefMejdi•50m ago•1 comments

Trump's border wall expansion just bulldozed an ancient tribal site

https://www.washingtonpost.com/climate-environment/2026/04/30/border-wall-damage-indigenous-arizona/
5•gnabgib•52m ago•0 comments

What Is GStack? Gary Tan's Open-Source Startup Framework for Claude Code

https://www.mindstudio.ai/blog/what-is-gstack-gary-tan-claude-code-framework
2•evo_9•55m ago•0 comments

The physics slop that YouTube wants me to make [video]

https://www.youtube.com/watch?v=Cd5EHfRerGI
2•surprisetalk•58m ago•0 comments

Built this for my civil engineering firm's static site on Cloudflare Pages

https://github.com/bwengr/knowledge-base-spec
1•bwengr•58m ago•0 comments

How to run a cross-cutting campaign

https://parrik.com/puzzles/the-campaign-cascade/
1•parrik•1h ago•0 comments

NovAST

https://github.com/sharkkyyy10/NovAST
3•sharkkyyy10•1h ago•0 comments

The Apprehension Engine (2022)

https://guitar.com/features/interviews/the-apprehension-engine-most-terrifying-musical-instrument/
1•turtleyacht•1h ago•1 comments

A self was never flat

https://parrik.com/puzzles/know-thyself/
1•parrik•1h ago•0 comments

Martian Glaciers with Drones

https://nautil.us/uncovering-hidden-martian-glaciers-with-drones-1280400
1•Brajeshwar•1h ago•0 comments

talkie-coder: From 1930 to SWE-bench

https://github.com/RicardoDominguez/talkie-coder
2•Philpax•1h ago•0 comments

Clojurists Together – Q2 2026 Open Source Funding Announcement

https://www.clojuriststogether.org/news/q2-2026-funding-announcement/
9•dragandj•1h ago•1 comments