news newest ask show jobs

Open Source @Github

fp.

Open in hackernews

Claude Sonnet 5 – benchmark results

https://artificialanalysis.ai/models/claude-sonnet-5

16•lucamark•55m ago

Comments

iLoveOncall•12m ago

Half of the data is missing and the rest is inconsistent between different graphs and sections. Is the benchmark having Sonnet 5 generate the page and seeing how many hallucinations it has?

Tiberium•11m ago

Seems like the model is incredibly inefficient at max reasoning, and even at high/xhigh it uses far more tokens than other models, including Gemini 3.5 Flash, GLM 5.2 and so on. GPT 5.5's efficiency in tokens is still unmatched.

See also: https://cursor.com/cursorbench

trentor•3m ago

Same with opus nothing above medium has a reasonable improvement for the tokens spent.

atemerev•2m ago

Yet another mediocre model. Mostly irrelevant among open weights alternatives. Fable wen.

datakan•1m ago

I'm so sick of Anthropics usage caps and how their model devours tokens.

The Singham Ground Game

https://www.btcpolicy.org/articles/foreign-influence-campaign-against-american-ai-part-ii-singham...

1•takoid•45s ago•0 comments

Claude Code Just Got 5x More Expensive

https://www.vincentschmalbach.com/claude-code-quietly-looks-5x-more-expensive/

1•vincent_s•45s ago•0 comments

From the Epstein Files to Inside the Manosphere

https://pocketproject.org/event/from-the-epstein-files-to-inside-the-manosphere-tending-to-fractu...

1•rendx•1m ago•0 comments

Local Reasoning for Global Properties

https://tratt.net/laurie/blog/2026/local_reasoning_for_global_properties.html

1•mpweiher•3m ago•0 comments

Underappreciated Builtin: Grand Unified Debugger

https://tusharhero.codeberg.page/underappreciated-builtin-gud.html

2•mpweiher•4m ago•0 comments

FDA allows ZYN to sell pouches on health benefits

https://www.axios.com/2026/06/30/fda-zyn-health-marketing

1•cactusplant7374•7m ago•0 comments

Show HN: Mathematical fault-Isolation accelerator engine

https://github.com/PJHkorea/egregore-flat-kernel/blob/main/pure_geodesic_core_test.py

1•PJHkorea•7m ago•1 comments

Where we are with climate change

https://everysingleindicator.com/blog/global-co2-contributors

2•mayosmith•8m ago•1 comments

The fight over stablecoin float interest

https://research.oguzbilgic.com/theses/narrow-bank-breaks

1•obilgic•9m ago•0 comments

Tickler sues FBI to get to bottom of feet

https://taonexus.com/publicfiles/jun2026/tickler-sues-fbi-to-get-to-bottom-of-feet/

1•logicallee•9m ago•0 comments

Most MCP servers don't need to exist. Your case might be an exception

https://evilmartians.com/chronicles/most-mcp-servers-dont-need-to-exist-your-case-might-be-an-exc...

1•nikolay•10m ago•0 comments

Europe's Great AC Debate

https://nymag.com/intelligencer/article/europes-great-ac-debate-highlights-hot-takes-and-hysteria...

1•standardUser•10m ago•0 comments

Reddit Age Verification in Europe

https://support.reddithelp.com/hc/en-us/articles/50368431806484-European-Union-Digital-Services-A...

2•akyuu•11m ago•0 comments

Gitpulse – AI-generated conventional commits from your terminal

https://github.com/erico964-blip/gitpulse

2•erico964-blip•11m ago•0 comments

Show HN: Morph Reflexes – Multi-head classifiers for agent traces

1•bhaktatejas922•12m ago•0 comments

Mir Books – Books from the Soviet Era

https://mirtitles.org

2•clmul•13m ago•0 comments

What I Learned at Boo.com (2000)

https://tnl.net/blog/2000/08/01/business-2-0-what-i-learned-at-boo-com/

1•aleda145•13m ago•0 comments

Too many satellites? Earth's orbit is on track for a catastrophe

https://theconversation.com/too-many-satellites-earths-orbit-is-on-track-for-a-catastrophe-but-we...

1•ColinWright•15m ago•0 comments

I ported Kubernetes to the browser

https://ngrok.com/blog/i-ported-kubernetes-to-the-browser

16•peterdemin•16m ago•2 comments

Show HN: Openleetcode – LeetCode runner where tests live in the repo

https://github.com/therepanic/openleetcode/releases/tag/v1.0.0

2•therepanic•17m ago•0 comments

What you need to know about new customs charges when buying from outside the EU

https://www.rte.ie/news/ireland/2026/0630/1581002-custom-charges-q-a/

1•austinallegro•18m ago•0 comments

It's Time to Reserve Your WhatsApp Username

https://about.fb.com/news/2026/06/its-time-to-reserve-your-whatsapp-username/

2•Markoff•19m ago•2 comments

Rails App Just Got Acquired. Now What?

https://blog.planetargon.com/blog/entries/your-rails-app-just-got-acquired-now-what

2•robbyrussell•19m ago•0 comments

Leanstral 1.5

https://docs.mistral.ai/models/model-cards/leanstral-1-5-26-06

4•vetronauta•19m ago•0 comments

DHH: Basecamp 5, Vibe Coding, and the Future of Rails

https://podcast.rubyonrails.org/2462975/episodes/19335416-dhh-basecamp-5-vibe-coding-and-the-futu...

3•robbyrussell•19m ago•1 comments

Microsoft reveals Windows 11's Docker Replacement WSL Containers to ship soon

https://www.windowslatest.com/2026/06/28/microsoft-denies-wsl-3-exists-reveals-windows-11s-wsl-co...

3•devy•20m ago•1 comments

Rails: 13 Years of Rails at the Auto Shop

https://podcast.rubyonrails.org/2462975/episodes/19319172-nikky-southerland-lessons-from-13-years...

2•robbyrussell•20m ago•1 comments

Magicbookshelf.org – Using AI to create a spoiler-free companion to any book

https://magicbookshelf.org/read/pride-and-prejudice/

1•philipfweiss•21m ago•0 comments

Labelling AI-Generated Content in China

https://ocpl.substack.com/p/labelling-ai-generated-content-in

1•petethomas•21m ago•0 comments

An arbitrary-palette positional dithering algorithm

https://bisqwit.iki.fi/story/howto/dither/jy/

1•fanf2•22m ago•0 comments