frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

XMLUI

https://blog.jonudell.net/2025/07/18/introducing-xmlui/
170•mpweiher•3h ago•89 comments

Coding with LLMs in the summer of 2025 – an update

https://antirez.com/news/154
193•antirez•6h ago•168 comments

The old Caveman Chemistry website (1996-2000)

https://cavemanchemistry.com/oldcave/
35•marcodiego•2h ago•4 comments

A Tour of Microsoft's Mac Lab (2006)

https://davidweiss.blogspot.com/2006/04/tour-of-microsofts-mac-lab.html
117•ingve•6h ago•16 comments

LLM architecture comparison

https://magazine.sebastianraschka.com/p/the-big-llm-architecture-comparison
232•mdp2021•10h ago•15 comments

Digital vassals? French Government 'exposes citizens' data to US'

https://brusselssignal.eu/2025/07/digital-vassals-french-government-exposes-citizens-data-to-us/
83•ColinWright•5h ago•16 comments

Show HN: Conductor, a Mac app that lets you run a bunch of Claude Codes at once

https://conductor.build/
21•Charlieholtz•3d ago•12 comments

Async I/O on Linux in databases

https://blog.canoozie.net/async-i-o-on-linux-and-durability/
141•jtregunna•10h ago•61 comments

A human metaphor for evaluating AI capability

https://mathstodon.xyz/@tao/114881418225852441
97•bertman•8h ago•12 comments

Speeding Up My ZSH Shell

https://scottspence.com/posts/speeding-up-my-zsh-shell
3•saikatsg•1h ago•0 comments

How Tesla is proving doubters right on why its robotaxi service cannot scale

https://www.aol.com/elon-gambling-tesla-proving-doubters-090300237.html
134•Bluestein•3h ago•286 comments

Show HN: A handpicked directory to help founders find great design studios

https://finddesignagency.com/
12•iamarnob6543•3d ago•2 comments

Laminar Flow Airfoil

http://www.aviation-history.com/theory/lam-flow.htm
4•colinprince•2d ago•0 comments

I'm betting against AI agents, despite building them

https://utkarshkanwat.com/writing/betting-against-agents/
286•Dachande663•8h ago•161 comments

Show HN: MCP server for Blender that builds 3D scenes via natural language

https://blender-mcp-psi.vercel.app/
109•prono•11h ago•44 comments

Show HN: ggc – A terminal-based Git CLI written in Go

https://github.com/bmf-san/ggc
39•bmf-san•4d ago•33 comments

Hungary's oldest library is fighting to save books from a beetle infestation

https://www.npr.org/2025/07/14/nx-s1-5467062/hungary-library-books-beetles
162•smollett•4d ago•24 comments

The landlord gutting America’s hospitals

https://www.motherjones.com/politics/2025/07/the-landlord-gutting-americas-hospitals/
33•hhs•1h ago•13 comments

How the 'Minecraft' Score Became Big Business for Its Composer

https://www.billboard.com/pro/how-minecraft-score-became-big-business-for-composer/
54•tunapizza•4d ago•26 comments

Make Your Own Backup System – Part 1: Strategy Before Scripts

https://it-notes.dragas.net/2025/07/18/make-your-own-backup-system-part-1-strategy-before-scripts/
312•Bogdanp•21h ago•99 comments

Death by AI

https://davebarry.substack.com/p/death-by-ai
472•ano-ther•1d ago•182 comments

Replit AI deletes entire database during code freeze, then lies about it

https://twitter.com/jasonlk/status/1946069562723897802
55•FiddlerClamp•3h ago•9 comments

Dual interfacial H-bonding-enhanced deep-blue hybrid copper–iodide LEDs

https://www.researchsquare.com/article/rs-4114691/v1
8•gnabgib•3d ago•1 comments

Robot metabolism: Toward machines that can grow by consuming other machines

https://www.science.org/doi/10.1126/sciadv.adu6897
29•XzetaU8•8h ago•17 comments

Nobody knows how to build with AI yet

https://worksonmymachine.substack.com/p/nobody-knows-how-to-build-with-ai
461•Stwerner•1d ago•362 comments

Behind the ballistics of the 'explosive' squirting cucumber

https://phys.org/news/2025-07-ballistics-explosive-squirting-cucumber.html
41•PaulHoule•2d ago•6 comments

The bewildering phenomenon of declining quality

https://english.elpais.com/culture/2025-07-20/the-bewildering-phenomenon-of-declining-quality.html
317•geox•8h ago•553 comments

I tried vibe coding in BASIC and it didn't go well

https://www.goto10retro.com/p/vibe-coding-in-basic
148•ibobev•4d ago•156 comments

Beyond Meat fights for survival

https://foodinstitute.com/focus/beyond-meat-fights-for-survival/
150•airstrike•17h ago•390 comments

How to run an Arduino for years on a battery (2021)

https://makecademy.com/arduino-battery
92•thunderbong•3d ago•27 comments
Open in hackernews

A human metaphor for evaluating AI capability

https://mathstodon.xyz/@tao/114881418225852441
97•bertman•8h ago

Comments

chronic0262•3h ago
> Related to this, I will not be commenting on any self-reported AI competition performance results for which the methodology was not disclosed in advance of the competition.

what a badass

amelius•3h ago
Yes, I think it is disingenuous of OpenAI to make ill-supported claims about things that can affect us in important ways, having an impact on our worldview, and our place in the world as an intelligent species. They should be corrected here, and TT is doing a good job.
svat•3h ago
Great set of observations, and indeed it's worth remembering that the specific details of assistance and setup make a difference of several orders of magnitude. And ha, he edited the last post in the thread to add this comment:

> Related to this, I will not be commenting on any self-reported AI competition performance results for which the methodology was not disclosed in advance of the competition. (3/3)

(This wasn't there when I first read the thread yesterday 18 hours ago; it was edited in 15 hours ago i.e. 3 hours later.)

It's one of the things to admire about Terence Tao: he's always insightful even when he comments about stuff outside mathematics, while always having the mathematician's discipline of not drawing confident conclusions when data is missing.

I was reminded of this because of a recent thread where some HN commenter expected him to make predictions about the future (https://news.ycombinator.com/item?id=44356367). Also reminded of Sherlock Holmes (from A Scandal in Bohemia):

> “This is indeed a mystery,” I remarked. “What do you imagine that it means?”

> “I have no data yet. It is a capital mistake to theorize before one has data. Insensibly one begins to twist facts to suit theories, instead of theories to suit facts.”

Edit: BTW, seeing some other commentary (here and elsewhere) about these posts is very disappointing — even when Tao explicitly says he's not commenting about any specific claim (like that of OpenAI), many people seem to be eager to interpret his comments as being about that claim: people's tendency for tribalism / taking “sides” is so great that they want to read this as Tao caring about the same things they care about, rather than him using the just-concluded IMO as an illustration for the point he's actually making (that results are sensitive to details). In fact his previous post (https://mathstodon.xyz/@tao/114877789298562646) was about “There was not an official controlled competition set up for AI models for this year’s IMO […] Hopefully by next year we will have a controlled environment to get some scientific comparisons and evaluations” — he's specifically saying we cannot compare across different AI models so it's hard to say anything specific, yet people think he's saying something specific!

johnecheck•3h ago
My thoughts were similar. OpenAI, very cool result! Very exciting claim! Yet meaningless in the form of a Twitter thread with no real details.
roxolotl•2h ago
This does a great job illustrating the challenges with arguing over these results. Those in the agi camp will argue that the alterations are mostly what makes the ai so powerful.

Multiple days worth of processing, cross communication, picking only the best result? That’s just the power of parallel processing and how they reason so well. Altering to a more standard prompt? Communicating with a more strict natural language helps reduce confusion. Calculator access and the vast knowledge of humanity built in? That’s the whole point.

I tend to side with Tao on this one but the point is less who’s right and more why there’s so much arguing past each other. The basic fundamentals of how to judge these tools aren’t agreed upon.

johnecheck•1h ago
Would be nice if we actually knew what was done so we could discuss how to judge it.

That recent announcement might just be fluff or might be some real news, depending. We just don't know.

I can't even read into their silence - this is exactly how much OpenAI would share in the totally grifting scenario and in the massive breakthrough scenario.

algorithms432•52m ago
Well, they deliberately ignored the requests of IMO organizers to not publish AI results for some time (a week?) to not steal the spotlight from the actual participants, so clearly this announcement's purpose is creating hype. Makes me lean more towards the "totally grifting" scenario.
bgwalter•31m ago
Amazing. Stealing the spotlight from High School students is really quite something.

I'm glad that Tao has caught on. As an academic it is easy to assume integrity from others but there is no such thing in software big business.

bluefirebrand•18m ago
> As an academic it is easy to assume integrity from others

I'm not an academic, but from the outside looking in on academia I don't think academics should be so quick to assume integrity either

There seems to be a lot of perverse incentives in academia to cheat, cut corners, publish at all costs, etc

griffzhowl•1h ago
> Calculator access and the vast knowledge of humanity built in? That’s the whole point.

I think Tao's point was that a more appropriate comparison between AI and humans would be to compare it with humans that have calculator/internet access.

I agree with your overall point though: it's not straighforward to specify exactly what would be an appropriate comparison

largbae•2h ago
I feel like everyone who treats AGI as "the goal" is wasting energy that could be applied towards real problems right now.

AI in general has given humans great leverage in processing information, more than we have ever had before. Do we need AGI to start applying this wonderful leverage toward our problems as a species?