ChatGPT could never get a PhD in geography

https://garymarcus.substack.com/p/chatgpt-blows-mapmaking-101
4•garymarcus•3h ago

Comments

garymarcus•3h ago
If you think AI is “smart” or “PhD level” or that it “has an IQ of 120”, take five minutes to read my latest newsletter (link below), as I challenge ChatGPT to the incredibly demanding task of drawing a map of major port cities with above average income.

The results aren’t pretty. 0/5, no two maps alike.

“Smart” means understanding abstract concepts and combining them well, not just retrieving and analogizing in shoddy ways.

No way could a system this wonky actually get a PhD in geography. Or economics. Or much of anything else.

jqpabc123•3h ago
The only thing an LLM does really well is statistical prediction.

As should be expected, sometimes it predicts correctly and sometimes it doesn't.

It's kinda like FSD mode in a Tesla. If you're not willing to bet your life on it (and why would you?), it's really not all that useful.

ben_w•3h ago
I very much appreciate all the ways we're improving our ideas of what "smart" means.

I wouldn't call LLMs "smart" either, but with a different definition than the one you use here: to me, at the moment, "smart" means being able to learn efficiently, with few examples needed to master a new challenge.

This may not be sufficient, but it does avoid any circular arguments about whether any given model would have any "understanding" at all.

rvz•2h ago
> If you think AI is “smart” or “PhD level” or that it “has an IQ of 120”...

It's not there yet, it's still learning™, but a lot of progress in AI has happened recently; I would give them that.

However, as you already point out in your newsletter, there are also lots of misleading and dubious claims, alongside too much hype aimed at raising VC capital, which comes with the overpromising in AI as well.

One of them is the true meaning of "AGI" (right now it is starting to look like a scam), since there are several conflicting definitions directly from those who benefit.

What do you think it truly means given your observations?

senordevnyc•1h ago
It's pretty amusing that we're now at the stage of AI denialism where the goalposts are "AI is only smart if it can get a PhD in an area it hasn't been trained in!"

Looking forward to where we move the goalposts next. Perhaps AI isn't smart because it can't invent a cure for cancer in 24 hours? Or it can't challenge our core understanding of the laws of physics?

Lageos-1 (which is predicted to re-enter the atmosphere in 8.4M years[6])

https://en.wikipedia.org/wiki/LAGEOS
1•Bluestein•1m ago•0 comments

Ticket SideKick – Buy Tickets Online

https://TicketSideKick.com
1•eueeffzek•2m ago•0 comments

Who do Africans trust most? Surveys: it's not the state (more likely the army)

https://theconversation.com/who-do-africans-trust-most-surveys-show-its-not-the-state-more-likely-the-army-252902
1•PaulHoule•3m ago•0 comments

Show HN: LLM-God – An LLM Multiprompting App

https://github.com/czhou578/llm-god/tree/1.0.3
1•czhou578•5m ago•0 comments

Wanna secure US AI leadership? Stop giving the world excuses to buy Chinese

https://www.theregister.com/2025/05/09/tech_titans_wanna_secure_us/
1•Willingham•6m ago•0 comments

Nvidia Sharp In-Network Computing

https://developer.nvidia.com/blog/advancing-performance-with-nvidia-sharp-in-network-computing/
1•tanelpoder•10m ago•0 comments

FedRAMP 20x – One Month in and Moving Fast

https://www.fedramp.gov/2025-04-24-fedramp-20x-one-month-in-and-moving-fast/
2•transpute•20m ago•0 comments

Comma 3X: Initial Impressions

https://beesbuzz.biz/blog/14719-Comma-3X-Initial-impressions
2•surprisetalk•25m ago•0 comments

Polaris is giving free GPUs/CPUs for everyone

https://www.polariscloud.ai/#Home
1•GreenGames•27m ago•0 comments

Domestic Engineer Job Description

https://www.indeed.com/hire/job-description/domestic-engineer
1•RS-232•27m ago•0 comments

FlashMoE: DeepSeek-R1 671B and Qwen3MoE 235B with 1~2 Intel B580 GPU in IPEX-LLM

https://github.com/intel/ipex-llm/blob/main/docs/mddocs/Quickstart/flashmoe_quickstart.md
1•colorant•29m ago•1 comments

The Ewing Conspiracy: Was the 1985 NBA draft rigged? (2015)

https://vault.si.com/vault/2015/05/18/1-patrick-ewing
2•indigodaddy•30m ago•1 comments

AI Agatha Christie on Writing

https://www.bbcmaestro.com/courses/agatha-christie/writing
2•bookofjoe•33m ago•0 comments

Printing Everything and Owning Nothing

https://www.chrbutler.com/2025-05-12
4•delaugust•38m ago•1 comments

Google accidentally leaks Material 3 Expressive UI ahead of Android 16

https://timesofindia.indiatimes.com/technology/tech-news/google-accidentally-reveals-new-android-design-language-material-3-expressive-heres-what-changes-it-may-bring-to-android-16/articleshow/120919076.cms
2•byte-bolter•44m ago•0 comments

AWS to invest $4B in cloud infrastructure in Chile, its 3rd Latam region

https://www.reuters.com/business/energy/amazon-spend-4-billion-cloud-infrastructure-chile-2025-05-07/
3•gray_amps•47m ago•0 comments

System converts fabric images into machine-readable knitting instructions

https://techxplore.com/news/2025-05-fabric-images-machine-readable.html
1•geox•48m ago•1 comments

Google showcases AI coding agent at I/O, plans Gemini chat on XR headsets

https://www.reuters.com/business/google-is-developing-software-ai-agent-ahead-annual-conference-information-2025-05-12/
1•bit_qntum•48m ago•0 comments

Improvements in reasoning AI models may slow down soon, analysis finds

https://techcrunch.com/2025/05/12/improvements-in-reasoning-ai-models-may-slow-down-soon-analysis-finds/
2•GreenGames•49m ago•0 comments

Being Legit: On Impostor Syndrome, Impossible Tech, and the Myth of the Obvious

https://www.tedtanner.org/being-legit-on-impostor-syndrome-impossible-tech-and-the-myth-of-the-obvious/
1•tctjr•50m ago•1 comments

Choice at Different Abstraction Levels

https://www.overcomingbias.com/p/choice-at-different-abstraction-levels
1•jger15•53m ago•0 comments

Show HN: AGI Hits a Structural Wall – A Billion-Dollar Problem

3•mmschlereth•53m ago•0 comments

Last Contact (2007)

https://web.archive.org/web/20080725045740/http://solarisbooks.com/books/newbookscifi/last-contact.asp
2•vermilingua•54m ago•0 comments

How to Reduce AI Coding Errors with a Task Manager

https://shipixen.com/tutorials/reduce-ai-coding-errors-with-taskmaster-ai
2•tortilla•54m ago•0 comments

Confidently Wrong

https://aabiji.github.io/html/wrong.html
4•aabiji•54m ago•0 comments

Show HN: I built an all-in-one feedback system to ship the right features faster

https://upvoicy.com/
1•optinghost•55m ago•0 comments

The Linux Scheduler: A Decade of Wasted Cores [pdf]

https://people.ece.ubc.ca/sasha/papers/eurosys16-final29.pdf
2•aabiji•56m ago•1 comments

Rescinding the Amended Water Use Standards for Residential Dishwashers [pdf]

https://public-inspection.federalregister.gov/2025-08591.pdf
1•impish9208•58m ago•1 comments

Stacked Pull Requests on GitHub

https://github.com/ejoffe/spr
2•pabs3•1h ago•0 comments

Leftwing pundit Hasan Piker: US border agents questioned him on Trump and Gaza

https://www.theguardian.com/us-news/2025/may/12/hasan-piker-border-trump-gaza
4•mitchbob•1h ago•1 comments