GPT-5 goes hard on real-world programming

https://www.omerba.dev/blog/gpt-5-evtx-zig-parser
8•obenamram•1h ago

Comments

pyman•49m ago
I used GPT-5 as my personal lawyer the other day. I uploaded contracts and agreements, and the number of things it got wrong was mind-blowing. It made me look like a complete amateur with the emails it drafted. GPT-5 is extremely dumb and incompetent when it comes to understanding real-world problems and offering solutions. It wasn't even at the level of a junior lawyer; it was ten times worse. I was honestly shocked by the results.

When it comes to programming, I have to keep replying with "Nope", "No", "Again", "Wrong", "It doesn't work" and, a couple of times, "Do better" before it finally produces something complex that actually works.

With coding at least I can tell when it gets things wrong. The real problem is when you can't.

ohr•42m ago
But if you let it run the tests, it’ll just do that on its own (or delete the tests, but usually it won’t), which is very useful and seems to match the article’s sentiment.

pyman•28m ago
Test-driven development is back.

Since LLMs rely on patterns learned from large amounts of text, the results are relative. If the training data contained more Rust repos, that could explain why it feels stronger in Rust.

The way AI companies talk about "intelligence" now is shifting. They admit LLMs can't truly reason with the current architecture, so intelligence is being framed as the ability to solve problems using patterns learned from text, not reasoning on their own. That's a big downgrade from the original idea of AI reaching human-level thinking and developing AGI.

Also, my understanding is that since Microsoft invests in Copilot, it doesn't want ChatGPT to get better at coding. Instead, it wants it to get better at being a lawyer.

Capricorn2481•2m ago
The creator of Zig did not seem to care for the results:

> this zig code is trash

https://lobste.rs/s/1qr3zy/gpt_5_goes_hard_on_real_world_pro...

Java for small coding tasks (JavaOne '25) [video]

https://www.youtube.com/watch?v=04wFgshWMdA
1•znpy•57s ago•0 comments

How to Create a Modal With htmx Only [video]

https://www.youtube.com/watch?v=atPFBgUP88k
1•indigodaddy•2m ago•0 comments

Commodore is back from the dead with a new C64. But is it new? [video]

https://www.youtube.com/watch?v=qz8EzWTb4so
1•amichail•2m ago•0 comments

Elon Musk's "thermonuclear" Media Matters lawsuit may be fizzling out

https://arstechnica.com/tech-policy/2025/08/elon-musks-thermonuclear-media-matters-lawsuit-may-be-fizzling-out/
1•duxup•2m ago•0 comments

Ask HN: Does Anyone Use Bitchat?

1•tmaly•4m ago•0 comments

BeeGFS – No matter what, it just stays alive.

https://www.beegfs.io/c/
1•rbanffy•5m ago•0 comments

Malicious LLM-Based Conversational AI Makes Users Reveal Personal Information

https://arxiv.org/abs/2506.11680
2•croes•5m ago•0 comments

How to remove "stuck" iCloud Tabs in Safari

https://manualdousuario.net/en/how-to-remove-stuck-icloud-tabs-in-safari/
1•rpgbr•10m ago•0 comments

MetaScope 1.0 Revolutionizes Comprehensive Metadata Management

https://www.zalodesignstudio.com/portfolio/metascope/press-releases/metascope-press-release/
2•gzaal•11m ago•1 comments

Ask HN: When is the AI bubble going to crash?

1•gashmol•12m ago•2 comments

AGI's Moving Finish Line

https://www.signalfire.com/blog/gold-at-the-math-olympiad-agis-moving-finish-line
3•zviugfd•13m ago•0 comments

Which are the deadliest European cities in a heatwave?

https://www.economist.com/interactive/graphic-detail/2025/08/15/which-are-the-deadliest-european-cities-in-a-heatwave
2•f_allwein•13m ago•0 comments

Texas AG accuses Meta, Character.AI of misleading kids with mental health claims

https://techcrunch.com/2025/08/18/texas-attorney-general-accuses-meta-character-ai-of-misleading-kids-with-mental-health-claims/
2•speckx•13m ago•0 comments

LLM from scratch, part 18 – residuals, shortcut connections, and the Talmud

https://www.gilesthomas.com/2025/08/llm-from-scratch-18-residuals-shortcut-connections-and-the-talmud
1•gpjt•15m ago•0 comments

T-Mobile claimed selling location data without consent is legal–judges disagree

https://arstechnica.com/tech-policy/2025/08/t-mobile-claimed-selling-location-data-without-consent-is-legal-judges-disagree/
13•Bender•15m ago•0 comments

Talk Python To Me Live Stream – 20 Years of Django with its creators [video]

https://www.youtube.com/watch?v=qlYCUI5T_Bk
1•frankwiles•18m ago•0 comments

Cycling's governing body is introducing new rules to slow down elite riders

https://theconversation.com/cyclings-governing-body-is-introducing-new-rules-to-slow-down-elite-riders-not-everyones-happy-260917
1•PaulHoule•20m ago•0 comments

Show HN: Chroma Cloud – serverless search database for AI

https://trychroma.com/cloud
5•jeffchuber•20m ago•0 comments

All ATC message routing in Germany was done through Emacs (2021)

https://old.reddit.com/r/emacs/comments/lly7po/comment/gnvzisy/
1•xrayarx•21m ago•0 comments

The chatbot's mental health break

https://newslttrs.com/the-chatbots-mental-health-break/
1•spzb•21m ago•0 comments

Ask HN: SaaS Bookkeeping and Accounting

1•sp3ktrum•24m ago•1 comments

Show HN: Open-source Next.js project management app

https://www.completics.co/
1•maxim-fin•24m ago•0 comments

Enzo Ferrari: The Definitive Biography of an Icon

https://www.lrb.co.uk/the-paper/v47/n14/thomas-jones/lunch-with-mussolini
1•mitchbob•25m ago•1 comments

Transition from legacy (Google Duo) calls to the new Meet call experience

https://support.google.com/meet/answer/15226792?hl=en
1•tech234a•25m ago•0 comments

IgniteTech CEO would lay off 80% of staff again if they refused to adopt AI

https://fortune.com/2025/08/17/ceo-laid-off-80-percent-workforce-ai-sabotage/
2•krallja•35m ago•1 comments

ARMing GPUs: On the Memory Subsystem of Grace Hopper GH200 [video]

https://www.youtube.com/watch?v=rM6zmDCVVhM
3•matt_d•39m ago•0 comments

Far Out Company – artists of 60s counterculture

https://faroutcompany.com/
2•_rpxpx•40m ago•0 comments

Drawing: The overlooked universal HCI primitive

https://notnotrishi.substack.com/p/drawing-the-overlooked-universal
3•notnotrishi•40m ago•2 comments

List or get off the pot: Auditors demand gov improve IT reporting or give it up

https://www.theregister.com/2025/08/18/gao_it_data_management/
4•rntn•42m ago•0 comments

Subsea pods use ocean pressure to produce fresh water

https://www.oceanwellwater.com/water-farms
5•geox•44m ago•0 comments