frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

JUNO completes liquid filling and begins taking data

https://phys.org/news/2025-08-juno-liquid-neutrino-masses.html
2•jonbaer•1m ago•0 comments

Show HN: Git Well Soon – A beginner's guide to Git with a medical twist

https://github.com/cloudstreet-dev/Git-Well-Soon
1•DavidCanHelp•1m ago•0 comments

AI Lead Generation

https://dealmayker.com/features/ai-lead-generation
2•aleksam•2m ago•1 comments

Framework selling first GPU-upgradable laptop, with Nvidia's blessing

https://www.theverge.com/laptops/765528/framework-is-now-selling-the-first-gaming-laptop-that-let...
1•bsimpson•3m ago•0 comments

15-Fold increase in solar thermoelectric generator performance

https://www.nature.com/articles/s41377-025-01916-9
1•bookofjoe•3m ago•0 comments

Ask HN: Has anyone else used online communities that are archetypically "savvy"?

1•Use•4m ago•1 comments

Make and SQL: An old new way for Data Science workloads

https://vasvir.wordpress.com/2025/08/26/make-sql-an-old-new-way-for-data-science-workloads/
1•vasvir•4m ago•0 comments

Why America Still Needs Punk Rock

https://www.currentaffairs.org/news/why-america-still-needs-punk-rock
1•XzetaU8•5m ago•0 comments

More on Seed Phrase Words

https://www.johndcook.com/blog/2025/08/26/seed-phrase-words-2/
2•ibobev•5m ago•0 comments

Apple Event on September 9: 'Awe Dropping'

https://www.macrumors.com/2025/08/26/apple-september-2025-event/
1•Bogdanp•5m ago•0 comments

Novelty Is the Secret Ingredient to Product Success, Thriving Teams,Happiness

https://spin.atomicobject.com/novelty-secret-ingredient/
1•philk10•9m ago•0 comments

Show HN: Enterprise MCP Bridge – Solving the MCP Chaos for IT

https://blog.inxm.ai/p/enterprise-it-cant-afford-mcp-chaosheres
3•raelmiu•11m ago•0 comments

Principles of great DX for data infrastructure

https://clickhouse.com/blog/eight-principles-of-great-developer-experience-for-data-infrastructure
1•craneca0•13m ago•0 comments

Delta Lake: Transform Pandas Prototypes into Production

https://codecut.ai/from-pandas-to-production-delta-rs/
2•Ben5554•14m ago•0 comments

Google says China-linked cyber operations targeted Southeast Asia diplomats

https://www.cnn.com/2025/08/26/business/google-china-linked-hacking-southeast-asia-diplomats-intl...
1•mooreds•15m ago•0 comments

Intel and the Foundry State of Play

https://d2d.substack.com/p/d2d-contd-intel-and-the-foundry-state
1•mooreds•15m ago•0 comments

Titles Matter

https://joshcollinsworth.com/blog/titles-matter
2•speckx•16m ago•0 comments

What It Means to Choose Life

https://www.nytimes.com/2025/08/24/opinion/assisted-suicide-canada-orchid-embryos.html
1•whack•18m ago•0 comments

Tomorows Growth Starts with Todays

https://adia.substack.com/p/tomorrows-growth-starts-with-todays
1•jemiluv8•19m ago•1 comments

Anthropic Settles Copyright Lawsuit

https://www.courtlistener.com/docket/70991505/26/bartz-et-al-v-anthropic-pbc/
1•miohtama•19m ago•0 comments

Type Inference for Plain Data

https://www.haskellforall.com/2025/08/type-inference-for-plain-data.html
1•fanf2•20m ago•0 comments

Show HN: My Financial Pal – Free AI-Powered Personal Financial Planner

https://my-financial-pal-baf4b5e07c1c.herokuapp.com/
1•shormigo•21m ago•1 comments

Michigan Supreme Court: Unrestricted Phone Searches Violate Fourth Amendment

https://reclaimthenet.org/michigan-supreme-court-rules-phone-search-warrants-must-be-specific
22•mikece•25m ago•4 comments

Understanding Neural Networks, Visually

https://visualrambling.space/neural-network/
1•LordNibbler•25m ago•0 comments

SuperNICs Explained and Compared to DPUs

https://www.technetbooks.com/2025/08/supernics-network-accelerator-for.html
1•tanelpoder•26m ago•0 comments

Britain's datacentre boom promises growth- Ireland's grid crisis shows the costs

https://nearlyright.com/britains-data-centre-boom-promises-growth-but-irelands-grid-crisis-shows-...
3•indigodaddy•27m ago•0 comments

Squarespace Is Down

https://status.squarespace.com
2•gkolli•27m ago•0 comments

Detecting colorectal cancer with gut bacteria and AI

https://www.rts.ch/info/sciences-tech/2025/article/une-ia-detecte-90-des-cas-de-cancer-colorectal...
2•speckx•28m ago•0 comments

Serviz: Command Object Interface for Ruby

https://github.com/markets/serviz
1•thunderbong•29m ago•0 comments

Google Gemini's AI image model gets a 'bananas' upgrade

https://techcrunch.com/2025/08/26/google-geminis-ai-image-model-gets-a-bananas-upgrade/
1•breadwinner•29m ago•1 comments
Open in hackernews

GPT5 is the best coding LLM because other LLMs admit it?

1•adinhitlore•2h ago
So I vibe-code a lot these days and recently i decided to give the same prompt to several llms, then get their codes and later give each code to every single one of them to ask which one they think is the most useful without telling them that they or the other 2 llms wrote it. The overall consensus is: gpt5. True I only compared gpt5 vs claude 4.1 vs qwen 230bn. OSS 120b, gemini and grok 4 were excluded since well i don't have the time. And obvious failures like amazon nova or anything from meta weren't even planned. Deepseek (both) seem a bit underperforming . Personally I'd say it's a close call between claude opus 4.1 vs both gpt4 and gpt5 (ironically gpt5 sometimes performs worse than 4, i think this has been addressed by many people already). That's just my personal experience, i know HumanEwal or SWE or whatever give various performance but idk, Musk used the benchmarks as "proof" to hype Grok and in my experience grok 4 is between LLAMA4 and obviously behind gpt4 or some variations of qwen.

Again this is coding only: Python and C. For physics, chemistry, scifi novels or whatever the case may be very different. Another kudos to OSS 120bn btw: it's very generous on tokens...like it will write a small programming book if it takes to in one reply, unless of course you tell it to be more limited, this is a huge plus for me since the code I demand should be complex and not some 20 lines nova "pro" joke.

Comments

incomingpain•1h ago
all ive done with gpt5 for coding was a major db refactor. i had run out of gemini limit for the day.

certainly got the job done. I doubt my gpt 20b or ~30b local llm would have been as capable. Overall it was about ~2000 lines of code to change, probably only 100,000 context.

gpt5 didnt one shot it. there were many steps inbetween. At the end, few hours, i had >50 linter warnings from tripled imports, loads of dead code that wouldnt be touched and for some reason gpt5 just couldnt fix any of this. Ended up increasing the warnings and added an error. My expectation is that any of the big guys could immediately fix it. Even restarted fresh context and gpt just wasnt having any of it. im certain even gpt 20b would have completed it in a minute. Curious.

I went to gemini flash, very generic prompt about linter warnings and it fixed it in 30 seconds.

Just kind of weirdness that benchmarks will never be able to catch. It's also going to be very dependent. A rust programmer might have a favourite, whereas python programmer benefits from another model. There can never be a best.

adinhitlore•1h ago
I had similar experience, usually I'd ignore Gemini be it flash or pro but on several occasions it fixed complex errors like it's nothing. Yet when it comes to codegen it is "cheap" on tokens and struggles outputting complex logic. As a great bonus: their easy to setup API is freemium but a generous freemium (google AI studio I mean). My "ecosystem" atm will be something like: gpt5, claude 4.1 - if they both fail: try to fix with gemini. I'd skip Grok for privacy issues mostly not that I completely ignore its capabilities, qwen is good but sometimes 'overengineered' i don't need 400bn , given the large params maybe it will work for non-coding like if you ask it some exotic questions about science: casimir effect, acoustic levitation, ununennium etc etc you name it.
zahlman•1h ago
> recently i decided to give the same prompt to several llms, then get their codes and later give each code to every single one of them to ask which one they think is the most useful without telling them that they or the other 2 llms wrote it.

The fact that you expect the result of this experiment to be useful, is more interesting than the actual result.

adinhitlore•1h ago
vibe-coding is the future, drop conservatism....'free palestine' i mean you get the idea: be progressive and open minded.
pavel_lishin•1h ago
Those seem like completely orthogonal concepts.