frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Anthropic researchers discover thinking longer sometimes makes models dumber

https://venturebeat.com/ai/anthropic-researchers-discover-the-weird-ai-problem-why-thinking-longer-makes-models-dumber/
5•reasonableklout•1d ago

Comments

reasonableklout•1d ago
Also see the paper "Inverse Scaling in Test-Time Compute": https://arxiv.org/abs/2507.14417
mhrmsn•1d ago
No more "ultrathink", got it :)
duxup•1d ago
This is not my area of expertise but I'm going to try to ask an intelligent question.

For me when I use an LLM for coding the value and accuracy comes from the conversation, I give context, I evaluate the response, I give more context and prompts, sometimes I correct it and get it back on task, and ... we get there (well not always but generally).

The idea being our interactions are what gets us towards the answer I was looking for / the "right" answer.

I imagine that without me being in the loop and the LLM given lots more time it's going to take those original words of mine, do it's "word math" (I like to think of it that way) as much as possible and ... maybe go down a rabbit hole that I wasn't headed to, possibly way down.

Is that rabbit hole kinda scenario what they're talking about?

Is it also possible that because of their training data, Q & A and Q and context and Q and A is just the better path than deep thoughts that produced some of that content?

I also wonder maybe just because I'm a simpleton, and for me the "right" answer really was the simplest first.

techpineapple•1d ago
I wonder if this is similar to the idea that when your taking a test you should most often go with your initial answer, and not change your answer.

You won't believe what this AI said after deleting a database

https://smallcultfollowing.com/babysteps/blog/2025/07/24/collaborative-ai-prompting/
1•mfrw•4m ago•1 comments

Treat Your Spouse as an Investor

https://www.skmurphy.com/blog/2024/01/20/treat-your-spouse-as-an-investor/
1•skmurphy•4m ago•0 comments

Bereshit R* Analytics

https://preview--bereshit-ram-analytics.lovable.app/
1•dannyrosen•9m ago•0 comments

First permanent Pokemon theme park to open in Tokyo

https://www.rte.ie/news/business/2025/0723/1524891-first-pokemon-theme-park-to-open/
1•austinallegro•11m ago•0 comments

Show HN: NeoArchive – Offline Android File Manager with 25 Tools

https://play.google.com/store/apps/details?id=com.tool4file.neo_archive&hl=en_US
1•Quoriath•13m ago•0 comments

Ask HN: How to report an UX issue/improvement to Uber?

1•loopion•13m ago•0 comments

Show HN: I'm Tired Now

1•pratikpatwe•14m ago•0 comments

Show HN: Watch babies being born worldwide on the live Baby Map – babymap.org

https://babymap.org
1•jsamqiu•24m ago•0 comments

Pesntester that want's to live in NL

1•Dark-shadow•28m ago•0 comments

Fossils unearthed in Grand Canyon reveal new details of Cambrian explosion

https://www.cnn.com/2025/07/24/science/grand-canyon-fossils-goldilocks-cambrian-explosion
2•mooreds•31m ago•0 comments

Where Are Vacation Homes Located in the US?

https://www.construction-physics.com/p/where-are-vacation-homes-located
1•JumpCrisscross•33m ago•0 comments

Tap into the "Hemingway effect" to finish what you start

https://bigthinkmedia.substack.com/p/tap-into-the-hemingway-effect-to
2•diwank•38m ago•0 comments

Show HN: I built a SaaS that makes creating TikTok/IG slideshows dead easy

https://www.slideshowgen.com
1•waynedev9598•39m ago•1 comments

Against the Censorship of Adult Content by Payment Processors

https://soatok.blog/2025/07/24/against-the-censorship-of-adult-content-by-payment-processors/
18•SlackingOff123•53m ago•4 comments

Google ordered to pay Argentine pictured naked in garden

https://www.batimes.com.ar/news/argentina/google-ordered-to-pay-argentine-pictured-naked-in-garden.phtml
2•mgarciaisaia•1h ago•0 comments

Judge Scraps Opinion After Lawyer Flags Made-Up Quotes

https://news.bloomberglaw.com/business-and-practice/judge-withdraws-pharma-opinion-after-lawyer-flags-made-up-quotes
2•1vuio0pswjnm7•1h ago•1 comments

Show HN: Blueboots – A retro themed Fedora OS built with one Containerfile

https://github.com/bluebootsy/os
2•twelvenmonkeys•1h ago•0 comments

Efrit: A native elisp coding agent running in Emacs

https://github.com/steveyegge/efrit
1•simonpure•1h ago•0 comments

Show HN: I built a notion ai agent

https://www.youtube.com/watch?v=Uu3Np3bG9v4
1•ifeanyi_sa•1h ago•0 comments

Ask HN: Should HN introduce a "Tell HN" tab?

2•bhag2066•1h ago•2 comments

Distro-Hopping and RICEing

https://l-o-o-s-e-d.net/distro-hopping
2•l00sed•1h ago•2 comments

Show HN: AI image generator with 6 artistic mentors for better prompts

https://createvision.ai
1•yestwind•1h ago•0 comments

Equilibrium in the Embedding Space: When Novelty Becomes Familiar

https://lightcapai.medium.com/equilibrium-in-the-embedding-space-when-novelty-becomes-familiar-547862bdd38f
1•WASDAai•1h ago•0 comments

Show HN: Add viral TikTok audio to work meetings

https://soundboard.recall.ai/
2•saporito•1h ago•0 comments

RustMailer – Week 1 Update: 729 Views, 165 Clones, 13 Stars (in 9 Days)

https://www.indiehackers.com/post/rustmailer-week-1-update-729-views-165-clones-13-stars-in-9-days-9zDlC2HmjXFH7mKzcmpb
1•rustmailer•1h ago•0 comments

Good Docs Describe, Bad Docs Prescribe

https://rethinkingsoftware.substack.com/p/good-docs-describe-bad-docs-prescribe
2•aard•1h ago•1 comments

Show HN: Crawell – Extract any page as Markdown or download images in bulk

https://chromewebstore.google.com/detail/crawell/cmfcognoilmabnclomeehljmknallaaa
1•kamjin•1h ago•0 comments

Running Serverless WASM Functions on the Edge with K3s and SpinKube

https://www.fermyon.com/blog/spinkube-k3s
2•breve•1h ago•0 comments

Asciinema: Record and share your terminal sessions

https://asciinema.org
25•phendrenad2•1h ago•3 comments

Chinese drones carry 180ton of steel and concrete up mountain in pioneering feat

https://www.scmp.com/news/china/politics/article/3319460/chinese-drones-carry-180-tonnes-steel-and-concrete-mountain-pioneering-feat
6•xbmcuser•1h ago•1 comments