frontpage.

Looking for community input on current model choice for "thinking-partner" use — back-and-forth discussions about workflow design, architecture, trade-offs.

For context, I have been using Opus 4.6 via Perplexity for this in the past few months and I think it was excellent, fair pushback/ counterarguments, reasonable suggestions and discussion. Now with the new Opus 4.7, I notice it is now much more verbose, more sycophantic, and quite often confidently making statements that are wrong and without evidence.

I think in performing actual coding tasks it is great if not even slightly better, but the gap in thinking and discussion is really felt. Previously I used GPT, Gemini, and Grok too but they dont feel as productive as my Opus 4.6 experience.

A few questions - Is Opus 4.7 still the best default model for this task? - Is this solvable via system prompts or alternative setup, or what's the correct way to to think about it? - More broadly speaking, model changes and updates every few months, so how actually can we "lock-in" a reasonable setup?

PGO Build TPC-C Analysis MariaDB v11.8.6 TideSQL

LLMs – What Experienced Practitioners See

Ask HN: How does Google crawls x.com website?

Games for Change

What Anthropic's Mythos Means for the Future of Cybersecurity

Refuse to let your doctor record you

Space Reactor 1

Intel stock hits new all-time highs for first time since 2000

Content credentials – hardware signing of photo and video cameras

Why I'm Done Making Desktop Applications

Six things I'll remember when I think about Tim Cook's version of Apple

UK Biobank health data listed for sale in China, government confirms

What I learned asking 11 AI models to grade each other's AI predictions

Why High-Testosterone Men Don't Perform for the Crowd

El Salvador Adds New Tools in National Health App to Track and Treat (DoctorSV)

Asia's Billionaires Are Bankrolling a Push for More Babies

Rubbing testosterone gel on men's upper arms eliminates the audience effect

Show HN: Tarot Down Detector – a status page as a tarot reading

The Design of Design – Gordon L. Glegg(1969)

You know what consciousness is: you live in soul land

Unfounded Health Concerns Are Powering a Solar Backlash

Printing a Check for Free

Ask HN: Anyone else get fatigued by interaction with LLMs?

Show HN: PrivateClaw – AI agents running in confidential VMs you can verify

NMail Is Neat

Cloudflare Agents Week: Infrastructure for Running AI Agents at Scale

The Fall of the Theorem Economy

Anthropic's product team moves faster than anyone else [video]

A good AGENTS.md is a model upgrade. A bad one is worse than no docs at all

Show HN: TurbineFi – Build, Backtest, Deploy Prediction Market Strategies

Ask HN: What's your current go-to LLM for "thinking-partner"?