NoLiMa [0] and "context rot" [1] suggest that with a ~165k-token request, Opus at 200k would degrade badly while Opus at 1M would fare better (since a smaller fraction of the context window is used)... but they're supposedly the same model, right? Yet there could be practical differences in inference deployment (routing, serving configuration) that change the picture entirely. I am so confused.
Anthropic says it's the same model [2]. But Claude Code's own source treats them as distinct variants with separate routing [3]. The closest test I found [4] asserts they're identical below 200K, but it never actually A/B tests them, correct?
Inside Claude Code it's probably not testable, right? According to this issue [5], the CLI is non-deterministic for identical inputs, and agent sessions branch on tool use. So you'd need a clean API-level test.
The API-level answer is what I really want, for the Claude-based features in my own apps. Is there a real benchmark for this?
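The kind of A/B test I have in mind would be roughly this sketch (the model alias, the `context-1m-2025-08-07` beta flag, and the `agreement` metric are my assumptions from skimming the SDK docs, not anything Anthropic has confirmed; whether that flag even applies to Opus is part of my question):

```python
from collections import Counter

def agreement(a: list[str], b: list[str]) -> float:
    """Multiset overlap between two lists of sampled completions (1.0 = identical)."""
    shared = sum((Counter(a) & Counter(b)).values())
    return shared / max(len(a), len(b))

def sample(client, model: str, prompt: str, use_1m_beta: bool, n: int = 20) -> list[str]:
    """Draw n completions at temperature 0 from one arm of the test.

    The only difference between the two arms is the beta header requesting
    the 1M context window (flag name is an assumption -- check Anthropic's
    docs for the current value)."""
    outs = []
    for _ in range(n):
        kwargs = dict(
            model=model,
            max_tokens=256,
            temperature=0,
            messages=[{"role": "user", "content": prompt}],
        )
        if use_1m_beta:
            msg = client.beta.messages.create(betas=["context-1m-2025-08-07"], **kwargs)
        else:
            msg = client.messages.create(**kwargs)
        outs.append(msg.content[0].text)
    return outs

def run_experiment(model: str = "claude-opus-4-1") -> float:
    """Not run here: needs ANTHROPIC_API_KEY and a real ~165k-token prompt."""
    import anthropic  # third-party SDK: pip install anthropic
    client = anthropic.Anthropic()
    prompt = "..."  # a fixed long-context needle/retrieval prompt goes here
    base = sample(client, model, prompt, use_1m_beta=False)
    beta = sample(client, model, prompt, use_1m_beta=True)
    return agreement(base, beta)
```

Since temperature 0 still isn't fully deterministic, a single mismatch proves nothing; repeated sampling and comparing the overlap across arms (against a same-arm baseline) seems like the minimum rigor. Is anyone aware of this having been done?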
I've reached the limits of my understanding on this problem. If what I'm trying to ask makes any sense, any help would be greatly appreciated.
If anyone could help me ask the question better, that would also be appreciated.
[0] https://arxiv.org/abs/2502.05167
[1] https://research.trychroma.com/context-rot
[2] https://claude.com/blog/1m-context-ga
[3] https://github.com/anthropics/claude-code/issues/35545
[4] https://www.claudecodecamp.com/p/claude-code-1m-context-window
[5] https://github.com/anthropics/claude-code/issues/3370