frontpage.

I have been a long time user of Claude Code. I never felt the need to try other coding tools, and the times I did, I always came back. Across many projects, personal and professional, Claude has been my pair programmer. I have enough experience across generations of Claude models that, just by looking at the output, I can tell when a model switch has happened, strictly in the context of coding tasks. I have noticed models lean on some aspects heavily like bullet points or emojis or markdown tables or ascii diagrams or just a walls of text (4.8). Following the Claude Code 4.7 and 4.8 releases, the performance is immensely worse. Here is how I qualify "worse":

1. Coding tasks: they don't follow instructions at all. They consistently miss close to a quarter of the ask, which cascades into bloat over time. 2. Reviewing existing code: they literally fail to read the code or use the tools properly.

Both of these have, in my opinion, eroded the time saving value of the tool. Hard opinion: real code review is harder than coding.

So I tried Codex. This is my first time, so I have only a little experience with it, but the distinction is clear. Codex is remarkably precise about the exact changes I need, reliable nearly 95% of the time. What it lacks is the flair, the bombastic ideas and presentation that Claude has. I use Claude to discuss ideas, it gives me great variety and draws ascii block diagrams Codex never could. But Codex is the most reliable coding agent.

Suggestion: Don't trust Claude when it says it finished something :) Always review it, at least post-4.6.

Is this just my experience, or do others feel the same?

It's time to fly – Codex [video]

A Man Who Reads Books for a Living (One Every Two Days)

Show HN: CLI for crawling documentation sites into Markdown with defuddle

The Approach to Equilibrium

Revealing the Frontier with Stacks and Queues

NULLs in ClickHouse can hurt performance

Why are there no good tablets at the moment?

Rewiring software delivery for the agentic era

Monitor all your servers from one beautiful dashboard

Show HN: I created a React alternative using web componnents

Multi-stage distributed query execution in ClickHouse Cloud

Stophy for AI Agents

Trump's Takeover of the American Regulatory Machine

Analysis of Canadian Surveillance Law Expansion Under Bill C-22 – CitizenLab

PaceVer (an alternative to SemVer, for mobile apps)

How ClickHouse Became 26x Faster at Joins

Can poppy seeds make you fail a drug test?

KDE Linux Is Coming Along Nicely, Ditching the AUR and Tightening Up Security

God of War Laufey: First gameplay trailer

Have a "Lifetime" Without Microsoft

No Let, No Rec, No Problem: A Gentler Introduction to the Y and Z Combinators

Resolving Feynman's restaurant problem reveals optimal solutions and strategies

Hundreds of cancer papers presented incorrect data after p16 protein mixup

djbsort

How to Debug AI Agents with Traces and Evals

Jumping Up/Down on the Shoulders of Giants, Never Talking About What Gates Did

The importance of free software to science

Self-hosted dev sandboxes with preview URLs (Docker, Go, no K8s)

Artist Corporations

The web is changing, and we are not going back

Claude Code vs. Codex