Ask HN: Opus 4.7 – is anyone measuring the real token cost on agentic tasks?
1•nicola_alessi•1h ago
Shipped today. The benchmark gains are real: 87.6% on SWE-bench (up from 80.8%), +13% on coding tasks, and 3x more production tasks resolved on Rakuten-SWE-Bench.
But several changes compound for token consumption: a new tokenizer (1.0–1.35x more tokens for the same input, depending on content type), an xhigh effort mode that reasons longer per turn, and /ultrareview, which spins up parallel agents for code review.
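Back-of-envelope, those three factors stack multiplicatively. A tiny sketch with mostly made-up numbers: only the 1.35x tokenizer figure comes from the post above; the 2x reasoning factor and 3 parallel agents are illustrative guesses, not measurements.

```python
# Rough combined token-cost multiplier vs. the previous model.
# All inputs are illustrative assumptions, not benchmarked values.

def cost_multiplier(tokenizer_factor: float,
                    reasoning_factor: float,
                    parallel_agents: int) -> float:
    """Multiply the three independent sources of token inflation."""
    return tokenizer_factor * reasoning_factor * parallel_agents

# Worst-case tokenizer inflation, hypothetical 2x longer reasoning
# at xhigh, and 3 hypothetical parallel review agents:
print(round(cost_multiplier(1.35, 2.0, 3), 2))  # → 8.1
```

Even if each factor individually looks modest, the product is what shows up on the bill, which is why per-task cost deltas matter more than per-token pricing.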
The model is genuinely better. The question is whether "better reasoning on the same noisy context" is the right optimization. Coding agents still spend most of their budget exploring files they won't touch before doing anything useful — a smarter model doesn't fix that, it just does it more thoroughly.
Curious what cost delta people are seeing on real codebases moving from 4.6 to 4.7, especially with the new effort levels.
(I've been building in this space: vexp.dev is my attempt at the context side of the problem. Happy to share more details if useful.)