frontpage.

Experiment that I've made. The models get access to an E2B sandbox and are instructed to create an ad according to the specifications (they can choose whatever tools they want to use for it, e.g. Pillow, Chromium) as a proxy for their ability to use tools, create other kinds of images, do complex layouts etc. Currently Opus 4.8 is on top (not surprising, but it did take 66 conversation turns to create the image) and GLM-5.2 is on fifth (which I do find surprising because it doesn't have image capabilty).

Show HN: Autolectures – lecture videos from prompts with Remotion

ECBSV

Erasing Existentials

Show HN: Exploring a More Pythonic AWS SDK

AI Boom Hits Labor Market Reality Check

Claude Guillemot: Ubisoft founder killed in plane crash

Show HN: Scalable reversi – infinite undo available via a1 style

Apple Internals: Swift in the Kernel

Robotics Teams Are Rebuilding the Data Stack from Scratch

I was wrong about the Midjourney ultra-sound scanner

Backtest Is Lying to You

How to Build a Marketplace Startup That Solves the Chicken-and-Egg Problem

Why the Cookbook Endures

Electric air taxis are stuck in the courtroom

Backporting bug fixes is dead, Project Valkey now sends in the bots

Linux '95

Vulgar Materialism

Show HN: Teach your kids absolute (perfect) pitch

Printing Gaussian Splats

Show HN: TermType – a terminal typing game where words fall like Space Invaders

Anthropic to Require ID Verification for Certain Capabilities Starting July 8

Why Mizoram has shops without shopkeepers (2024)

Show HN: A GitHub app that suggests code fixes for conversion failures

Smashing the NIMBYs created modern capitalism

Safe SIMD in Rust, Even on the Inside – By Sergey "Shnatsel" Davidoff

Why do sports stadiums have different names for the World Cup? Here's the reason

Neosolve – SolveSpace fork with OpenCASCADE CAD kernel

Creativity in the form of archived web pages from the dawn of the internet

How the social media ban could reshape how all of us use the internet

Where the sun stood at the 2026 summer solstice

Show HN: AdvertBench, ranking the ability of LLMs to create image ads