We ran experiments comparing Optuna and autoresearch. Autoresearch converges faster, is more cost-efficient, and even generalizes better.

The experiments were run on NanoChat. We let Claude define Optuna’s search space to align the priors between the two methods, and we ran each optimization method three times. On average, autoresearch is far more sample-efficient.
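One common way to compare sample efficiency across repeated runs is to average the best-so-far score at each trial. The sketch below assumes a lower-is-better score and equal-length runs; the function names and the toy numbers are made up for illustration and are not the experiment's data.

```python
from itertools import accumulate
from statistics import mean


def best_so_far(scores):
    """Running minimum: the best (lowest) score seen after each trial."""
    return list(accumulate(scores, min))


def mean_best_so_far(runs):
    """Average the best-so-far curves of several equal-length runs, trial by trial."""
    curves = [best_so_far(run) for run in runs]
    return [mean(step) for step in zip(*curves)]


# Toy numbers only (hypothetical, NOT results from the post): three runs of four trials.
toy_runs = [
    [1.00, 0.90, 0.95, 0.80],
    [1.10, 1.00, 0.85, 0.90],
    [0.90, 0.95, 0.85, 0.70],
]
curve = mean_best_so_far(toy_runs)
```

Plotting one such curve per method against the trial index shows which method reaches a given score in fewer trials.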

In the 5-minute training setting, LLM tokens cost as much as the GPUs. Despite the resulting 2× higher per-step cost, autoresearch still comes out ahead across all cost budgets.
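To make the cost comparison concrete, here is a minimal sketch of scoring a method at a fixed cost budget rather than a fixed trial count. It assumes a lower-is-better score and a constant cost per step; `best_within_budget`, the cost units, and the example values are hypothetical and not taken from the experiments.

```python
def best_within_budget(scores, cost_per_step, budget):
    """Best (lowest) score among the trials that fit within `budget`.

    Returns None when the budget does not cover even one trial.
    """
    affordable = int(budget // cost_per_step)
    if affordable == 0:
        return None
    return min(scores[:affordable])


# Hypothetical cost units: one GPU training run costs 1.0.
# A classic tuner pays only for the GPU; an LLM-driven step pays for the GPU
# plus LLM tokens of roughly equal cost, i.e. 2.0 per step.
GPU_ONLY_STEP = 1.0
GPU_PLUS_LLM_STEP = 2.0

# Toy score sequences (made up for illustration only).
tuner_scores = [1.0, 0.9, 0.8, 0.7]
llm_scores = [0.9, 0.6, 0.5, 0.4]

tuner_best = best_within_budget(tuner_scores, GPU_ONLY_STEP, budget=4.0)
llm_best = best_within_budget(llm_scores, GPU_PLUS_LLM_STEP, budget=4.0)
```

At the same budget the more expensive method gets half as many trials, so it only wins if each of its trials improves the score enough to make up for that.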

What’s more, the solution found by autoresearch generalizes better than Optuna’s. When we gave the best solutions more training time, the absolute score gap widened and the statistical significance became stronger.

An important contributor to autoresearch’s capability is that it searches directly in code space. In the early stages, autoresearch tunes knobs within Optuna’s 16-parameter search space. With more iterations, however, it starts to explore code changes.

Show HN: Is autoresearch better than classic hyperparameter tuning?