The basic idea is good. If a task is messy, you can split it into smaller threads, run them in parallel, and merge the results. For large refactors, multi-file debugging, or research implementation work, this can save a lot of time.
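The split/parallelize/merge pattern above can be sketched in a few lines. This is a toy illustration, not any real tool's implementation: `run_subtask` is a hypothetical stand-in for a model call, and the merge step is just ordered concatenation.

```python
from concurrent.futures import ThreadPoolExecutor

def run_subtask(task: str) -> str:
    # Hypothetical stand-in: a real agent would call a model API here.
    return f"result for {task!r}"

def fan_out_and_merge(subtasks: list[str]) -> str:
    # Run each subtask on its own worker thread, then merge results
    # back in the original task order.
    with ThreadPoolExecutor(max_workers=len(subtasks)) as pool:
        results = list(pool.map(run_subtask, subtasks))
    return "\n".join(results)

merged = fan_out_and_merge(["refactor module A", "debug module B"])
```

The appeal is obvious: independent subtasks finish concurrently instead of serially. The catch, as described below, is *which* model ends up inside `run_subtask`.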
But I am starting to notice a strange manager effect.
I ask for a top-tier model because I want the best reasoning and engineering judgment. Instead, the main agent often seems to switch into "manager" mode and delegate most of the actual work to smaller models. The smaller model produces an okay solution, and then the expensive model turns it into a polished final answer.
So the smart model becomes a manager, the cheaper model does the heavy lifting, and the whole chain burns tokens. Ironically, the manager is often the most expensive and the least useful link in that chain.
What I want from these tools is mostly transparency and control. I want to see which model handled which step. I want the option to disable delegation, or restrict it to certain subtasks. And if the tool is making a speed-versus-quality tradeoff, I want that tradeoff to be explicit.
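To make the wish concrete, here is a sketch of what such controls could look like. Every key here is hypothetical; no current tool, to my knowledge, exposes exactly this, which is the point.

```python
# Hypothetical agent settings: explicit, user-visible delegation controls.
agent_config = {
    "delegation": {
        "enabled": False,  # force the flagship model to do the work itself
        # ...or whitelist only low-stakes subtasks for cheaper models:
        "allowed_subtasks": ["codebase_search", "log_summarization"],
    },
    # Log which model handled each step, so the tradeoff is visible.
    "report_model_per_step": True,
}
```

Even just the last flag would help: if the tool told me after each run which model wrote which part, I could judge the speed-versus-quality tradeoff myself.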
Subagents are genuinely useful. But if I pay for a flagship model, I want flagship-level work, not a summary of a weaker model’s draft.
Curious whether others are seeing the same thing.