
There is an adversarial relationship between developers and big model labs.

Model labs charged developers higher API prices to subsidize their own agent harness offerings. Think of Anthropic charging 5x higher Claude API prices to subsidize consumer subscriptions. So Cursor was, in a way, subsidizing its own direct competitor.

DeepSeek V4 Flash totally inverted this relationship. Now you have a model that beats even Sonnet on some benchmarks and is fully open source. Inference providers are racing to the bottom to optimize and offer cheaper hosting. Every player without a SOTA model of its own is now racing to switch over and stop paying the big model lab tax. Even Microsoft is switching Copilot to DeepSeek.

On switching over to DeepSeek:

- we saw a cost decrease of over 100x, with similar or better performance than Gemini 3 Flash

- insane savings from cached input tokens: $0.002 per 1 million tokens

- both DeepSeek Flash and GLM 5.2 are text-only models, so clearly multimodal training is not worth the additional cost. Language is just a much more efficient, sparse representation of the world and of reasoning than vision is.

- we made an early bet on a text-only web agent harness, and with DeepSeek that bet now gives us a unique cost advantage.

- we rewrote our harness as a callable DSL library, so the model generates code against it and the harness executes that code. DeepSeek has proven phenomenal at generating code to drive an agent harness.

- I would highly recommend that everyone rewrite their harness to be text-only and callable via executable code, leveraging DeepSeek V4 Flash.
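For anyone wondering what "harness as a callable DSL" might look like in practice, here is a rough, purely illustrative sketch. None of it is taken from the harness described above: `FAKE_SITE`, `TextBrowser`, `run_generated`, and the `goto`/`read`/`click` calls are all hypothetical names, the "site" is an in-memory dict instead of a real browser, and the generated code is a hard-coded string standing in for model output.

```python
from dataclasses import dataclass, field

# Hypothetical in-memory "site": URL -> (page text, {link label: target URL}).
# A real harness would render live pages to text instead.
FAKE_SITE = {
    "/home": ("Welcome. Links: [Pricing]", {"Pricing": "/pricing"}),
    "/pricing": ("Pro plan: $20/month", {}),
}


@dataclass
class TextBrowser:
    """Text-only browser stand-in: pages are plain strings, no screenshots."""
    url: str = "/home"
    log: list = field(default_factory=list)

    def goto(self, url: str) -> str:
        if url not in FAKE_SITE:
            raise ValueError(f"unknown url: {url}")
        self.url = url
        self.log.append(("goto", url))
        return self.read()

    def read(self) -> str:
        return FAKE_SITE[self.url][0]

    def click(self, label: str) -> str:
        links = FAKE_SITE[self.url][1]
        if label not in links:
            raise ValueError(f"no link labelled {label!r} on {self.url}")
        self.log.append(("click", label))
        return self.goto(links[label])


def run_generated(code: str, browser: TextBrowser) -> dict:
    """Execute model-written code with only the DSL calls in scope.

    Assumption: the code comes from a trusted pipeline. exec() with a
    trimmed namespace is NOT a security sandbox.
    """
    scope = {
        "__builtins__": {},
        "goto": browser.goto,
        "read": browser.read,
        "click": browser.click,
    }
    exec(code, scope)
    # By convention (an assumption of this sketch), the script stores
    # its answer in a variable named `result`.
    return {"result": scope.get("result")}


# Stand-in for what a model might emit when asked for the Pro plan price.
generated = 'goto("/home")\nresult = click("Pricing")'
browser = TextBrowser()
out = run_generated(generated, browser)
print(out["result"])
```

The point of this shape is that the model emits one short script per task instead of one tool call per step, and everything it sees and returns is plain text. Real model output would need proper isolation (a separate process or container) before being executed.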

Tabsmith-lint catches Chrome Web Store rejections before you submit

Launchpad Is Down

As SuperAgers age, they make at least twice as many new neurons as their peers

Layoffs hit Bellevue-based video game studio behind 'Destiny' franchise

Staff Framework – Turning Wishes into Verifiable Goals (Smart Method)

New $1M Vesuvius Challenge Grand Prize will be announced in coming days

Anthropic Accuses Alibaba of Largest AI Distillation Attack: 28.8M Fraudulent

The Ticks That Cause Red-Meat Allergies Are Spreading Across the U.S.

Razor‐Sharp Edge–The Yakutat Slab Dissecting South‐Central Alaska

Radxa Orion O6N Review: The Powerful and Silent ARM64 Beast

How agents are transforming work

An Audit of the Bible and the God in It

Ask HN: How do you know when an Ad campaign is doing well?

Ask HN: Has anyone deployed AI tools in a trade or field service

Chasing Likes, Losing Connection: Youth Mental Health in the Digital Era

Linux Foundation Launches Akrites to Defend FOSS from AI-Enabled Exploits

The Windows Phone that "killed" Nokia

Sony Confirms 'Significant' Bungie Layoffs

DropItDown – Drop a file, get Markdown your AI agent can read (macOS)

How do you get good ideas for startups?

Extropic is Rethinking Computing

CS2-10k: A Large-Scale Egocentric Counter-Strike 2 Dataset

Using a Rust macro for concise newtypes

Ask HN: How did you set up a multi-agent orchestration for personal use?

A Deep Dive on China's "LineShine" All-CPU, Exaflops-Class Supercomputer

Anthropic's philosopher answers your questions [video]

Silicon Valley Has an Empathy Vacuum (2016)

Van Halen test

Show HN: FastPlay, a fast minimal Windows video player built in Rust

Investors bet on AI again after Micron reports 346% sales jump