frontpage.

I was frustrated that coding agents like Claude Code were basically unusable with local models. every few turns the prefix shifts, KV cache gets invalidated, and your mac has to re-prefill the entire context from scratch.

So i built oMLX. it persists KV cache blocks to SSD, and when a previous context comes back, it restores from disk instead of recomputing. this alone made Qwen3-Coder-80B on my M3 Ultra actually usable for real coding sessions.

Some other stuff it does: continuous batching, multi-model serving (LLM + embedding + reranker at once), prefix sharing with copy-on-write, and a native mac menubar app so you don't have to touch the terminal.

Just shipped a built-in benchmark tool too, so you can test your own setup easily.

The Mainframe Renaissance: Why AI Needs a 1970s Adult to Supervise Its Homework

Hyperagent

My lobster lost $450k this weekend

The Longest Line of Sight

Ductape – One SDK for any backend integration

You Can't Optimize What You Can't See. AI Cost Observability

Show HN: Fastdedup – Rust dataset deduplication (2:55 vs. 7:55 688MB vs. 22GB)

Hegseth gives Anthropic until Friday to back down on AI safeguards

Training my dog to vibe code B2B SaaS apps

Can agentic coding raise the quality bar?

Show HN: MakLock – Free macOS App Locker with Touch ID and Apple Watch

"SaaS is Dead" – they say

Show HN: YouAM – An address, contact card, and encrypted inbox for AI agents

Show HN: Shelfctl – PDF/ePub library manager backed by GitHub Release

Intel Formally Ends Four of Their Go Language Open-Source Projects

Spacydo: State machine example with own calldata for state transition rules

Data vs. Hype: How Orgs Win with AI – The Pragmatic Summit [video]

Implementing a Clear Room Z80 / ZX Spectrum Emulator with Claude Code

Coding Agent, Good?

Steel Bank Common Lisp

Forests don't just store carbon. They keep people alive, scientists say

The Deceptively Simple Act of Writing to Disk

Inception Launches Mercury 2, the Fastest Reasoning LLM

OpenAI, the US government and Persona built an identity surveillance machine

OpenAI resets spending expectations, from $1.4T to $600B

I think WebRTC is better than SSH-ing for connecting to Mac terminal from iPhone

China May Grab a Lead in the Race for Military Fusion

An AI agent bought from our WooCommerce store. Here's what we learned

Ask HN: Share a random link from your bookmarks

Ask HN: Demand for a compliance-first deterministic context compiler?

Show HN: oMLX – coding agents on local LLMs without the painful reprefill