Why: I wanted to compare attention vs Mamba vs GQA at different parameter budgets without writing PyTorch for each experiment. Edit a JSON config, hit enter, and see loss numbers; mixlab races the different configs for you. The number one goal is iteration speed.
A JSON config lets you chain together common ML blocks (attention, GQA, Mamba, RetNet, and several more) and optimizers (Muon, AdamW); mixlab compiles the config to MLX IR, which can run on either the Metal or CUDA backend.
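To make the "chain blocks in JSON" idea concrete, here is a hypothetical config sketch. The field names other than "name" are illustrative assumptions, not mixlab's actual schema:

```json
{
  "name": "attn_vs_mamba_10M",
  "blocks": ["attention", "mamba", "attention"],
  "optimizer": "adamw",
  "d_model": 384,
  "n_layers": 6
}
```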
Why Go: 1.6s builds, built-in profiling (mixlab -cpuprofile gives you a flame graph), and import-based extensibility for custom blocks. No C++ extensions, no custom build systems. And personally I prefer strongly typed, compiled languages.
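"Import-based extensibility" usually means the registry pattern: a block package registers a constructor in its init(), so importing the package is all it takes to make the block available to configs. A minimal sketch of that pattern — the names (Block, Register, registry) are illustrative, not mixlab's actual API:

```go
package main

import "fmt"

// Block is a stand-in interface for a model component.
type Block interface {
	Name() string
}

// registry maps the name used in a JSON config to a constructor.
var registry = map[string]func() Block{}

// Register is called by block packages from their init() functions.
func Register(name string, ctor func() Block) {
	registry[name] = ctor
}

// attention is a toy block; a real one would live in its own package.
type attention struct{}

func (attention) Name() string { return "attention" }

func init() {
	// Importing the package runs this, so the block shows up
	// in the registry with no central wiring.
	Register("attention", func() Block { return attention{} })
}

func main() {
	b := registry["attention"]()
	fmt.Println(b.Name())
}
```

The config loader can then look up each string in the "blocks" array against the registry, which is why adding a custom block is just an import.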
On a Shakespeare benchmark matching nanoGPT (6 layers, 6 heads, d=384, 10.8M params): val loss 1.5527 on an M1 Max, 1.5588 on an A40. Numerical parity with PyTorch confirmed to 8 decimal places.
brew install mrothroc/tap/mixlab
mrothroc•1h ago
{ "name": "plain_3L",