frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Mixlab, an ML arch lab in Go. JSON config, Metal and CUDA, 1.6s builds

https://github.com/mrothroc/mixlab
1•mrothroc•1h ago
I built a tool for quickly testing different ML architectures. Define a model in JSON, train on your Mac (Metal) or ship the same config to a cloud GPU (CUDA). No code changes between platforms.

Why: I wanted to compare attention vs Mamba vs GQA at different parameter budgets without writing PyTorch for each experiment. Edit a JSON config, hit enter, see loss numbers. It will race different configs for you. The number one goal is iteration speed.

JSON config lets you chain together common ML blocks (attention, GQA, mamba, RetNet, and several more) and optimizers (muon, adamw) and compiles them to MLX IR, which can either run on Metal or CUDA backends.

Why Go: 1.6s builds, built-in profiling (mixlab -cpuprofile gives you a flame graph), import-based extensibility for custom blocks. No C++ extensions, no custom build systems. And personally I prefer strongly-typed, compiled languages.

On a Shakespeare benchmark matching nanoGPT (6L, 6H, d=384, 10.8M params): val loss 1.5527 on M1 Max, 1.5588 on A40. PyTorch numerical parity confirmed to 8 decimal places.

brew install mrothroc/tap/mixlab

https://github.com/mrothroc/mixlab

Comments

mrothroc•1h ago
Simple example to show how configs are defined:

{ "name": "plain_3L",

  // Minimal causal transformer baseline: 3 attention layers plus 3 SwiGLU layers.
  "model_dim": 128,
  "vocab_size": 1024,
  "seq_len": 128,

  // Blocks execute sequentially, alternating token mixing and feed-forward mixing.
  "blocks": [
    {"type": "plain", "heads": 4},
    {"type": "swiglu"},
    {"type": "plain", "heads": 4},
    {"type": "swiglu"},
    {"type": "plain", "heads": 4},
    {"type": "swiglu"}
  ],

  // Slightly longer than smoke-test configs so the baseline loss moves visibly.
  "training": {
    "steps": 200,
    "lr": 3e-4,
    "grad_clip": 1.0,
    "weight_decay": 0.01,
    "seed": 42,
    "batch_tokens": 1024
  }
}

AI Tools Are Helping Mediocre North Korean Hackers Steal Millions

https://www.wired.com/story/ai-tools-are-helping-mediocre-north-korean-hackers-steal-millions/
1•cdrnsf•21s ago•0 comments

Honey, I Shrunk the Coding Agent

https://itayinbarr.substack.com/p/honey-i-shrunk-the-coding-agent
1•homarp•2m ago•1 comments

New York bans state employees from insider trading on prediction markets

https://www.wired.com/story/new-york-bans-government-employees-prediction-markets/
2•jmsflknr•3m ago•0 comments

Why Gen AI Isn't Quite Cost-Effective at Creating 3D Game Worlds

https://wjamesau.substack.com/p/why-gen-ai-isnt-quite-cost-effective
1•SLHamlet•4m ago•0 comments

The Mystery of the Giant Blobs at the Center of the Earth

https://nautil.us/the-mystery-of-the-giant-blobs-at-the-center-of-the-earth-1280082
1•Brajeshwar•4m ago•0 comments

How to program computers (kos) [video]

https://www.youtube.com/watch?v=kTrOg19gzP4
1•tosh•4m ago•0 comments

Compiler Jokes

https://bitsrc.org/blog/posts/compiler-jokes.html
1•kouosi•5m ago•0 comments

EML compresses calculator syntax; Phase Calculus places it one layer downstream

1•jlietz93•5m ago•0 comments

Trees of New York City

https://tree-map.nycgovparks.org/tree-map/neighborhood/177
1•jackconsidine•6m ago•0 comments

Show HN: We built Cursor, but for data transformations [Open Source]

https://github.com/zipstack/visitran
1•naren87•7m ago•0 comments

New Kind of Paper (2021)

https://mlajtos.mu/posts/new-kind-of-paper
1•tosh•9m ago•0 comments

I almost signed a lease that would have cost me thousands

https://goleazly.com/
1•pomberito•9m ago•0 comments

What if we start to draw inspiration from nature's greatest machine?

https://eversoleken.substack.com/p/signaling-is-the-intelligence
1•kennethops•10m ago•1 comments

Show HN: DrakeAI – AI expense tracker you log by texting (iOS and Android)

https://apps.apple.com/us/app/drakeai/id6762331893
1•a_protsyuk•10m ago•0 comments

Google's 8th Generation TPUs Power the Agentic Era [video]

https://www.youtube.com/watch?v=Ocf7EYHmmzo
1•sam1234apter•12m ago•0 comments

Show HN: A Swift Payment message validator built from Swift Standard rules

https://cbprstar.com/
1•phoughton•14m ago•0 comments

Live hooks – simple missing patterns for predictable hooks in async React code

https://github.com/Taltzipi/live-hooks
1•taldavidson•15m ago•1 comments

Building design system components with agent teams

https://www.kaelig.fr/design-system-components-with-ai-agent-teams/
1•kaelig•15m ago•0 comments

Physicists think they've solved the muon mystery

https://arstechnica.com/science/2026/04/physicists-think-theyve-solved-the-muon-mystery/
1•nobody9999•16m ago•0 comments

Show HN: Clawrium – A CLI for managing AI agent fleet across multiple instances

https://ric03uec.github.io/clawrium/
2•devashish86•16m ago•1 comments

Ask HN: Do financial stakes improve long-term consistency?

https://proofly.migliorarecorp.com/
1•talhaahsan•17m ago•0 comments

Markdown (Aaron Swartz: The Weblog)

http://www.aaronsw.com/weblog/001189
1•birdculture•20m ago•0 comments

Surveillance Pricing: Exploiting Information Asymmetries

https://lpeproject.org/blog/surveillance-pricing-exploiting-information-asymmetries/
3•cainxinth•20m ago•0 comments

I Forked 4 CLI coding agents to Run the Same Model. I found a 2x gap

https://charlesazam.com/blog/deepdive-benchmark/
1•grokx•20m ago•0 comments

New electric cars now cheaper than petrol models for first time in UK

https://www.evinfrastructurenews.com/emobility/new-electric-cars-now-cheaper-than-petrol-models-f...
1•MBCook•20m ago•0 comments

Agent Vault: The Open Source Credential Proxy and Vault for Agents

https://infisical.com/blog/agent-vault-the-open-source-credential-proxy-and-vault-for-agents
2•vmatsiiako•22m ago•0 comments

Startups Can Beat Big AI Labs

https://modular.cloud/blog/how-startups-can-beat-big-ai-labs
1•liamcardenas•22m ago•0 comments

Anker made its own chip to bring AI to all its products

https://www.theverge.com/tech/916463/anker-thus-chip-announcement
6•Brajeshwar•23m ago•0 comments

Sauce Co Prego Sparks Privacy Concerns After Device Launch to Record Mealtimes

https://www.inkl.com/news/prego-sparks-privacy-concerns-after-launching-a-device-designed-to-reco...
1•randycupertino•25m ago•0 comments

Mathematically Correct Breakfast

https://www.georgehart.com/bagel/bagel.html
1•_Microft•25m ago•0 comments