– Core Mamba2 block with LM (Mamba2ForCausalLM) and time-series (Mamba2Forecaster) heads (see the usage sketch after this list)
– Pure JAX/Flax (no Triton/custom CUDA); runs on CPU / CUDA / TPU via standard JAX backends
– Small CPU-only parity test vs mamba2-torch: similar loss curves, final MSE diff ≈ 0.012, prediction correlation ≈ 0.99; after JIT warmup, JAX was ≈ 2× faster per step
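Here's a minimal usage sketch. Mamba2ForCausalLM is the real class name, but the constructor arguments and shapes are placeholders, and I'm assuming the standard Flax init/apply pattern rather than quoting the package's exact interface:

```python
# Illustrative sketch: constructor args and the import path are placeholders,
# assuming a standard Flax Module interface.
import jax
import jax.numpy as jnp
from mamba2_jax import Mamba2ForCausalLM  # import path assumed

model = Mamba2ForCausalLM(vocab_size=32000, d_model=512, n_layers=8)  # placeholder config

rng = jax.random.PRNGKey(0)
tokens = jnp.ones((1, 128), dtype=jnp.int32)   # (batch, seq_len)

params = model.init(rng, tokens)               # standard Flax init
logits = jax.jit(model.apply)(params, tokens)  # first call compiles, later calls are fast
print(logits.shape)                            # expect (1, 128, vocab_size)
```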
I’d really appreciate feedback on:
– API design, especially for streaming/stateful inference (one possible shape is sketched below)
– Performance gotchas you hit if you try it
– Any hooks you’d want exposed for research use
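To make the streaming question concrete, here's a strawman of the kind of carry-state decode loop I mean: an SSM generates token-by-token by threading a small recurrent state through each step instead of re-running the whole prefix. The `step_fn` signature is hypothetical, not the current package API, and `dummy_step` just stands in so the loop runs end to end:

```python
# Strawman streaming-decode loop; step_fn(params, state, token) -> (state, logits)
# is a hypothetical single-step interface, not mamba2-jax's actual API.
import jax
import jax.numpy as jnp

def dummy_step(params, state, token):
    state = state + params["w"][token]  # fake per-token state update
    logits = state @ params["out"]      # fake state -> logits head
    return state, logits

def generate(step_fn, params, state, last_token, n_new):
    """Greedy streaming decode: carry (state, token) through lax.scan."""
    def body(carry, _):
        state, token = carry
        state, logits = step_fn(params, state, token)
        next_token = jnp.argmax(logits, axis=-1)
        return (state, next_token), next_token
    _, out = jax.lax.scan(body, (state, last_token), None, length=n_new)
    return out

k1, k2 = jax.random.split(jax.random.PRNGKey(0))
params = {"w": jax.random.normal(k1, (100, 16)),
          "out": jax.random.normal(k2, (16, 100))}
tokens = generate(dummy_step, params, jnp.zeros(16), jnp.int32(1), n_new=8)
print(tokens)  # eight greedily decoded (dummy) token ids
```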
PyPI: https://pypi.org/project/mamba2-jax/
Thanks, Cosmo