50x Faster Post-Training

https://www.workshoplabs.ai/blog/post-training-50x-faster
6•addiefoote8•1h ago

Comments

loudeaglenoise•54m ago
Hey all, Luke from Workshop Labs here. We're excited to get this in the hands of builders in the next 1-2 weeks.

We're doing some safety work before the release. Specifically, we're checking for bio uplift: does open-sourcing 50x faster training code for (relatively) smaller GPU setups seriously democratize dangerous bio capabilities?

We expect the answer is no, but it doesn't hurt to check. Once that's done, we'll drop the repo.

kostolansky•53m ago
We took another step toward making open source models truly open by creating a training stack that actually allows for finetuning of a large frontier OSS model, Kimi K2 Thinking. (The OSS stack for big models is surprisingly abysmal these days!)

There is a lot of value to be unlocked by people using language models for their own purposes, and our work here hopefully moves the needle toward making that accessible to more people. (The training code will be released soon, pending safety testing.)

We are very excited by what we can all build :D

- Tim from WSL

addiefoote8•17m ago
I'm also excited about the research that could be enabled by having weight-level access and fine-tuning access on frontier open source models. There's a lot of interesting behavior that just doesn't exist in 8B parameter models, not to mention the architectural and training differences.

addiefoote8•24m ago
Some more details on this: After realizing that Hugging Face would be messy to work with for training Kimi K2 Thinking, we decided to do it ourselves.

We started with PrimeRL and implemented Kimi in it, verifying it against the Moonshot API. The initial distributed training method, FSDP, is not ideal for memory-bottlenecked MoEs, so we added support for Expert Parallel. This enabled faster training, but many optimizations remained. We discuss several in the post; collectively, these efforts took us from training at 125 tokens/s to 6,660 tokens/s (roughly 53x) on a single 8xH200 node! Per token, our codebase is cheaper than anything on the market, including training APIs like Tinker.
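
For readers who want to sanity-check the numbers, here is a minimal sketch of the arithmetic behind the throughput claim. The two tokens/s figures come from the comment above; the hourly node price is a purely hypothetical placeholder (no price is given in this thread), and the function names are illustrative only.

```python
# Throughput figures quoted in the comment above (single 8xH200 node).
BASELINE_TOKENS_PER_S = 125
OPTIMIZED_TOKENS_PER_S = 6_660

# ASSUMPTION: hypothetical hourly rental price for the node, in USD.
# This number is NOT from the post; substitute your own.
HYPOTHETICAL_NODE_USD_PER_HOUR = 30.0


def speedup(baseline_tps: float, optimized_tps: float) -> float:
    """Ratio of optimized to baseline training throughput."""
    return optimized_tps / baseline_tps


def cost_per_million_tokens(node_usd_per_hour: float, tokens_per_s: float) -> float:
    """USD needed to train on one million tokens at a given throughput."""
    tokens_per_hour = tokens_per_s * 3600
    return node_usd_per_hour / tokens_per_hour * 1_000_000


if __name__ == "__main__":
    ratio = speedup(BASELINE_TOKENS_PER_S, OPTIMIZED_TOKENS_PER_S)
    print(f"speedup: {ratio:.2f}x")
    for label, tps in [("baseline", BASELINE_TOKENS_PER_S),
                       ("optimized", OPTIMIZED_TOKENS_PER_S)]:
        usd = cost_per_million_tokens(HYPOTHETICAL_NODE_USD_PER_HOUR, tps)
        print(f"{label}: ${usd:.2f} per million tokens (hypothetical node price)")
```

On fixed hardware, the cost per token falls by the same factor as the throughput rises, so the ratio holds regardless of the node price you plug in.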

We plan to open source in the coming week or two, pending safety evals!

Sigma's New Rice Company Is Less About Rice and More About Aizu

https://petapixel.com/2026/03/12/sigmas-new-rice-company-is-less-about-rice-and-more-about-aizu/
1•lastofthemojito•1m ago•0 comments

ICE agents reveal daily arrest quotas and surveillance app in court testimony

https://www.theguardian.com/us-news/2026/mar/13/ice-agent-court-testimony-oregon
1•mitchbob•1m ago•0 comments

Everything's Casino

https://www.joanwestenberg.com/everythings-casino/
1•alcazar•3m ago•0 comments

Yet another Valve lawsuit on loot boxes

https://www.windowscentral.com/gaming/pc-gaming/steams-valve-responds-to-lawsuit-from-new-york-at...
1•s3r3nity•4m ago•0 comments

Account regional namespaces for Amazon S3 general purpose buckets

https://aws.amazon.com/blogs/aws/introducing-account-regional-namespaces-for-amazon-s3-general-pu...
1•timoth•7m ago•1 comments

How we built a prompt optimization agent

https://www.extend.ai/resources/how-we-built-composer
2•kbyatnal•12m ago•0 comments

The Great AI Silicon Shortage

https://newsletter.semianalysis.com/p/the-great-ai-silicon-shortage
2•akyuu•12m ago•0 comments

H-1B Visa employers database goes offline, key public records disappear

https://timesofindia.indiatimes.com/technology/tech-news/h-1b-visa-employers-database-goes-offlin...
1•alexfromapex•13m ago•1 comments

AMUX – Tmux and Tailscale powered offline-first agent multiplexer

https://amux.io/
1•Beefin•15m ago•0 comments

Digg Is Gone Again

https://digg.com/
4•hammerbrostime•16m ago•4 comments

Show HN: Mac wallpaper that updates daily – calendar, quote, affirmation, fact

https://thecalendarwallpaper.com
2•TheOmkarBirje•19m ago•0 comments

Show HN: Mjmx – render mjml using JSX

https://mjmx.dev/
1•skwee357•20m ago•0 comments

Nitrogen, Ammonia, and the Strait of Hormuz

https://www.science.org/content/blog-post/nitrogen-ammonia-and-strait-hormuz
2•nbernard•20m ago•0 comments

Show HN: Fastest Enterprise AI Gateway

https://docs.getbifrost.ai/benchmarking/getting-started
2•aanthonymax•21m ago•0 comments

Instagram Ending Encrypted DMs

2•01-_-•21m ago•1 comments

Gilmour's 'Black Strat' Sells for $14.55M, the Most Expensive Guitar Ever Sold

https://www.rollingstone.com/music/music-news/david-gilmour-black-strat-sells-14-million-auction-...
2•thm•21m ago•0 comments

A.I. Chatbots Want Your Health Records. Tread Carefully

https://www.nytimes.com/2026/03/12/technology/personaltech/microsoft-copilot-health-ai-chatbots.html
4•JumpCrisscross•21m ago•0 comments

Why does entering a submission url prevent me from submitting a new post?

1•avionics-guy•22m ago•0 comments

An AI agent that claws through your network

https://github.com/automateyournetwork/netclaw
1•mooreds•22m ago•0 comments

Marknote 1.5 Released for KDE

https://blogs.kde.org/2026/03/13/marknote-1.5/
1•jandeboevrie•23m ago•0 comments

Things I Wish I'd Known Before Buying an EV

https://www.wsj.com/business/autos/driving-electric-vehicle-downsides-9e2b51ee
2•sam345•24m ago•0 comments

Show HN: A social network where AI agents have public profiles and earn money

https://socialtense.com/
2•keshav_1806•24m ago•0 comments

We Built a Cathedral in the Wrong City

https://blog.shanemac.com/we-built-a-cathedral-in-the-wrong-city/
1•mooreds•25m ago•0 comments

An open source alternative to Logi-Plus mouse software

3•avionics-guy•25m ago•0 comments

Undefined Roles: Pe

https://www.andismith.com/blogs/2026/03/undefined-roles
2•AndiSmith•26m ago•0 comments

CLI Has a New Super User

https://rsnodgrass.substack.com/p/your-cli-has-a-new-super-user
2•galexyending•28m ago•2 comments

Show HN: AgentLog – a lightweight event bus for AI agents using JSONL logs

https://github.com/sumant1122/agentlog
2•paperplaneflyr•29m ago•0 comments

Show HN: FrameFit – AI-powered photo cropping for digital photo frames

https://framefit.photo
1•farskid•30m ago•0 comments

Ask HN: Why is USA starting world war 3 now?

3•roschdal•30m ago•3 comments

Show HN: AgentClick – Human-in-the-loop review UI for AI coding agents

https://github.com/agentlayer-io/AgentClick
1•harvenstar•30m ago•1 comments