I’ve been working on `benchmax`, an open-source framework for building, running, and parallelizing environments for fine-tuning LLMs with reinforcement learning.
What I wanted to solve for:
- Environments are tightly coupled with RL trainers, leading to fragmentation and limited compatibility.
- These coupled environments tend to be mostly competitive math and coding → for OSS RL + LLMs to scale, we need more complex, real-world environments.
- Scaling these environments in parallel is still not easy.
What I'm excited about:
- benchmax is training-framework agnostic, with adapters already built out for verl and verifiers. We’re gonna build more adapters for other frameworks (e.g. SkyRL), instead of forcing others to adopt our standard (though ofc they’re welcome to!)
- benchmax comes with a few interesting environments out of the box: spreadsheet processing, CRM, etc. → more coming soon!
- benchmax supports MCP as a first-class citizen. There has been an explosion of MCP servers/tools built for use cases ranging from browser use to Excel to game creation. `benchmax` lets folks leverage and compose these existing MCP servers to build environments integrated with real-world systems.
- Multi-node environment parallelization coming soon!
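To make the trainer-agnostic idea concrete, here’s a hypothetical sketch (not benchmax’s actual API — the names `EchoEnv`, `StepResult`, `reset`, and `step` are illustrative assumptions): if an environment exposes a small reset/step surface, any RL trainer can drive it through a thin adapter.

```python
from dataclasses import dataclass

# Hypothetical sketch only -- NOT benchmax's actual API.
# The idea: a trainer-agnostic environment exposes a minimal
# reset/step interface, so adapters for different RL trainers
# (verl, verifiers, ...) just translate to/from this surface.

@dataclass
class StepResult:
    observation: str  # next observation shown to the policy
    reward: float     # scalar reward for the action taken
    done: bool        # whether the episode has ended

class EchoEnv:
    """Toy environment: rewards the model for echoing the prompt."""

    def reset(self) -> str:
        # Start a new episode and return the initial observation.
        self.prompt = "hello"
        return self.prompt

    def step(self, action: str) -> StepResult:
        # Score the model's action; this toy task ends after one step.
        reward = 1.0 if action == self.prompt else 0.0
        return StepResult(observation="", reward=reward, done=True)

# A trainer adapter only ever needs these two calls:
env = EchoEnv()
obs = env.reset()
result = env.step(obs)
print(result.reward)
```

Because the environment knows nothing about the trainer, the same `EchoEnv` could be wrapped once per framework rather than rewritten for each.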
If you like what you see, feel free to *star* the *repo* to support the project!! Our hope is to really let anyone benchmax on their tasks, with benchmax.
https://github.com/cgftinc/benchmax
It’s still very early! And I expect to be shipping a lot more → more environments, more trainer integrations. Would love y’all’s thoughts on what environments and trainer integrations could be interesting!