LLMs improve when they can practice and reason in interactive environments.
Recent work (DeepSeek-R1, GRPO) shows RL can teach models to prefer better outputs by giving rewards.
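The core GRPO idea can be sketched in a few lines (the rewards here are made up for illustration): sample a group of completions per prompt, score each one, and use the group-normalized reward as the advantage, with no learned value network needed.

```python
# Sketch of GRPO's group-relative advantage (illustrative rewards,
# not DeepSeek's implementation): completions scored above the group
# mean are reinforced, those below are discouraged.
import statistics

def group_advantages(rewards):
    """Advantage of each completion relative to its sampled group."""
    mean = statistics.mean(rewards)
    std = statistics.pstdev(rewards) or 1.0  # guard against zero spread
    return [(r - mean) / std for r in rewards]

# Four sampled completions for one prompt, scored by the environment:
rewards = [1.0, 0.0, 0.5, 0.5]
advantages = group_advantages(rewards)
# The best completion gets a positive advantage, the worst a negative
# one, and the advantages of a group always sum to zero.
```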
But most RL environments for LLMs are fragmented or closed. That makes it hard for the community to experiment or reproduce results.
Environments Hub is a new open platform by Prime Intellect where anyone can share RL environments for training or evaluating LLMs.
Think of an environment as a software package bundling three things: data, a harness, and scoring rules.
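To make that concrete, here is a minimal sketch of what such a package contains. The names are illustrative, not the Environments Hub API: a dataset of tasks, a harness that turns a prompt into a completion, and a scoring rule that assigns a reward.

```python
# Hypothetical shape of an RL environment (not the actual Hub API):
# data + harness + scoring rule, bundled so anyone can run it.
from dataclasses import dataclass
from typing import Callable

@dataclass
class Environment:
    dataset: list[dict]                   # prompts plus reference answers
    rollout: Callable[[str], str]         # harness: prompt -> completion
    score: Callable[[str, dict], float]   # scoring rule: reward in [0, 1]

    def evaluate(self) -> float:
        """Mean reward of the harness over the dataset."""
        rewards = [self.score(self.rollout(ex["prompt"]), ex)
                   for ex in self.dataset]
        return sum(rewards) / len(rewards)

env = Environment(
    dataset=[{"prompt": "2+2=?", "answer": "4"}],
    rollout=lambda p: "4",                # stand-in for a real model call
    score=lambda out, ex: float(out.strip() == ex["answer"]),
)
mean_reward = env.evaluate()
```

The same object serves both uses mentioned above: call `evaluate()` to benchmark a model, or feed the per-completion rewards into an RL trainer.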
Agents today incorporate models and tools (from APIs to a terminal), so environments need to capture that complexity.
I wrote a hands-on walkthrough covering:
- RL + LLM basics
- Navigating the Environments Hub
- Evaluating models and agents
- GRPO-style training of a tiny model on an alphabetical sort task
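For the sort task in the last bullet, the reward can be as simple as position-wise agreement with the correctly sorted list. This is my own toy scoring function, not the walkthrough's exact code:

```python
# Toy reward for an alphabetical-sort task (illustrative, not the
# walkthrough's implementation): fraction of output positions that
# match the correctly sorted word list.
def sort_reward(completion: str, words: list[str]) -> float:
    target = sorted(words)
    predicted = completion.split()
    matches = sum(p == t for p, t in zip(predicted, target))
    return matches / len(target)

perfect = sort_reward("apple banana cherry",
                      ["cherry", "apple", "banana"])   # full reward
partial = sort_reward("banana apple cherry",
                      ["cherry", "apple", "banana"])   # only "cherry" placed right
```

A graded reward like this gives GRPO more signal than a binary right/wrong check, since partially sorted outputs still earn partial credit.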
If you want to experiment with RL for LLMs or just see how open environments can accelerate learning, this walkthrough is a practical starting point.
anakin87•2h ago