frontpage.

I trained a live RL agent inside a pixel platformer you can play against on a desktop browser (needs a keyboard or a controller):

https://rlplays.com/game

This is NOT a quick-one-off vibe/LLM-coded project.

I started this project out of curiosity: I wanted to build an RL-based game as there were very few out there (e.g. Sony GT Sophy). And I wanted to learn the core RL foundation in a practical/useful manner.

I built on top of Puffer - but the training speed was not up to my needs so I rewrote the core with a ground-up native eval/training loop with multithreaded GPU batching (gonna be a part of the next Puffer release). [ Unaffiliated plug: Puffer is an excellent OSS library - check out https://puffer.ai ]

I trained the RL agent using curriculum learning + self-play. The demo showcases this self-play as well - which you can play against yourself, like an RL agent would!

Technical details in my blog in the link above.

Blackalicious – Alphabet Aerobics [video]

Show HN: Fitdle – A Curve Fitting Contest

Accessing Hardware in Rust

Official Dashtera Launch

How to identify your Apple keyboard layout by country or region

Ask HN: What are some science or tech facts you know?

Show HN: Swik – catalog of asset-specific sentiment inversions for financial NLP

The new best free project management tool

AMD GPU-Initiated I/O

I rebuilt Claude Desktop in 10 days. Here's why

Been using this Tourist eSIM while traveling, super cheap unlimited data

OpenClaw is just cron, Markdown and a chat bot and that's why it matters

Show HN: Get a quick skincare analysis by uploading a photo

Show HN: EasyShot – macOS screenshot thumbnails that don't disappear after 5s

AI Hairstyle Changer

Why Whisper Notes for Mac Left the App Store

"I hope you don't use Generative AI"

The AI Morning Show: Automating German Humor

Rippling AI

The Five Companies You Can Build in 2026

AI Council: run mupliple LLMs on your question, get consolidated opinion

TBM 406: Seeing Everything, Understanding Nothing (The Context Trap)

Gea: A Compile-Time Reactive UI Framework That's Just JavaScript

The Reason Most People Are Terrible Communicators (and How to Fix It)

Bombadil: Property-based testing for web UIs by Antithesis

Management in the Age of AI – Stay SaaSy

'Alright mate?': Amazon pins UK hopes on AI upgrade of Alexa

Wikigacha – Collect cards from articles on Wikipedia and use them in battle

Taste at scale. Why the hardest part of building products stayed human

Context Engineering for Coding Agents

Show HN: Play against a live RL agent in a pixel platformer