After losing hundreds of games of
https://pokelike.xyz/ it occurred to me that the state space was small enough that maybe a small neural net trained with PPO could beat it somewhat consistently. After some reward engineering it works! The PPO-trained neural net can beat 9% of all runs all the way to the Elite Four!