Using cloud sandboxes to run them in I tested: - A single coding agent just told to make a better parser - An agent told to write a better parser within the constraints of tests/benchmarks - An agent swarm that self-improved the premise with extra tests/benchmarks in order to more "truly" write a better parser
The results were a success! I was able to end up with both performance (up to 3.07× faster) and memory (up to 5.75× less) in locally runnable benchmarks.