I can rule out the secret sauce being:
- system prompt (I've changed it, and performance is still awesome)
- underlying model (works great with models that other harnesses suck with)
- tools, skills, etc. (all generic)
Is there a chance Pi is good because it is just... not a lot of "cognitive effort" for the model?