More expensive than Sonnet 4.5, but no comparison benchmarks. I think I’ll pass.
leerob•1h ago
We've found it to be a strong mix of speed and intelligence. It scores higher than Sonnet 4.5 on Terminal-Bench 2, maybe we will post more on this later.
fishpham•1h ago
You should! This blog post doesn't really give any reason to use it besides "it's better on Cursor's internal benchmark". A full model card would be great.
enraged_camel•25m ago
Yeah, please do. Because when the AI labs you are competing with are posting extensive benchmarks and you just say "well we used our own internal benchmark" it is a bit sus, especially given the fact that the price has tripled.
enraged_camel•1h ago
leerob•1h ago
fishpham•1h ago
enraged_camel•25m ago