Has anyone tried the latest and greatest models of both camps, with the highest thinking level and maximum possible context window setting, and compared performances and observed patterns / specific behaviors which make you choose one over the other? [Of course, everyone's mileage varies, but still want to gather insights from folks who have the privilege to be able to use both extensively]
I'm talking about $200 versions of both.
I couldn't find any such detail over the web for the *present best* versions of both camps, and my current usage / repository is too narrow to provide me with meaningful comparison unless I try sending each task to both at the same time. Any inputs would be greatly appreciated, thank you!