Are they all in roughly the same range now (for example, around 1T params, maybe MoE), or are the closed models still much bigger?
Also curious about “pro” versions like GPT-5.4 Pro: is that likely a different model, or mostly the same model with more inference-time compute, longer reasoning, or better orchestration?