> The model, called VibeThinker-3B, scored 94.3 on AIME 2026 — the American Invitational Mathematics Examination, one of the most demanding standardized math competitions in the world. That figure places it alongside DeepSeek V3.2, a model with 671 billion parameters
Overfitting, no need to argue about anything I think?
The rest of the article seems to echoing people's misunderstanding of pretty elementary stuff.
embedding-shape•24m ago
Overfitting, no need to argue about anything I think?
The rest of the article seems to echoing people's misunderstanding of pretty elementary stuff.