Note: Actual title is Self-Correction Bench: Revealing and Addressing the Self-Correction Blind Spot in LLMs, Ken Tsui, 2025.
Quite an interesting result. Per my chats with one of the online models, this magic token "wait" is generally applicable across all models.
learningmore•4mo ago
“We uncover a systematic failure: LLMs cannot correct errors in their own outputs while successfully correcting identical errors from external sources - a limitation we term the Self-Correction Blind Spot.”
yubblegum•4mo ago
Quite an interesting result. Per my chats with one of the online models, this magic token "wait" is generally applicable across all models.