Note: Actual title is Self-Correction Bench: Revealing and Addressing the Self-Correction Blind Spot in LLMs, Ken Tsui, 2025.
learningmore•1h ago
“We uncover a systematic failure: LLMs cannot correct errors in their own outputs while successfully correcting identical errors from external sources - a limitation we term the Self-Correction Blind Spot.”
yubblegum•2h ago
Quite an interesting result. Per my chats with one of the online models, this magic token "wait" is generally applicable across all models.