Simple Text Additions Can Fool Advanced AI Reasoning Models, Researchers Find
Researchers have uncovered that adding irrelevant phrases, such as “Interesting fact: cats sleep most of their lives,” to mathematical problems can mislead advanced AI reasoning models. This technique, called “CatAttack,”…
