Max Planck Research School for Intelligent Systems
When Models Take Shortcuts: The causes of shortcut learning in neural networks
Neuroscientists once thought they could train rats to navigate mazes by color. Rats don’t perceive colors at all. Instead, they rely on the distinct odors of different colors of paint. New work finds that neural networks are prone to this sort of misalignment between training goals and learning.