Video: Understanding why XOR requires multiple layers (2:34)
The Puzzle That Stumped Early AI
In 1969, Marvin Minsky and Seymour Papert published a book called "Perceptrons" that had a devastating impact on neural network research. They proved something shocking: a single perceptron cannot solve the XOR problem.
💡 What is XOR?
XOR (exclusive OR) outputs 1 when exactly one of the inputs is 1. It's like asking "Is one or the other true, but not both?"
The XOR Truth Table
Notice the pattern: the 1s form a diagonal!
Why Can't a Single Line Work?
Try to draw a single straight line that separates the red dots (output=1) from the blue dots (output=0). It's impossible!
The Key Insight
XOR is not linearly separable. No matter how you rotate or position a single straight line, you cannot separate the positive cases (0,1 and 1,0) from the negative cases (0,0 and 1,1).
The Solution: Multiple Lines
While one line can't solve XOR, two lines can! This is the foundation of multi-layer networks.
🔑 The Multi-Layer Trick
First layer: Two perceptrons each draw a line
Second layer: A perceptron combines the outputs
This insight - that stacking layers allows us to solve non-linear problems - is what made the deep learning revolution possible decades later.