Tag: neural network

  • Transformer: A Sentence Is Not a Chain

    A sentence is not a chain. It’s a web. Take “The animal didn’t cross the street because it was too tired”: what does “it” refer to? Before 2017, the dominant language models were recurrent networks, which had to read word by word to find out. The Transformer broke the chain and wove a web: every word sees every other word at once. That’s why we…

  • How Neural Networks Learn

    Paul Werbos described backpropagation in his 1974 PhD thesis. No one noticed. Twelve years later, the same algorithm was “rediscovered” and changed everything. The math didn’t change; the world did. Here’s how neural networks actually learn: by propagating errors backward and following gradients downhill.

  • Why Linear Regression Can’t Recognize a Cat

    In 1969, two MIT researchers proved that single-layer perceptrons couldn’t solve XOR, a logic problem any child can understand. Funding collapsed. The first AI winter began. But they missed one thing: stack the layers, and the world changes. The real difference between machine learning and deep learning isn’t about neural networks. It’s about who designs the…