chapter eleven
11 Learning through representation: LeCun, Bengio, Hinton, and the mathematics of neural networks
This chapter covers
- Yann LeCun, Yoshua Bengio, and Geoffrey Hinton’s Deep learning (2015), which cemented neural networks as the dominant AI framework
- The evolution from biologically inspired neural models to modern deep learning systems
- How deep learning integrates foundational ideas into a unified learning framework
- Why representation learning replaced handcrafted features and fixed kernels
- How depth allows neural networks to learn hierarchical representations
Neural networks are not a recent invention. Their origins stretch back to the middle of the twentieth century, when researchers first asked whether simple mathematical units, wired together, might reproduce aspects of biological intelligence. Early models were elegant and ambitious, but their limitations quickly became apparent. What followed was a long period of skepticism—punctuated by brief resurgences—during which neural networks were often dismissed as unstable, opaque, or mathematically unserious. By the late 1990s, many of the most influential advances in machine learning came from elsewhere: margin-based classifiers, probabilistic models, and carefully engineered features grounded in statistical theory.