4 Sequential Ensembles: Boosting
This chapter covers
- Training sequential ensembles of weak learners
- Implementing and understanding how AdaBoost works
- Using AdaBoost in practice
- Implementing and understanding how LogitBoost works
The ensembling strategies we have seen thus far have been parallel ensembles. These include homogeneous ensembles such as bagging and random forests (where the same base learning algorithm is used to train base estimators) and heterogeneous ensemble methods such as stacking (where different base learning algorithms are used to train base estimators).
Now, we will explore a new family of ensemble methods: sequential ensembles. Unlike parallel ensembles, which exploit the independence of the base estimators, sequential ensembles exploit the dependence between base estimators.
More specifically, during learning, sequential ensembles train a new base estimator at each step so that it minimizes the mistakes made by the base estimator trained in the previous step.
The first sequential ensemble method we will investigate is boosting. Boosting aims to combine weak learners, or “simple” base estimators, into a strong ensemble. Put another way, boosting aims to boost the performance of a collection of weak learners.
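To make the idea of sequential dependence concrete before we dive into the details, here is a minimal sketch of an AdaBoost-style loop over decision stumps. The data set, number of rounds, and variable names are chosen purely for illustration; we will work through AdaBoost carefully later in this chapter.

```python
import numpy as np
from sklearn.datasets import make_moons
from sklearn.tree import DecisionTreeClassifier

# Illustrative toy data set (hypothetical choice, not from the text)
X, y = make_moons(n_samples=200, noise=0.25, random_state=42)
y_signed = np.where(y == 1, 1, -1)            # relabel classes to {-1, +1}

n_rounds = 10
weights = np.full(len(X), 1 / len(X))         # start with uniform example weights
estimators, alphas = [], []

for t in range(n_rounds):
    # Train a weak learner (a decision stump) on the currently weighted examples
    stump = DecisionTreeClassifier(max_depth=1, random_state=t)
    stump.fit(X, y_signed, sample_weight=weights)
    pred = stump.predict(X)

    # Weighted training error and the stump's weight in the final vote
    err = np.sum(weights[pred != y_signed])
    alpha = 0.5 * np.log((1 - err) / max(err, 1e-10))

    # Increase the weights of misclassified examples so that the NEXT weak
    # learner focuses on the mistakes of the current one: this is the
    # sequential dependence between base estimators
    weights *= np.exp(-alpha * y_signed * pred)
    weights /= weights.sum()

    estimators.append(stump)
    alphas.append(alpha)

# Final ensemble prediction: a weighted vote over all the weak learners
scores = sum(a * est.predict(X) for a, est in zip(alphas, estimators))
ensemble_pred = np.sign(scores)
print("Training accuracy:", np.mean(ensemble_pred == y_signed))
```

Each stump on its own is barely better than guessing, but because every new stump is trained on examples reweighted toward the previous stump's mistakes, the weighted vote of all of them is far more accurate than any single weak learner.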