
7 Selected Supervised Learning Algorithms


This chapter covers

  • Markov models: PageRank and hidden Markov models (HMMs)
  • Imbalanced learning: undersampling and oversampling strategies
  • Active learning: uncertainty sampling and query-by-committee strategies
  • Model selection: hyperparameter tuning
  • Ensemble methods: bagging, boosting, and stacking
  • ML research: supervised learning algorithms

In the previous two chapters, we looked at supervised algorithms for classification and regression. In this chapter, we focus on a selected set of supervised learning algorithms, chosen to give exposure to a variety of applications: time-series models used in computational finance, imbalanced learning used in fraud detection, active learning used to reduce the number of training labels, and model selection and ensemble methods used in nearly every data science competition. We conclude with ML research and exercises. Let’s begin by reviewing the fundamentals of Markov models.

7.1 Markov Models

In this section, we discuss probabilistic models for a sequence of observations. Time-series models have a wide range of applications, from computational finance to speech recognition and computational biology. We are going to look at two popular algorithms built upon the properties of Markov chains: the PageRank algorithm and the EM algorithm for hidden Markov models (HMMs).
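To make the Markov-chain connection concrete before we dive in, here is a minimal sketch of PageRank computed by power iteration. The 4-page link matrix, damping factor of 0.85, and tolerance below are illustrative assumptions, not values from the text.

```python
import numpy as np

def pagerank(adj, damping=0.85, tol=1e-9, max_iter=100):
    """Return the PageRank vector for a directed adjacency matrix.

    adj[i, j] = 1 if page i links to page j.
    """
    n = adj.shape[0]
    out_degree = adj.sum(axis=1)
    # Column-stochastic transition matrix: each page splits its rank
    # evenly among the pages it links to; a dangling page (no out-links)
    # is treated as linking to every page uniformly.
    M = np.where(out_degree[:, None] > 0,
                 adj / np.maximum(out_degree[:, None], 1),
                 1.0 / n).T
    rank = np.full(n, 1.0 / n)  # start from the uniform distribution
    for _ in range(max_iter):
        # Damped update: random surfer follows a link with prob. `damping`,
        # otherwise teleports to a uniformly random page.
        new_rank = (1 - damping) / n + damping * M @ rank
        if np.abs(new_rank - rank).sum() < tol:
            break
        rank = new_rank
    return rank

# Hypothetical 4-page web: 0->1, 0->2, 1->2, 2->0, 2->3, 3->2.
adj = np.array([[0, 1, 1, 0],
                [0, 0, 1, 0],
                [1, 0, 0, 1],
                [0, 0, 1, 0]], dtype=float)
ranks = pagerank(adj)
```

Because the damped transition matrix is stochastic, the rank vector keeps summing to 1, and the iteration converges to the chain's stationary distribution; page 2, which receives the most in-links, ends up with the highest rank.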

7.1.1 Page Rank Algorithm

7.1.2 Hidden Markov Model

7.2 Imbalanced Learning

7.2.1 Undersampling Strategies

7.2.2 Oversampling Strategies

7.3 Active Learning

7.3.1 Query Strategies

7.4 Model Selection: Hyperparameter Tuning

7.4.1 Bayesian Optimization

7.5 Ensemble Methods

7.5.1 Bagging

7.5.2 Boosting

7.5.3 Stacking

7.6 ML Research: Supervised Learning Algorithms

7.7 Exercises

7.8 Summary