7 Selected Supervised Learning Algorithms
This chapter covers
- Markov Models: Page Rank and HMM
- Imbalanced Learning: Undersampling and Oversampling strategies
- Active Learning: Uncertainty Sampling and Query by Committee Strategies
- Model Selection: Hyperparameter Tuning
- Ensemble Methods: Bagging, Boosting, and Stacking
- ML Research: Supervised Learning Algorithms
In the previous two chapters, we looked at supervised algorithms for classification and regression. In this chapter, we focus on a selected set of supervised learning algorithms. The algorithms are selected to give exposure to a variety of applications: from time-series models used in computational finance to imbalanced learning used in fraud detection, to active learning used to reduce the number of training labels, to model selection and ensemble methods used in all data science competitions. Finally, we conclude with ML research and exercises. Let’s begin by reviewing the fundamentals of Markov models.
7.1 Markov Models
In this section, we discuss probabilistic models for a sequence of observations. Timeseries models have a wide range of applications from computational finance to speech recognition to computational biology. We are going to look at two popular algorithms built upon the properties of Markov chains: the page rank algorithm and the EM algorithm for Hidden Markov Models (HMMs).