5 Probability distributions in machine learning

 

This chapter covers

  • The role of probability distributions in machine learning
  • Working with binomial, multinomial, categorical, Bernoulli, beta, and Dirichlet distributions
  • The significance of entropy and cross-entropy in machine learning

Life often requires us to estimate the chances of an event occurring or make a decision in the face of uncertainty. Probability and statistics form the common toolbox to use in such circumstances. In machine learning, we take large feature vectors as inputs. As stated earlier, we can view these feature vectors as points in a high-dimensional space. For instance, gray-level images of size 224 × 224 can be viewed as points in a 50, 176-dimensional space, with each pixel corresponding to a specific dimension. Inputs with common characteristics, such as images of animals, will correspond to a cluster of points in that space. Probability distributions provide an effective tool for analyzing such loosely structured point distributions in arbitrarily high-dimensional spaces. Instead of simply developing a machine that emits a class given an input, we can fit a probability distribution to the clusters of input points (or a transformed version of them) satisfying some property of interest. This often lends more insight into the problem we are trying to solve.

5.1 Probability: The classical frequentist view

 
 

5.1.1 Random variables

 
 
 

5.1.2 Population histograms

 
 
 

5.2 Probability distributions

 
 
 

5.3 Basic concepts of probability theory

 
 
 

5.3.1 Probabilities of impossible and certain events

 

5.3.2 Exhaustive and mutually exclusive events

 
 
 
 

5.3.3 Independent events

 
 

5.4 Joint probabilities and their distributions

 
 
 
 

5.4.1 Marginal probabilities

 
 
 
 

5.4.2 Dependent events and their joint probability distribution

 
 

5.5 Geometrical view: Sample point distributions for dependent and independent variables

 
 
 
 

5.6 Continuous random variables and probability density

 
 
 

5.7 Properties of distributions: Expected value, variance, and covariance

 
 

5.7.1 Expected value (aka mean)

 

5.7.2 Variance, covariance, and standard deviation

 
 
 
 

5.8 Sampling from a distribution

 
 

5.9 Some famous probability distributions

 
 
 

5.9.1 Uniform random distributions

 

5.9.2 Gaussian (normal) distribution

 
 
 
 

5.9.3 Binomial distribution

 
 

5.9.4 Multinomial distribution

 
 
sitemap

Unable to load book!

The book could not be loaded.

(try again in a couple of minutes)

manning.com homepage