chapter six
6 Leveraging information theory with entropy-based policies
This chapter covers
- Entropy as an information theoretic measure of uncertainty
- Information gain as a method of reducing entropy
- Bayesian optimization policies that leverage information theory for their search
We saw in Chapter 4 that by aiming to improve from the best value that we have seen so far, we can design improvement-based Bayesian optimization (BayesOpt) policies such as Probability of Improvement and Expected Improvement. In Chapter 5, we leveraged multi-armed bandit policies to obtain Upper Confidence Bound and Thompson sampling, each of which uses a unique heuristic to balance between exploration and exploitation in the search for the global optimum of the objective function.