catalog books video projects audio free content register pBook

Table of Contents

Brief Table of Contents

Table of Contents

Acknowledgments

About This Book

About the Authors

About the Cover Illustration

Chapter 1. What is reinforcement learning?

1.1. The “deep” in deep reinforcement learning

1.2. Reinforcement learning

1.3. Dynamic programming versus Monte Carlo

1.4. The reinforcement learning framework

1.5. What can I do with reinforcement learning?

1.6. Why deep reinforcement learning?

1.7. Our didactic tool: String diagrams

1.8. What’s next?

Chapter 2. Modeling reinforcement learning problems: Markov decision processes

2.1. String diagrams and our teaching methods

2.2. Solving the multi-arm bandit

2.2.1. Exploration and exploitation

2.2.2. Epsilon-greedy strategy

2.2.3. Softmax selection policy

2.3. Applying bandits to optimize ad placements

sitemap

@font-face { font-family: 'livebook'; src:url('https://d19npu3b8zepp3.cloudfront.net/assets/fonts/livebook.eot?1.9.0'); src:url('https://d19npu3b8zepp3.cloudfront.net/assets/fonts/livebook.eot?1.9.0') format('embedded-opentype'), url('https://d19npu3b8zepp3.cloudfront.net/assets/fonts/livebook.woff?1.9.0') format('woff'), url('https://d19npu3b8zepp3.cloudfront.net/assets/fonts/livebook.ttf?1.9.0') format('truetype'), url('https://d19npu3b8zepp3.cloudfront.net/assets/fonts/livebook.svg?1.9.0') format('svg'); font-weight: normal; font-style: normal; }