7 Attention

 

This chapter covers:

  • How to implement attention in multilayer perceptrons (static attention) and in LSTMs (temporal attention); a minimal sketch of the core computation follows this list.
  • How attention can improve the performance of a deep learning model.
  • How attention helps explain a model's outcomes by highlighting attention patterns over the input data.
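Whatever the underlying network, the core of an attention mechanism is the same three steps: score each input element against a query, normalize the scores into weights, and take the weighted sum of the inputs. The sketch below shows this for basic dot-product attention. It uses plain NumPy to stay framework-neutral, and the function and variable names are illustrative only, not taken from this chapter's code listings.

import numpy as np

def softmax(x):
    # Numerically stable softmax over the last axis.
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def attention(query, keys, values):
    # query:  (d,)    what the model is looking for
    # keys:   (T, d)  one key per input element (e.g., time step)
    # values: (T, v)  one value per input element
    scores = keys @ query        # (T,) similarity of the query to each key
    weights = softmax(scores)    # (T,) attention weights, summing to 1
    context = weights @ values   # (v,) weighted sum of the values
    return context, weights

# Toy example: 4 time steps with 3-dimensional keys and values.
rng = np.random.default_rng(0)
keys = rng.normal(size=(4, 3))
values = rng.normal(size=(4, 3))
query = rng.normal(size=3)
context, weights = attention(query, keys, values)
print(weights)  # larger weights mark the steps the model attends to

The returned weights are exactly what makes attention useful for explanation: because they sum to 1 over the input elements, they can be read as a distribution over which parts of the input drove the output.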

The following figure shows how this chapter is organized:

Figure 7.1. Chapter organization.

7.1  Neural attention


7.2  Data

7.3  Static attention: MLP

7.4  Temporal attention: LSTM

7.4.1  Experiments

7.5  Summary

7.6  Further reading