In the last chapter, we examined and built a convolutional neural network (CNN). We even combined it with the LSTM architecture to test whether we could outperform the LSTM models. The results were mixed: the CNN models performed worse as single-step models, best as multi-step models, and on par with the LSTM as multi-output models.
Now we’ll focus entirely on the multi-step models, as all of them output the entire sequence of predictions in a single shot. We’re going to modify that behavior so the model generates the prediction sequence gradually, feeding each prediction back in as an input to produce the next one. In effect, the model creates rolling forecasts, but uses its own predictions rather than observed values to inform each subsequent output.
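The feedback loop described above can be sketched in plain Python before we build it in Keras. This is a minimal illustration, not the Keras implementation: `autoregressive_forecast` and the toy one-step model are hypothetical names, and the toy model simply averages its input window as a stand-in for a trained LSTM cell.

```python
import numpy as np

def autoregressive_forecast(predict_one_step, history, n_steps):
    """Roll the forecast forward one step at a time, feeding each
    prediction back into the input window (hypothetical helper)."""
    window = list(history)
    preds = []
    for _ in range(n_steps):
        next_val = predict_one_step(np.array(window))
        preds.append(next_val)
        # Slide the window: drop the oldest value, append the new prediction
        window = window[1:] + [next_val]
    return preds

# Toy one-step "model": predicts the mean of the window.
# A real ARLSTM would replace this with a trained LSTM's one-step output.
toy_model = lambda w: float(w.mean())

forecast = autoregressive_forecast(toy_model, [1.0, 2.0, 3.0], n_steps=3)
print(forecast)
```

Note that each new prediction depends on earlier predictions, so any error made early in the sequence propagates forward; this is the main trade-off of the autoregressive approach.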
This architecture is commonly used with LSTM and is called autoregressive LSTM (ARLSTM). In this chapter, we’ll first explore the general architecture of the ARLSTM model, and then we’ll build it in Keras to see whether it becomes our new top-performing multi-step model.