concept checkpoint in category spark
appears as: checkpoints, checkpoint, A checkpoint, checkpoint

This is an excerpt from Manning's book Spark in Action, Second Edition.
Spark will need a checkpoint directory to store its intermediate states and checkpoints (you’ll learn more about checkpoints in chapter 14). You can specify it here per output streams or globally at the
SparkSession
level by usingSparkSession.conf.set ("spark.sql.streaming.checkpointLocation",
. . .)
.
... 1995 ... 1337 Processing times Without cache ............... 3618 ms With cache .................. 2559 ms With checkpoint ............. 1860 ms With non-eager checkpoint ... 1420 ms