Part 1 The building blocks of scalable computing

 

This part of the book covers some fundamental concepts in scalable computing to give you a good basis for understanding what makes Dask different and how it works “under the hood.”

In chapter 1, you’ll learn what a directed acyclic graph (DAG)is and why it’s useful for scaling out workloads across many different workers.

Chapter 2 explains how Dask uses DAGs as an abstraction to enable you to analyze huge datasets and take advantage of scalability and parallelism whether you’re running your code on a laptop or a cluster of thousands of machines.

Once you’ve completed part 1, you’ll have a basic understanding of the internals of Dask, and you’ll be ready to get some hands-on experience with a real dataset.