In the last chapter, you learned how to work with a fundamental Azure service, the Storage account. Storage accounts provide nearly unlimited storage for many Azure services, with high throughput and high redundancy. Storage accounts also host other file-based services, such as file shares and queues.
In this chapter, you’ll learn about another storage service, Azure Data Lake Storage (ADLS). You’ll create a Data Lake store and learn how to structure your data lake to increase maintainability and security. You’ll learn how this service supports other Azure services through Azure Active Directory authentication. This will be the central service around which you construct the analytics system.
ADLS resembles a local file system, with folders and files. Azure Active Directory (AAD) controls access to folders and files, with assignable read/write/execute permissions. ADLS provides the primary storage backbone for the master data set, a source of data for batch layer processing. ADLS also stores batch analysis artifacts, including the report files that make up the output of the Serving layer (see figure 4.1).