2 Creating Your First Data Pipeline
This chapter covers
- Staging a file for ingestion into Snowflake
- Loading data from a staged file into a staging table
- Merging data from a staging table into the target table
- Transforming data with SQL
- Automating the pipeline with a task
In this chapter, you will build your first data pipeline in Snowflake. We will ingest data from a CSV file, transform it from its raw form into the target data model required for reporting, and automate the pipeline with a task. To keep things simple at first, this chapter leaves out the supporting components introduced in Chapter 1, such as security, data governance, software engineering, and DataOps; subsequent chapters cover these topics in more detail. The goal here is a simple pipeline that illustrates the core Snowflake functionality needed to ingest data from a file and transform it.