Chapter 2: Generating features
Chapter 4 from Machine Learning Systems by Jeff Smith.
This chapter covers
- Extracting features from raw data
- Transforming features to make them more useful
- Selecting among the features you’ve created
- Organizing feature-generation code
This chapter is the next step on our journey through the components, or phases, of a machine learning system, which were introduced in chapter 1.
In this chapter, I’ll guide you through the three main types of operations in a feature pipeline: extraction, transformation, and selection. Not every system performs all three types of operations, but every feature engineering technique can be thought of as falling into one of these three buckets. I’ll use type signatures to assign techniques to groups and give our exploration some structure, as shown in table 4.1.
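To make the idea of grouping techniques by type signature concrete, here is a minimal sketch in Python. The names `RawData`, `Feature`, `Extractor`, `Transformer`, and `Selector`, and the three example functions, are hypothetical and chosen only for illustration; the signatures are one plausible reading of the three buckets and are not necessarily the ones listed in table 4.1.

```python
from dataclasses import dataclass
from typing import Callable, FrozenSet


@dataclass(frozen=True)
class Feature:
    """A named numeric feature (hypothetical representation)."""
    name: str
    value: float


# Assumption: raw data is a plain string for the purposes of this sketch.
RawData = str

# One possible signature per bucket:
Extractor = Callable[[RawData], Feature]      # raw data in, feature out
Transformer = Callable[[Feature], Feature]    # feature in, feature out
Selector = Callable[[FrozenSet[Feature]], FrozenSet[Feature]]  # set in, subset out


def word_count(raw: RawData) -> Feature:
    """Extraction: derive a feature directly from raw data."""
    return Feature("word_count", float(len(raw.split())))


def squared(feature: Feature) -> Feature:
    """Transformation: produce a new feature from an existing one."""
    return Feature(feature.name + "_squared", feature.value ** 2)


def keep_nonzero(features: FrozenSet[Feature]) -> FrozenSet[Feature]:
    """Selection: keep only the features whose value is nonzero."""
    return frozenset(f for f in features if f.value != 0.0)


raw = "generating features from raw data"
extracted = word_count(raw)        # extraction
transformed = squared(extracted)   # transformation
empty = word_count("")             # a zero-valued feature to be filtered out
selected = keep_nonzero(frozenset({extracted, transformed, empty}))  # selection
```

Under these assumptions, a technique's bucket can be read off its signature: anything that consumes raw data is extraction, anything that maps a feature to a feature is transformation, and anything that maps a set of features to a subset of it is selection.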