19 Data standardization

 

This chapter covers

  • Methods of data standardization
  • Z-score method
  • Standard deviation method
  • Centering method
  • Range method
  • Coloring data frames and highlighting observations
  • Comparing data sets

It’s not often that an NBA player scores 50 or more points in a game. In fact, during the 2021-22 regular season (1,230 games), it happened only 12 times. The top five scorers in NBA history—Lebron James (still playing), Kareem Abdul-Jabbar, Karl Malone, Kobe Bryant, and Michael Jordan—scored at least 50 points in a regular season game 73 times over a grand total of 93 seasons between them.

19.1 Loading a package

19.2 Importing and viewing data

19.3 Wrangling data

19.3.1 Treating duplicate records

19.3.2 Final trimmings

19.4 Standardizing data

19.4.1 Z-score method

19.4.2 Standard deviation method

19.4.3 Centering method

19.4.4 Range method

Summary