concept `box plot` in category `R`

appears as: box plot, The box plot, box plots, box plots, The box plots

R in Action, Second Edition: Data analysis and graphics with R

This is an excerpt from Manning's book R in Action, Second Edition: Data analysis and graphics with R. Login to get full access to this book.

Listing 3.4. Fine placement of figures in a graph

opar <- par(no.readonly=TRUE) par(fig=c(0, 0.8, 0, 0.8)) plot(mtcars$mpg, mtcars$wt, #1 xlab="Miles Per Gallon", #1 ylab="Car Weight") #1 par(fig=c(0, 0.8, 0.55, 1), new=TRUE) #2 boxplot(mtcars$mpg, horizontal=TRUE, axes=FALSE) #2 par(fig=c(0.65, 1, 0, 0.8), new=TRUE) #3 boxplot(mtcars$wt, axes=FALSE) #3 mtext("Enhanced Scatterplot", side=3, outer=TRUE, line=-3) par(opar) #1: Sets up the scatter plot #2: Adds a box plot above #3: Adds a box plot to the right

copy

to see more go to 3.5.1. Creating a figure arrangement with fine control

The following sections explore the use of bar plots, pie charts, fan charts, histograms, kernel density plots, box plots, violin plots, and dot plots. Some of these may be familiar to you, whereas others (such as fan plots or violin plots) may be new to you. The goal, as always, is to understand your data better and to communicate this understanding to others. Let’s start with bar plots.

to see more go to Chapter 6. Basic graphs

where formula is a formula and dataframe denotes the data frame (or list) providing the data. An example of a formula is y ~ A, where a separate box plot for numeric variable y is generated for each value of categorical variable A. The formula y ~ A*B would produce a box plot of numeric variable y, for each combination of levels in categorical variables A and B.

to see more go to 6.5.1. Using parallel box plots to compare groups

Box plots are very versatile. By adding notch=TRUE, you get notched box plots. If two boxes’ notches don’t overlap, there’s strong evidence that their medians differ (Chambers et al., 1983, p. 62). The following code creates notched box plots for the mpg example:
boxplot(mpg ~ cyl, data=mtcars,
        notch=TRUE,
        varwidth=TRUE,
        col="red",
        main="Car Mileage Data",
        xlab="Number of Cylinders",
        ylab="Miles Per Gallon")
copy
The col option fills the box plots with a red color, and varwidth=TRUE produces box plots with widths that are proportional to their sample sizes.

to see more go to 6.5.1. Using parallel box plots to compare groups

Figure 6.13. Notched box plots for car mileage vs. number of cylinders

Finally, you can produce box plots for more than one grouping factor. Listing 6.9 provides box plots for mpg versus the number of cylinders and transmission type in an automobile (see figure 6.14). Again, you use the col option to fill the box plots with color. Note that colors recycle; in this case, there are six box plots and only two specified colors, so the colors repeat three times.

Figure 6.14. Box plots for car mileage vs. transmission type and number of cylinders

Listing 6.9. Box plots for two crossed factors

mtcars$cyl.f <- factor(mtcars$cyl, #1 levels=c(4,6,8), #1 labels=c("4","6","8")) #1 mtcars$am.f <- factor(mtcars$am, #2 levels=c(0,1), #2 labels=c("auto", "standard")) #2 boxplot(mpg ~ am.f *cyl.f, #3 data=mtcars, #3 varwidth=TRUE, #3 col=c("gold","darkgreen"), #3 main="MPG Distribution by Auto Type", #3 xlab="Auto Type", ylab="Miles Per Gallon") #3 #1: Creates a factor for the number of cylinders #2: Creates a factor for transmission type #3: Generates the box plot

copy

From figure 6.14, it’s again clear that median mileage decreases with number of cylinders. For four- and six-cylinder cars, mileage is higher for standard transmissions. But for eight-cylinder cars, there doesn’t appear to be a difference. You can also see from the widths of the box plots that standard four-cylinder and automatic eight-cylinder cars are the most common in this dataset.

to see more go to 6.5.1. Using parallel box plots to compare groups

concept box plot in category R

R in Action, Second Edition: Data analysis and graphics with R

Listing 3.4. Fine placement of figures in a graph

Figure 6.13. Notched box plots for car mileage vs. number of cylinders

Figure 6.14. Box plots for car mileage vs. transmission type and number of cylinders

Listing 6.9. Box plots for two crossed factors

Unable to load book!

concept `box plot` in category `R`