Numerical Summaries of Data: Learn It 3

Five-Number Summary

Where standard deviation is a measure of variation based on the mean, quartiles are based on the median.

quartiles

Quartiles are values that divide the data in quarters.

 

  • The first quartile, also known as [latex]Q1[/latex], can be thought of as the median of the values that lie below the median for the whole data set. It is the [latex]25[/latex]th percentile of the data set.
  • The third quartile, also known as [latex]Q3[/latex], can be thought of as the median of the values that lie above the median for the whole data set. It is the [latex]75[/latex]th percentile of the data set.

This divides the data into quarters; [latex]25\%[/latex] of the data is between the minimum and [latex]Q1[/latex], [latex]25\%[/latex] is between [latex]Q1[/latex] and the median, [latex]25\%[/latex] is between the median and [latex]Q3[/latex], and [latex]25\%[/latex] is between [latex]Q3[/latex] and the maximum value.

While quartiles are not a [latex]1[/latex]-number summary of variation like standard deviation, the quartiles are used with the median, minimum, and maximum values to form a [latex]5[/latex]-number summary of the data.

five-number summary

The five-number summary takes this form:

 

Minimum, [latex]Q1[/latex], Median, [latex]Q3[/latex], Maximum

To find the first quartile, we need to find the data value so that [latex]25\%[/latex] of the data is below it. If [latex]n[/latex] is the number of data values, we compute a locator by finding [latex]25\%[/latex] of [latex]n[/latex]. If this locator is a decimal value, we round up, and find the data value in that position. If the locator is a whole number, we find the mean of the data value in that position and the next data value. This is identical to the process we used to find the median, except we use [latex]25\%[/latex] of the data values rather than half the data values as the locator.

How To: Find the First Quartile, [latex]Q1[/latex]

  1. Begin by ordering the data from smallest to largest
  2. Compute the locator: [latex]L = 0.25n[/latex]
  3. If [latex]L[/latex] is a decimal value:
    • Round up to [latex]L+[/latex]
    • Use the data value in the [latex]L+[/latex]th position
  4. If [latex]L[/latex] is a whole number:
    • Find the mean of the data values in the [latex]L[/latex]th and [latex]L+1[/latex]th positions.

How To: Find the Third Quartile, [latex]Q3[/latex]

Use the same procedure as for [latex]Q1[/latex], but with locator: [latex]L = 0.75n[/latex]

To find minimum and maximum values of a data set, find the least and the greatest value in the data set. It might be best to order your data from smallest to the largest because you can can also find the measure of center, median, through the ordered data set as well. Recall: the median is the value that splits the data in half.

Suppose we have measured [latex]9[/latex] females, and their heights (in inches) sorted from smallest to largest are:

[latex]59, 60, 62, 64, 66, 67, 69, 70, 72[/latex]

What are the first and third quartiles?


Suppose we had measured [latex]8[/latex] females, and their heights (in inches) sorted from smallest to largest are:

[latex]59, 60, 62, 64, 66, 67, 69, 70[/latex]

What are the first and third quartiles? What is the [latex]5[/latex] number summary?


The [latex]5[/latex]-number summary combines the first and third quartile with the minimum, median, and maximum values.

What are the [latex]5[/latex]-number summaries for each of the previous [latex]2[/latex] examples?