Achievable logoAchievable logo
Praxis Core: Math (5733)
Sign in
Sign up
Purchase
Textbook
Practice exams
Support
How it works
Exam catalog
Mountain with a flag at the peak
Textbook
Introduction
1. Number and quantity
2. Data analysis, statistics, and probability
2.1 Understanding central tendencies
2.2 Understanding and representing data
2.3 Interpreting data
2.4 Interpreting scatterplots
2.5 Computing probabilities
3. Algebra and geometry
Wrapping up
Achievable logoAchievable logo
2.3 Interpreting data
Achievable Praxis Core: Math (5733)
2. Data analysis, statistics, and probability
Our Praxis Core: Math course is currently in development and is a work-in-progress.

Interpreting data

8 min read
Font
Discuss
Share
Feedback

Spotting patterns, trends, and outliers

You’ll often use data displays to spot overall shapes and patterns and to notice values that stand out. You’ll also use the five-number summary - minimum, Q1​, median (Q2​), Q3​, and maximum - to describe a data set and create a boxplot. These tools help you summarize data, identify clusters or gaps, and support conclusions with specific evidence.

Definitions
Cluster
A concentration of data values within a specific range, indicating a subgroup or phase in the data. Clusters show where values tend to group together and can highlight common behaviors or conditions.
Outlier
A data value that is substantially higher or lower than the rest of the data set. You can often spot an outlier by eye as a value lying far from the main grouping, and you can verify it mathematically by computing Q1​ and Q3​, then IQR=Q3​−Q1​, and checking if the value is greater than Q3​+1.5×IQR or less than Q1​−1.5×IQR.
Trend
A general direction in which data values are moving over time or across categories. Trends can be increasing, decreasing, or constant, and they help reveal long-term patterns in a data set.

Patterns and trends describe how data values behave as a group - for example, steadily rising, steadily falling, or clustering around certain values. Outliers matter because they can distort summary measures like the mean or the range.

A box-and-whisker plot (or boxplot) summarizes a data set using five key numbers: minimum, first quartile Q1​, median Q2​, third quartile Q3​, and maximum values. It shows the center and spread of the distribution.

In a boxplot, outliers are data values that lie far from the rest of the dataset. Formally:

Outlier if: x>Q3​+1.5×IQRorx<Q1​−1.5×IQR

where IQR=Q3​−Q1​.

  • Values beyond this range are often plotted individually as points or small circles.
  • Outliers can indicate unusual cases, errors in data collection, or natural variability.

Finding the five-number summary

Take the following data set:

65,68,71,74,77,82,82,89

  • The values below show the five-number summary for this data set. (If you need a refresher on how each value is found, refer back to section 1 of this unit.)

  • Minimum (Min) = 65

  • Maximum (Max) = 89

  • Median (Q2​) = 75.5

  • First quartile (Q1​) = 69.5

  • Third quartile (Q3​) = 82

Visual interpretation

Some problems ask you to choose the correct box-and-whisker plot (also called a box plot) for a given data set. To do that, you need to know how the five-number summary maps onto the picture.

  • A box plot can be drawn horizontally or vertically.
  • In a horizontal box plot, the minimum is on the left and the maximum is on the right.
  • In a vertical box plot, the minimum is at the bottom and the maximum is at the top.

Let’s sketch one using the five-number summary from the previous example.

Example: Drawing a five-number summary

Suppose you want to manually construct the boxplot.

To create a box plot, start by ordering the data set:

65,68,71,74,77,82,82,89

To draw the box plot:

  • Draw a box from Q1​=69.5 to Q3​=82
  • Place a vertical line inside the box at the median Q2​=75.5
  • Extend whiskers from the box out to the minimum (65) and maximum (89)

Answer: Draw the box from 69.5 to 82, place the median line at 75.5, and draw whiskers to 65 and 89.

Boxplot of quiz scores
Boxplot of quiz scores

Example: Identify the outlier and compare means

10,12,15,14,13,100,16,14

Find the following:

  • The outlier
  • The mean of all eight values
  • The mean of the seven typical values excluding the outlier
(spoiler)

Order the data: 10,12,13,14,14,15,16,100

  • First we need to compute the IQR by finding the corresponding Q1​ and Q3​ values.
  • Q1​=212+13​=12.5
  • Q3​=215+16​=15.5
  • Recall we find the IQR by taking the Q1​ and subtracting it from the Q3​ value. IQR=15.5−12.5=3

By the definition of outliers, if the number in the data set exceeds the sum of Q3​ and 1.5×IQR then it is an outlier.

  • Outlier cutoff: Q3​+1.5×IQR=15.5+(1.5×3)=15.5+4.5=20 Since 100>20, it is an outlier.
  • Mean of all values: 810+12+13+14+14+15+16+100​=8194​=24.25
  • Mean without outlier: 794​≈13.43

Answer: The outlier is 100. The mean of all values is 24.25, and the mean without the outlier is approximately 13.43.

  • Why this matters: The outlier inflates the mean by over 80%, which can hide what’s typical in the data. For example, in home prices, a few multi million dollar mansions can pull the mean upward, which is one reason real estate often reports the middle value instead.

Example: Detect trend, cluster, and outlier Given the following data set describe cluster(s), trend, or outlier(s) if applicable.

5,7,8,9,10,15,16,17,20,21

(spoiler)
  • Two clusters: 5-10 and 15-21
    • Could also argue three clusters 5-10, 15-17, and 20-21

Given the data set: 5,7,8,9,10,15,16,17,20,21

  • The data is already ordered from least to greatest.

  • Median (Q2​): There are 10 values in the data set. Since 10 is even, the median is the average of the 5th and 6th values: Q2​=210+15​=225​=12.5

  • Lower quartile (Q1​): The lower half of the data includes the values below the median: 5,7,8,9,10 This set has 5 values, so Q1​ is the middle (third) value: Q1​=8

  • Upper quartile (Q3​): The upper half of the data includes the values above the median: 15,16,17,20,21 This set also has 5 values, so Q3​ is the middle (third) value: Q3​=17

  • Interquartile range (IQR): IQR=Q3​−Q1​=17−8=9

To determine if any values in the data set are outliers, we use the 1.5×IQR rule.

We already found:

  • Q1​=8
  • Q3​=17
  • IQR=Q3​−Q1​=17−8=9

Now calculate the lower and upper bounds for outliers:

  • Lower bound = Q1​−1.5×IQR=8−1.5×9=8−13.5=−5.5
  • Upper bound = Q3​+1.5×IQR=17+1.5×9=17+13.5=30.5

Any value below −5.5 or above 30.5 is considered an outlier. Since all values in the data set fall between 5 and 21, there are no outliers.

Answer: The data show two clusters (5-10 and 15-21) and no outliers.

Why this matters: Recognizing clusters can reveal meaningful subgroups, and confirming that there are no outliers helps you trust that your summaries represent the full data set.

Justifying conclusions with data

Strong conclusions point to specific numbers, trends, or features in the display, and they avoid claims the data can’t support.

Example: Trend description and justification Monthly website visits are shown below:

Month Visits
Jan 1200
Feb 1500
Mar 1800

Write a sentence describing the trend and justify it with data.

(spoiler)

Visits rose by exactly 300 each month from 1200 in January to 1800 in March, indicating a consistent upward trend likely driven by marketing or seasonal factors.

Answer: Visits increased by 300 each month from January to March.

Why this matters: Using exact changes (like “up 300 each month”) keeps your conclusion precise and tied directly to the data.

  • Look for overall patterns, trends, clusters, or cycles before focusing on individual values
  • Identify any values that stand apart and confirm them as outliers using the 1.5×IQR rule
  • Use the five-number summary to describe the spread and center of a data set, and represent it visually with a boxplot
  • Anchor conclusions in specific numeric changes or clear features from the display
  • Avoid claims that extend beyond what the data actually show

Sign up for free to take 5 quiz questions on this topic

All rights reserved ©2016 - 2026 Achievable, Inc.

Interpreting data

Spotting patterns, trends, and outliers

You’ll often use data displays to spot overall shapes and patterns and to notice values that stand out. You’ll also use the five-number summary - minimum, Q1​, median (Q2​), Q3​, and maximum - to describe a data set and create a boxplot. These tools help you summarize data, identify clusters or gaps, and support conclusions with specific evidence.

Definitions
Cluster
A concentration of data values within a specific range, indicating a subgroup or phase in the data. Clusters show where values tend to group together and can highlight common behaviors or conditions.
Outlier
A data value that is substantially higher or lower than the rest of the data set. You can often spot an outlier by eye as a value lying far from the main grouping, and you can verify it mathematically by computing Q1​ and Q3​, then IQR=Q3​−Q1​, and checking if the value is greater than Q3​+1.5×IQR or less than Q1​−1.5×IQR.
Trend
A general direction in which data values are moving over time or across categories. Trends can be increasing, decreasing, or constant, and they help reveal long-term patterns in a data set.

Patterns and trends describe how data values behave as a group - for example, steadily rising, steadily falling, or clustering around certain values. Outliers matter because they can distort summary measures like the mean or the range.

A box-and-whisker plot (or boxplot) summarizes a data set using five key numbers: minimum, first quartile Q1​, median Q2​, third quartile Q3​, and maximum values. It shows the center and spread of the distribution.

In a boxplot, outliers are data values that lie far from the rest of the dataset. Formally:

Outlier if: x>Q3​+1.5×IQRorx<Q1​−1.5×IQR

where IQR=Q3​−Q1​.

  • Values beyond this range are often plotted individually as points or small circles.
  • Outliers can indicate unusual cases, errors in data collection, or natural variability.

Finding the five-number summary

Take the following data set:

65,68,71,74,77,82,82,89

  • The values below show the five-number summary for this data set. (If you need a refresher on how each value is found, refer back to section 1 of this unit.)

  • Minimum (Min) = 65

  • Maximum (Max) = 89

  • Median (Q2​) = 75.5

  • First quartile (Q1​) = 69.5

  • Third quartile (Q3​) = 82

Visual interpretation

Some problems ask you to choose the correct box-and-whisker plot (also called a box plot) for a given data set. To do that, you need to know how the five-number summary maps onto the picture.

  • A box plot can be drawn horizontally or vertically.
  • In a horizontal box plot, the minimum is on the left and the maximum is on the right.
  • In a vertical box plot, the minimum is at the bottom and the maximum is at the top.

Let’s sketch one using the five-number summary from the previous example.

Example: Drawing a five-number summary

Suppose you want to manually construct the boxplot.

To create a box plot, start by ordering the data set:

65,68,71,74,77,82,82,89

To draw the box plot:

  • Draw a box from Q1​=69.5 to Q3​=82
  • Place a vertical line inside the box at the median Q2​=75.5
  • Extend whiskers from the box out to the minimum (65) and maximum (89)

Answer: Draw the box from 69.5 to 82, place the median line at 75.5, and draw whiskers to 65 and 89.

Example: Identify the outlier and compare means

10,12,15,14,13,100,16,14

Find the following:

  • The outlier
  • The mean of all eight values
  • The mean of the seven typical values excluding the outlier
(spoiler)

Order the data: 10,12,13,14,14,15,16,100

  • First we need to compute the IQR by finding the corresponding Q1​ and Q3​ values.
  • Q1​=212+13​=12.5
  • Q3​=215+16​=15.5
  • Recall we find the IQR by taking the Q1​ and subtracting it from the Q3​ value. IQR=15.5−12.5=3

By the definition of outliers, if the number in the data set exceeds the sum of Q3​ and 1.5×IQR then it is an outlier.

  • Outlier cutoff: Q3​+1.5×IQR=15.5+(1.5×3)=15.5+4.5=20 Since 100>20, it is an outlier.
  • Mean of all values: 810+12+13+14+14+15+16+100​=8194​=24.25
  • Mean without outlier: 794​≈13.43

Answer: The outlier is 100. The mean of all values is 24.25, and the mean without the outlier is approximately 13.43.

  • Why this matters: The outlier inflates the mean by over 80%, which can hide what’s typical in the data. For example, in home prices, a few multi million dollar mansions can pull the mean upward, which is one reason real estate often reports the middle value instead.

Example: Detect trend, cluster, and outlier Given the following data set describe cluster(s), trend, or outlier(s) if applicable.

5,7,8,9,10,15,16,17,20,21

(spoiler)
  • Two clusters: 5-10 and 15-21
    • Could also argue three clusters 5-10, 15-17, and 20-21

Given the data set: 5,7,8,9,10,15,16,17,20,21

  • The data is already ordered from least to greatest.

  • Median (Q2​): There are 10 values in the data set. Since 10 is even, the median is the average of the 5th and 6th values: Q2​=210+15​=225​=12.5

  • Lower quartile (Q1​): The lower half of the data includes the values below the median: 5,7,8,9,10 This set has 5 values, so Q1​ is the middle (third) value: Q1​=8

  • Upper quartile (Q3​): The upper half of the data includes the values above the median: 15,16,17,20,21 This set also has 5 values, so Q3​ is the middle (third) value: Q3​=17

  • Interquartile range (IQR): IQR=Q3​−Q1​=17−8=9

To determine if any values in the data set are outliers, we use the 1.5×IQR rule.

We already found:

  • Q1​=8
  • Q3​=17
  • IQR=Q3​−Q1​=17−8=9

Now calculate the lower and upper bounds for outliers:

  • Lower bound = Q1​−1.5×IQR=8−1.5×9=8−13.5=−5.5
  • Upper bound = Q3​+1.5×IQR=17+1.5×9=17+13.5=30.5

Any value below −5.5 or above 30.5 is considered an outlier. Since all values in the data set fall between 5 and 21, there are no outliers.

Answer: The data show two clusters (5-10 and 15-21) and no outliers.

Why this matters: Recognizing clusters can reveal meaningful subgroups, and confirming that there are no outliers helps you trust that your summaries represent the full data set.

Justifying conclusions with data

Strong conclusions point to specific numbers, trends, or features in the display, and they avoid claims the data can’t support.

Example: Trend description and justification Monthly website visits are shown below:

Month Visits
Jan 1200
Feb 1500
Mar 1800

Write a sentence describing the trend and justify it with data.

(spoiler)

Visits rose by exactly 300 each month from 1200 in January to 1800 in March, indicating a consistent upward trend likely driven by marketing or seasonal factors.

Answer: Visits increased by 300 each month from January to March.

Why this matters: Using exact changes (like “up 300 each month”) keeps your conclusion precise and tied directly to the data.

Key points
  • Look for overall patterns, trends, clusters, or cycles before focusing on individual values
  • Identify any values that stand apart and confirm them as outliers using the 1.5×IQR rule
  • Use the five-number summary to describe the spread and center of a data set, and represent it visually with a boxplot
  • Anchor conclusions in specific numeric changes or clear features from the display
  • Avoid claims that extend beyond what the data actually show