Achievable logoAchievable logo
Praxis Core: Math (5733)
Sign in
Sign up
Purchase
Textbook
Practice exams
Support
How it works
Exam catalog
Mountain with a flag at the peak
Textbook
Introduction
1. Number and quantity
2. Data analysis, statistics, and probability
2.1 Understanding central tendencies
2.2 Understanding and representing data
2.3 Interpreting data
2.4 Interpreting scatterplots
2.5 Computing probabilities
3. Algebra and geometry
Wrapping up
Achievable logoAchievable logo
2.3 Interpreting data
Achievable Praxis Core: Math (5733)
2. Data analysis, statistics, and probability
Our Praxis Core: Math course is currently in development and is a work-in-progress.

Interpreting data

7 min read
Font
Discuss
Share
Feedback

Spotting patterns, trends, and outliers

A box-and-whisker plot (or boxplot) summarizes a data set using five key numbers: minimum, first quartile Q1​, median Q2​, third quartile Q3​, and maximum. It shows the center and spread of the distribution.

Outliers are values that lie far from the rest of the data. The IQR (interquartile range) is Q3​−Q1​. The 1.5×IQR rule is the standard method for deciding whether a value qualifies. A value x is an outlier if:

Outlier if: x>Q3​+1.5×IQRorx<Q1​−1.5×IQR

  • Values beyond this range are often plotted individually as points or small circles.
  • When outliers are present, the whiskers extend to the most extreme non-outlier values (the adjacent values inside the fence), not to the actual data minimum or maximum.
  • Outliers can indicate unusual cases, errors in data collection, or natural variability.
Definitions
Cluster
A concentration of data values within a specific range, indicating a subgroup or phase in the data. Clusters show where values tend to group together and can highlight common behaviors or conditions.
Outlier
A data value that is substantially higher or lower than the rest of the data set.
Trend
A general direction in which data values are moving over time or across categories. Trends can be increasing, decreasing, or constant, and they help reveal long-term patterns in a data set.

Patterns and trends describe how data values behave as a group - for example, steadily rising, steadily falling, or clustering around certain values. Outliers matter because they can distort summary measures like the mean or the range.

Finding the five-number summary

To find Q1​ and Q3​: order the data, locate the median, then split into a lower half and an upper half. If n is odd, exclude the median from both halves; if n is even, split evenly. Q1​ is the median of the lower half and Q3​ is the median of the upper half.

Take the following data set:

65,68,71,74,77,82,82,89

There are n=8 values, already ordered. The median is the average of the 4th and 5th values:

Q2​=274+77​=75.5

The lower half is 65,68,71,74, so:

Q1​=268+71​=69.5

The upper half is 77,82,82,89, so:

Q3​=282+82​=82

This gives the five-number summary:

  • Minimum (Min) = 65
  • First quartile (Q1​) = 69.5
  • Median (Q2​) = 75.5
  • Third quartile (Q3​) = 82
  • Maximum (Max) = 89
Sidenote
Watch out

Always order the data before computing any quartile. A common mistake is reading Q1​ as the smallest value in the lower half - it’s actually the median of the lower half.

Visual interpretation

Some problems ask you to choose the correct box-and-whisker plot (also called a boxplot) for a given data set. To do that, you need to know how the five-number summary maps onto the picture.

The box spans from Q1​ to Q3​, the median Q2​ is marked by a line inside the box, and whiskers extend from the box out to the minimum and maximum. Boxplots can be drawn horizontally or vertically; the box still spans from Q1​ to Q3​ with the median line inside - only the axis orientation changes.

Here’s what the five-number summary from the previous example looks like as a boxplot. Since no values fall outside the 1.5×IQR fences, the whiskers extend all the way to the actual minimum and maximum.

Boxplot of quiz scores
Boxplot of quiz scores

Example: Identify the outlier and compare means

10,12,15,14,13,100,16,14

Find the following:

  • The outlier
  • The mean of all eight values
  • The mean of the seven typical values excluding the outlier
(spoiler)

Order the data: 10,12,13,14,14,15,16,100

First, compute Q1​ and Q3​ to find the IQR.

  • Q1​=212+13​=12.5
  • Q3​=215+16​=15.5
  • IQR=Q3​−Q1​=15.5−12.5=3

Now apply the 1.5×IQR rule. A value is an outlier if it exceeds Q3​+1.5×IQR:

  • Upper outlier bound: Q3​+1.5×IQR=15.5+(1.5×3)=15.5+4.5=20

Since 100>20, it is an outlier.

  • Mean of all values: 810+12+13+14+14+15+16+100​=8194​=24.25
  • Mean without outlier: 794​≈13.43

Answer: The outlier is 100. The mean of all values is 24.25, and the mean without the outlier is approximately 13.43.

Example: Detect trend, clusters, and outliers

Given the following data set, describe any clusters, trends, or outliers.

5,7,8,9,10,15,16,17,20,21

(spoiler)

The data is already ordered from least to greatest.

Clusters: There are two visible clusters - one from 5 to 10 and another from 15 to 21, with a gap in between.

Outliers: All values fall between 5 and 21 with no extreme gaps from the group, so there are no outliers.

Answer: The data show two clusters (5-10 and 15-21) and no outliers.

Justifying conclusions with data

Strong conclusions point to specific numbers, trends, or features in the display, and they avoid claims the data can’t support. When describing a trend, name the direction and support it with at least one specific value - for example, “visits increased by 300 each month from January (1,200) to March (1,800).”

Correlation is not causation. A trend or association between two variables tells you they move together - it doesn’t tell you that one causes the other. Describing a trend accurately is fine; claiming a cause-and-effect relationship requires more than a pattern in the data.

Sidenote
Choosing the right display

Use the right display for your data type - this is a frequently tested skill.

  • Boxplot - numerical data; best for showing distribution and spotting outliers (e.g., comparing test score spreads across two classes)
  • Histogram - numerical data; best when the shape of the distribution matters. A symmetric histogram has roughly equal tails on both sides; a right-skewed histogram has a longer tail on the right (a few high values pull the mean up); a left-skewed histogram has a longer tail on the left.
  • Bar graph - categorical data; compares counts or values across distinct groups (e.g., favorite subject by number of students)
  • Line graph - data that changes over time; connects points to show trends (e.g., monthly rainfall over a year)
  • Scatterplot - two numerical variables; reveals relationships or correlations (e.g., hours studied vs. exam score)
  • Look for overall patterns, trends, clusters, or cycles before focusing on individual values
  • Identify any values that stand apart and confirm them as outliers using the 1.5×IQR rule
  • Use the five-number summary to describe the spread and center of a data set, and represent it visually with a boxplot
  • Anchor conclusions in specific numeric changes or clear features from the display
  • Avoid claims that extend beyond what the data actually show

Sign up for free to take 5 quiz questions on this topic

All rights reserved ©2016 - 2026 Achievable, Inc.

Interpreting data

Spotting patterns, trends, and outliers

A box-and-whisker plot (or boxplot) summarizes a data set using five key numbers: minimum, first quartile Q1​, median Q2​, third quartile Q3​, and maximum. It shows the center and spread of the distribution.

Outliers are values that lie far from the rest of the data. The IQR (interquartile range) is Q3​−Q1​. The 1.5×IQR rule is the standard method for deciding whether a value qualifies. A value x is an outlier if:

Outlier if: x>Q3​+1.5×IQRorx<Q1​−1.5×IQR

  • Values beyond this range are often plotted individually as points or small circles.
  • When outliers are present, the whiskers extend to the most extreme non-outlier values (the adjacent values inside the fence), not to the actual data minimum or maximum.
  • Outliers can indicate unusual cases, errors in data collection, or natural variability.
Definitions
Cluster
A concentration of data values within a specific range, indicating a subgroup or phase in the data. Clusters show where values tend to group together and can highlight common behaviors or conditions.
Outlier
A data value that is substantially higher or lower than the rest of the data set.
Trend
A general direction in which data values are moving over time or across categories. Trends can be increasing, decreasing, or constant, and they help reveal long-term patterns in a data set.

Patterns and trends describe how data values behave as a group - for example, steadily rising, steadily falling, or clustering around certain values. Outliers matter because they can distort summary measures like the mean or the range.

Finding the five-number summary

To find Q1​ and Q3​: order the data, locate the median, then split into a lower half and an upper half. If n is odd, exclude the median from both halves; if n is even, split evenly. Q1​ is the median of the lower half and Q3​ is the median of the upper half.

Take the following data set:

65,68,71,74,77,82,82,89

There are n=8 values, already ordered. The median is the average of the 4th and 5th values:

Q2​=274+77​=75.5

The lower half is 65,68,71,74, so:

Q1​=268+71​=69.5

The upper half is 77,82,82,89, so:

Q3​=282+82​=82

This gives the five-number summary:

  • Minimum (Min) = 65
  • First quartile (Q1​) = 69.5
  • Median (Q2​) = 75.5
  • Third quartile (Q3​) = 82
  • Maximum (Max) = 89
Sidenote
Watch out

Always order the data before computing any quartile. A common mistake is reading Q1​ as the smallest value in the lower half - it’s actually the median of the lower half.

Visual interpretation

Some problems ask you to choose the correct box-and-whisker plot (also called a boxplot) for a given data set. To do that, you need to know how the five-number summary maps onto the picture.

The box spans from Q1​ to Q3​, the median Q2​ is marked by a line inside the box, and whiskers extend from the box out to the minimum and maximum. Boxplots can be drawn horizontally or vertically; the box still spans from Q1​ to Q3​ with the median line inside - only the axis orientation changes.

Here’s what the five-number summary from the previous example looks like as a boxplot. Since no values fall outside the 1.5×IQR fences, the whiskers extend all the way to the actual minimum and maximum.

Example: Identify the outlier and compare means

10,12,15,14,13,100,16,14

Find the following:

  • The outlier
  • The mean of all eight values
  • The mean of the seven typical values excluding the outlier
(spoiler)

Order the data: 10,12,13,14,14,15,16,100

First, compute Q1​ and Q3​ to find the IQR.

  • Q1​=212+13​=12.5
  • Q3​=215+16​=15.5
  • IQR=Q3​−Q1​=15.5−12.5=3

Now apply the 1.5×IQR rule. A value is an outlier if it exceeds Q3​+1.5×IQR:

  • Upper outlier bound: Q3​+1.5×IQR=15.5+(1.5×3)=15.5+4.5=20

Since 100>20, it is an outlier.

  • Mean of all values: 810+12+13+14+14+15+16+100​=8194​=24.25
  • Mean without outlier: 794​≈13.43

Answer: The outlier is 100. The mean of all values is 24.25, and the mean without the outlier is approximately 13.43.

Example: Detect trend, clusters, and outliers

Given the following data set, describe any clusters, trends, or outliers.

5,7,8,9,10,15,16,17,20,21

(spoiler)

The data is already ordered from least to greatest.

Clusters: There are two visible clusters - one from 5 to 10 and another from 15 to 21, with a gap in between.

Outliers: All values fall between 5 and 21 with no extreme gaps from the group, so there are no outliers.

Answer: The data show two clusters (5-10 and 15-21) and no outliers.

Justifying conclusions with data

Strong conclusions point to specific numbers, trends, or features in the display, and they avoid claims the data can’t support. When describing a trend, name the direction and support it with at least one specific value - for example, “visits increased by 300 each month from January (1,200) to March (1,800).”

Correlation is not causation. A trend or association between two variables tells you they move together - it doesn’t tell you that one causes the other. Describing a trend accurately is fine; claiming a cause-and-effect relationship requires more than a pattern in the data.

Sidenote
Choosing the right display

Use the right display for your data type - this is a frequently tested skill.

  • Boxplot - numerical data; best for showing distribution and spotting outliers (e.g., comparing test score spreads across two classes)
  • Histogram - numerical data; best when the shape of the distribution matters. A symmetric histogram has roughly equal tails on both sides; a right-skewed histogram has a longer tail on the right (a few high values pull the mean up); a left-skewed histogram has a longer tail on the left.
  • Bar graph - categorical data; compares counts or values across distinct groups (e.g., favorite subject by number of students)
  • Line graph - data that changes over time; connects points to show trends (e.g., monthly rainfall over a year)
  • Scatterplot - two numerical variables; reveals relationships or correlations (e.g., hours studied vs. exam score)
Key points
  • Look for overall patterns, trends, clusters, or cycles before focusing on individual values
  • Identify any values that stand apart and confirm them as outliers using the 1.5×IQR rule
  • Use the five-number summary to describe the spread and center of a data set, and represent it visually with a boxplot
  • Anchor conclusions in specific numeric changes or clear features from the display
  • Avoid claims that extend beyond what the data actually show