2.6.4 Scatterplots and Other Graphs

Achievable SAT

2. SAT Math

2.6. SAT Problem Solving and Data

Scatterplots and Other Graphs

13 min read

Font

Discuss

Feedback

Introduction

Scatterplots are collections of individual data points shown on a graph. For purposes of the SAT, the shape of a scatterplot will always mirror one of the major types of functions seen on the test: linear, quadratic, or exponential. Of these, linear scatterplots are the most common. This means that everything related to linear equations, particularly their slopes and y-intercepts, arises again as we deal with scatterplots. A good chance to review such a crucial concept as linear equations!

When the shape of a scatterplot graph resembles a line, often a line of best fit is drawn to model the data. This helps the viewer evaluate the general contours of the graph, particularly its slope and intercepts. You will get plenty of practice with the line of best fit in this lesson.

Finally, you will see in this lesson, and in the Achievable practice tests more broadly, the occasional bar graph and line graph. These kinds of graphs are typically familiar to students and need little explanation; like scatterplots, they tend to have $x$ - and $y$ -axes; understanding what those axes represent and reading them carefully is crucial to solving problems with these sorts of graphs.

Approach Question

Scatterplot
Which of the following equations is the most appropriate linear model for the data shown in the scatterplot?

A. $y = 1.18 x + 7.7$
B. $y = 1.18 x - 7.7$
C. $y = - 1.18 x + 7.7$
D. $y = - 1.18 x - 7.7$

Explanation

Why are all the answer choices in slope-intercept form, you may ask? Because, as noted in the introduction, scatterplot questions are typically going to surface again the concept of linear equations. The figure shows a set of points that, as the graph is viewed from left to right, trend generally (though not uniquely) in a downward direction. (Remember that we read graphs, just like English text, left to right.) Imagine that you were to draw a line of best fit for these data. Such a line should start approximately at the point in the upper left and trend downward through the middle of the data, hitting the $x$ -axis somewhere between about $x = 5$ and $x = 5.5$ .

Clearly, this line of best fit has a negative slope. As we remember $y = m x + b$ as slope-intercept form and note that $m$ is the slope, we can eliminate the two answers that have a positive value for $m$ ( $0.78$ ). What would be the $y$ -intercept of the line of best fit we imagine here? The good news is that if you use the UnCLES method and consider the answers carefully, you will realize you don’t need to determine that value exactly. Since, of the two remaining answers, one has a positive value and one has a negative, we can confidently choose the one with the positive value because the $y$ -intercept is clearly positive. The answer is $y = - 1.18 x + 7.7$ .

Definitions

Scatterplot

A diagram that shows the relationship between two variables using dots for the data points.

Line of Best Fit

A straight line that minimizes the distance between it and the data points in a scatterplot. The line of best fit reveals the predicted value of $y$ at any given value of $x$ .

Topics for Cross-Reference

Mean, Median, and Standard Deviation

Variations

Although there are no examples of scatterplots in the shape of a parabola in this lesson, this relationship does occur occasionally on the SAT. As long as you remember that parabolas are modeled by quadratic equations and can assess the features of a quadratic function to understand its graph, you will be able to apply the general principles of scatterplot questions to a quadratic model.

Strategy Insights

With a scatterplot question, always ask whether the relationship is linear, quadratic, or exponential. The answer to that question will go along way toward revealing the best solution strategy.

Flashcard Fodder

None here, but use this opportunity to review your flashcards on linear equations, since that concept is vital to most scatterplot questions.

Sample Questions

Difficulty 1

Dollars raised bar graph
Students at Pleasantville High split up into 10 groups for 10 fundraising carwashes. The bar graph shows the money raised by each of the 10 groups. How many groups raised at least $$500$ ? (Note: this is a free-response question.)

(spoiler)

The answer is 3. To answer this question, we first identify which axis represents the dollar amount raised; in this case, that’s the $y$ -axis. Finding $500$ on the $y$ -axis, we draw our finger across the screen to carefully identify how many of the bars exceed that amount. Groups 4, 7, and 8 raised more than $$500$ –a total of $3$ .

Difficulty 2

Points scored and game number line graph
A team manager plots Anna Grace’s points scored for each of nine games of the season. The line graph above shows the results. Between which two consecutive games did Anna Grace’s scoring increase the most?

A. 1 and 2
B. 3 and 4
C. 4 and 5
D. 6 and 7

(spoiler)

The answer is 3 and 4. Reading the slope of the graph’s changes is enough to identify a clear winner. All four of the answer choices represent intervals in which Anna Grace’s scoring went up, but the increase between 3 and 4 is an increase of $5$ , whereas none of the other increases are more than $4$ .

Difficulty 3

Weight and age scatterplot
The scatterplot shows the weight of a boy at various ages from 4 to 12, inclusive, along with a line of best fit for the data. At age 11, which is greater: the boy’s actual weight or his predicted weight?

A. The boy’s actual weight
B. The boy’s predicted weight
C. The two weights are equal.
D. There is not enough information given to answer the question.

(spoiler)

The answer is The boy’s actual weight. This question can be answered quickly; the reason we have labeled it medium difficulty is that, in our experience, students have modest experience with scatterplots and often confuse the actual value with the predicted value. If you remember that the dots present the actual data points that are originally plotted, it will be easier to then recognize that the line of best fit is only a prediction, not a representation of actual values.

At 11 years on the $x$ -axis, the dot is above the line. This means the actual value is greater than the predicted value.

Difficulty 4

Soccer program scatterplot
The scatterplot above shows the number of participants in a soccer program each year from the year the program was started (year 0) to five years later (year 5). What was the average rate of change in participants in the program from year 2 to year 5?

A. $\frac{100}{3}$
B. $25$
C. $\frac{4}{3}$
D. $\frac{3}{4}$

(spoiler)

The answer is $\frac{100}{3}$ . You may recall from lessons on linear equations and on modeling equations that “rate of change” is a phrase that should immediately conjure the word “slope” in your mind. As long as the relationship described is linear, you can always read the rate of change according to the slope.

But be careful: in this case, we are looking only at the slope from year 2 to year 5. And since the graph displays no single line showing the slope from $x = 2$ to $x = 5$ , we’ll need to envision our own line. Better (and quicker) yet, let’s use the rise/run formula $\frac{y _{2} - y _{1}}{x _{2} - x _{1}}$ to evaluate what happens in this interval. The $y$ -value at $x = 2$ is $75$ and the $y$ -value at $x = 5$ is $175$ . This means the two points in view are $(2, 75)$ and $(5, 175)$ . If we subtract $175 - 75$ for the numerator and $5 - 2$ for the denominator, we get $\frac{100}{3}$ . (This, by the way, is equivalent to $33 \frac{1}{3}$ ; another way of describing the rate of change is that, on average, the program’s participant total increased by $33 \frac{1}{3}$ every year from year 2 to year 5.)

Difficulty 5

Two variable scatterplot
The scatterplot above shows the relationship between two variables, $x$ and $y$ . An equation modeling this relationship can be written as $y = a (b)^{x}$ , where $a$ and $b$ are constants and $a > 1$ . Which of the following could be the value of $b$ ?

A. $- 0.7$
B. $0.7$
C. $1$
D. $1.7$

(spoiler)

The answer is 1.7. To answer this question, we have to think back to the lesson on exponential equations. Look back if you need to! Recall that in the form $y = a b^{x}$ , $y$ is the final amount, $a$ is the original amount, and $x$ is the independent variable that changes the result (usually this variable is time). What about $b$ , the unknown asked about in this situation? To understand what’s happening with $b$ , we need to think about when a base is raised to a power greater than $1$ .

Consider that, in common parlance, the word “exponential” is typically used to describe accelerating growth over time. There is a good reason for this; when we think of raising a number to a power, we are usually envisioning the power of $2$ , $3$ , or something larger. But our study of exponents should have reminded you that other powers behave differently: for example, anything raised to the power of $1$ stays the same, and anything raised to the power of $0$ is $1$ .

The graph of an exponential function typically shows accelerating growth from left to right because the graph eventually shows powers greater than $1$ , as long as you read far enough to the right. But there is another factor: the base. Not all bases grow as they are raised to higher and higher powers. Thinking about what happens if you keep cutting something in half, over and over again. It certainly isn’t going to get larger! Indeed, if the base is between $0$ and $1$ , the overall quantity will decrease the larger the exponent gets. This is known as exponential decay and is contrasted with the more common growth function.

OK, you might be saying, but our graph clearly shows growth in this case. This means we need to avoid decay! We can therefore rule out $0.7$ , which is the only answer choice between $0$ and $1$ . But there are still three answers left. What happens when you raise $1$ to an increasing power? It remains $1$ , so its graph would be a horizontal line. Clearly, that’s not what we want. What about the negative base? There’s a reason negative bases are never used in exponential function models; their $y$ -values would alternate between negative and positive based on whether the exponent is odd or even. And it’s even more complicated than that if the exponent does not have to be an integer. For the curious: if you graph $y = (- 1.5)^{x}$ in Desmos, it doesn’t even connect the points with a curve, which shows how tenuous the graph would be in this case. Here’s a picture of that result:

This leaves our correct answer, $1.7$ . If you already understood that exponential growth can only be represented by a base greater than $1$ , you could have skipped the explanation in the previous paragraph. Memorize that simple fact now, and you’ll be able to answer any question like this very quickly!

For Reflection

Compared to how you’ve done scatterplot questions in the past, what new strategies has this module given you?
Rate the difficulty of these questions for you from 1 (no problem) to 5 (problem!). This will help you decide when to answer them and when to skip them on test day.
What have you learned about scatterplots in your math education? If you learned very little, did this lesson clear up any confusion? If not, you might want to do a little more reading from a textbook or web page about how scatterplots work.

Scatterplots and Graph Types

Scatterplots display data as individual points; often modeled by linear, quadratic, or exponential functions
Linear scatterplots most common; concepts of slope and y-intercept apply
Bar and line graphs also appear; careful reading of axes is crucial

Line of Best Fit

Straight line approximating data trend in a scatterplot
Reveals predicted y-value for any x-value

Linear Models in Scatterplots

Slope-intercept form: $y = m x + b$
Negative slope: line trends downward left to right
Positive y-intercept: line crosses y-axis above zero

Strategy for Scatterplot Questions

Identify relationship type: linear, quadratic, or exponential
Use slope ( $m$ ) and y-intercept ( $b$ ) to match models to data
For rate of change, use slope formula: $\frac{y _{2} - y _{1}}{x _{2} - x _{1}}$

Bar and Line Graphs

Bar graphs: compare quantities across categories; read y-axis for values
Line graphs: track changes over intervals; steepest slope = greatest change

Quadratic and Exponential Scatterplots

Quadratic: modeled by $y = a x^{2} + b x + c$ ; shape is a parabola
Exponential: modeled by $y = a b^{x}$ ; growth if $b > 1$ , decay if $0 < b < 1$

Key Definitions

Scatterplot: diagram showing relationship between two variables with dots
Line of best fit: straight line minimizing distance to all data points

Exponential Models

For $y = a b^{x}$ :
- $b > 1$ : exponential growth
- $0 < b < 1$ : exponential decay
- $b = 1$ : constant function (horizontal line)
- Negative $b$ : not used in standard models

Sample Problem Takeaways

Count bars above a threshold by reading the y-axis
Steepest increase on a line graph = largest difference between consecutive points
Actual value = data point (dot); predicted value = line of best fit
Average rate of change = slope between two points
Exponential growth requires base $b > 1$

Scatterplots and Other Graphs

Introduction

Approach Question

Which of the following equations is the most appropriate linear model for the data shown in the scatterplot?

A. $y = 1.18 x + 7.7$
B. $y = 1.18 x - 7.7$
C. $y = - 1.18 x + 7.7$
D. $y = - 1.18 x - 7.7$

Explanation

Definitions

Scatterplot

A diagram that shows the relationship between two variables using dots for the data points.

Line of Best Fit

A straight line that minimizes the distance between it and the data points in a scatterplot. The line of best fit reveals the predicted value of $y$ at any given value of $x$ .

Topics for Cross-Reference

Mean, Median, and Standard Deviation

Variations

Strategy Insights

With a scatterplot question, always ask whether the relationship is linear, quadratic, or exponential. The answer to that question will go along way toward revealing the best solution strategy.

Flashcard Fodder

None here, but use this opportunity to review your flashcards on linear equations, since that concept is vital to most scatterplot questions.

Sample Questions

Difficulty 1

Students at Pleasantville High split up into 10 groups for 10 fundraising carwashes. The bar graph shows the money raised by each of the 10 groups. How many groups raised at least $$500$ ? (Note: this is a free-response question.)

(spoiler)

Difficulty 2

A team manager plots Anna Grace’s points scored for each of nine games of the season. The line graph above shows the results. Between which two consecutive games did Anna Grace’s scoring increase the most?

A. 1 and 2
B. 3 and 4
C. 4 and 5
D. 6 and 7

(spoiler)

Difficulty 3

The scatterplot shows the weight of a boy at various ages from 4 to 12, inclusive, along with a line of best fit for the data. At age 11, which is greater: the boy’s actual weight or his predicted weight?

A. The boy’s actual weight
B. The boy’s predicted weight
C. The two weights are equal.
D. There is not enough information given to answer the question.

(spoiler)

At 11 years on the $x$ -axis, the dot is above the line. This means the actual value is greater than the predicted value.

Difficulty 4

The scatterplot above shows the number of participants in a soccer program each year from the year the program was started (year 0) to five years later (year 5). What was the average rate of change in participants in the program from year 2 to year 5?

A. $\frac{100}{3}$
B. $25$
C. $\frac{4}{3}$
D. $\frac{3}{4}$

(spoiler)

Difficulty 5

The scatterplot above shows the relationship between two variables, $x$ and $y$ . An equation modeling this relationship can be written as $y = a (b)^{x}$ , where $a$ and $b$ are constants and $a > 1$ . Which of the following could be the value of $b$ ?

A. $- 0.7$
B. $0.7$
C. $1$
D. $1.7$

(spoiler)

For Reflection

Compared to how you’ve done scatterplot questions in the past, what new strategies has this module given you?
Rate the difficulty of these questions for you from 1 (no problem) to 5 (problem!). This will help you decide when to answer them and when to skip them on test day.
What have you learned about scatterplots in your math education? If you learned very little, did this lesson clear up any confusion? If not, you might want to do a little more reading from a textbook or web page about how scatterplots work.

Achievable SAT

2. SAT Math

2.6. SAT Problem Solving and Data

Scatterplots and Other Graphs

13 min read

Font

Discuss

Feedback

Introduction

Approach Question

Scatterplot
Which of the following equations is the most appropriate linear model for the data shown in the scatterplot?

A. $y = 1.18 x + 7.7$
B. $y = 1.18 x - 7.7$
C. $y = - 1.18 x + 7.7$
D. $y = - 1.18 x - 7.7$

Explanation

Definitions

Scatterplot

A diagram that shows the relationship between two variables using dots for the data points.

Line of Best Fit

A straight line that minimizes the distance between it and the data points in a scatterplot. The line of best fit reveals the predicted value of $y$ at any given value of $x$ .

Topics for Cross-Reference

Mean, Median, and Standard Deviation

Variations

Strategy Insights

With a scatterplot question, always ask whether the relationship is linear, quadratic, or exponential. The answer to that question will go along way toward revealing the best solution strategy.

Flashcard Fodder

None here, but use this opportunity to review your flashcards on linear equations, since that concept is vital to most scatterplot questions.

Sample Questions

Difficulty 1

Dollars raised bar graph
Students at Pleasantville High split up into 10 groups for 10 fundraising carwashes. The bar graph shows the money raised by each of the 10 groups. How many groups raised at least $$500$ ? (Note: this is a free-response question.)

(spoiler)

Difficulty 2

Points scored and game number line graph
A team manager plots Anna Grace’s points scored for each of nine games of the season. The line graph above shows the results. Between which two consecutive games did Anna Grace’s scoring increase the most?

A. 1 and 2
B. 3 and 4
C. 4 and 5
D. 6 and 7

(spoiler)

Difficulty 3

Weight and age scatterplot
The scatterplot shows the weight of a boy at various ages from 4 to 12, inclusive, along with a line of best fit for the data. At age 11, which is greater: the boy’s actual weight or his predicted weight?

A. The boy’s actual weight
B. The boy’s predicted weight
C. The two weights are equal.
D. There is not enough information given to answer the question.

(spoiler)

At 11 years on the $x$ -axis, the dot is above the line. This means the actual value is greater than the predicted value.

Difficulty 4

Soccer program scatterplot
The scatterplot above shows the number of participants in a soccer program each year from the year the program was started (year 0) to five years later (year 5). What was the average rate of change in participants in the program from year 2 to year 5?

A. $\frac{100}{3}$
B. $25$
C. $\frac{4}{3}$
D. $\frac{3}{4}$

(spoiler)

Difficulty 5

Two variable scatterplot
The scatterplot above shows the relationship between two variables, $x$ and $y$ . An equation modeling this relationship can be written as $y = a (b)^{x}$ , where $a$ and $b$ are constants and $a > 1$ . Which of the following could be the value of $b$ ?

A. $- 0.7$
B. $0.7$
C. $1$
D. $1.7$

(spoiler)

For Reflection

Compared to how you’ve done scatterplot questions in the past, what new strategies has this module given you?
Rate the difficulty of these questions for you from 1 (no problem) to 5 (problem!). This will help you decide when to answer them and when to skip them on test day.
What have you learned about scatterplots in your math education? If you learned very little, did this lesson clear up any confusion? If not, you might want to do a little more reading from a textbook or web page about how scatterplots work.

Scatterplots and Graph Types

Scatterplots display data as individual points; often modeled by linear, quadratic, or exponential functions
Linear scatterplots most common; concepts of slope and y-intercept apply
Bar and line graphs also appear; careful reading of axes is crucial

Line of Best Fit

Straight line approximating data trend in a scatterplot
Reveals predicted y-value for any x-value

Linear Models in Scatterplots

Slope-intercept form: $y = m x + b$
Negative slope: line trends downward left to right
Positive y-intercept: line crosses y-axis above zero

Strategy for Scatterplot Questions

Identify relationship type: linear, quadratic, or exponential
Use slope ( $m$ ) and y-intercept ( $b$ ) to match models to data
For rate of change, use slope formula: $\frac{y _{2} - y _{1}}{x _{2} - x _{1}}$

Bar and Line Graphs

Bar graphs: compare quantities across categories; read y-axis for values
Line graphs: track changes over intervals; steepest slope = greatest change

Quadratic and Exponential Scatterplots

Quadratic: modeled by $y = a x^{2} + b x + c$ ; shape is a parabola
Exponential: modeled by $y = a b^{x}$ ; growth if $b > 1$ , decay if $0 < b < 1$

Key Definitions

Scatterplot: diagram showing relationship between two variables with dots
Line of best fit: straight line minimizing distance to all data points

Exponential Models

For $y = a b^{x}$ :
- $b > 1$ : exponential growth
- $0 < b < 1$ : exponential decay
- $b = 1$ : constant function (horizontal line)
- Negative $b$ : not used in standard models

Sample Problem Takeaways

Count bars above a threshold by reading the y-axis
Steepest increase on a line graph = largest difference between consecutive points
Actual value = data point (dot); predicted value = line of best fit
Average rate of change = slope between two points
Exponential growth requires base $b > 1$

Scatterplots and Other Graphs

Introduction

Approach Question

Which of the following equations is the most appropriate linear model for the data shown in the scatterplot?

A. $y = 1.18 x + 7.7$
B. $y = 1.18 x - 7.7$
C. $y = - 1.18 x + 7.7$
D. $y = - 1.18 x - 7.7$

Explanation

Definitions

Scatterplot

A diagram that shows the relationship between two variables using dots for the data points.

Line of Best Fit

A straight line that minimizes the distance between it and the data points in a scatterplot. The line of best fit reveals the predicted value of $y$ at any given value of $x$ .

Topics for Cross-Reference

Mean, Median, and Standard Deviation

Variations

Strategy Insights

With a scatterplot question, always ask whether the relationship is linear, quadratic, or exponential. The answer to that question will go along way toward revealing the best solution strategy.

Flashcard Fodder

None here, but use this opportunity to review your flashcards on linear equations, since that concept is vital to most scatterplot questions.

Sample Questions

Difficulty 1

Students at Pleasantville High split up into 10 groups for 10 fundraising carwashes. The bar graph shows the money raised by each of the 10 groups. How many groups raised at least $$500$ ? (Note: this is a free-response question.)

(spoiler)

Difficulty 2

A team manager plots Anna Grace’s points scored for each of nine games of the season. The line graph above shows the results. Between which two consecutive games did Anna Grace’s scoring increase the most?

A. 1 and 2
B. 3 and 4
C. 4 and 5
D. 6 and 7

(spoiler)

Difficulty 3

The scatterplot shows the weight of a boy at various ages from 4 to 12, inclusive, along with a line of best fit for the data. At age 11, which is greater: the boy’s actual weight or his predicted weight?

A. The boy’s actual weight
B. The boy’s predicted weight
C. The two weights are equal.
D. There is not enough information given to answer the question.

(spoiler)

At 11 years on the $x$ -axis, the dot is above the line. This means the actual value is greater than the predicted value.

Difficulty 4

The scatterplot above shows the number of participants in a soccer program each year from the year the program was started (year 0) to five years later (year 5). What was the average rate of change in participants in the program from year 2 to year 5?

A. $\frac{100}{3}$
B. $25$
C. $\frac{4}{3}$
D. $\frac{3}{4}$

(spoiler)

Difficulty 5

The scatterplot above shows the relationship between two variables, $x$ and $y$ . An equation modeling this relationship can be written as $y = a (b)^{x}$ , where $a$ and $b$ are constants and $a > 1$ . Which of the following could be the value of $b$ ?

A. $- 0.7$
B. $0.7$
C. $1$
D. $1.7$

(spoiler)

For Reflection

Compared to how you’ve done scatterplot questions in the past, what new strategies has this module given you?
Rate the difficulty of these questions for you from 1 (no problem) to 5 (problem!). This will help you decide when to answer them and when to skip them on test day.
What have you learned about scatterplots in your math education? If you learned very little, did this lesson clear up any confusion? If not, you might want to do a little more reading from a textbook or web page about how scatterplots work.