What is SAT Data Analysis?

SAT Data Analysis, also known as Problem Solving and Data Analysis, is a major SAT Math domain that tests your ability to interpret charts, tables, and statistical data. It accounts for about 30 percent of the SAT Math section.

What types of charts appear in SAT Data Analysis?

Common charts include bar graphs, line graphs, scatter plots, two-way tables, and pie charts. These test your ability to interpret and compare data accurately.

What is the most common mistake in SAT Data Analysis?

The most common mistake is misreading the data, especially using the wrong denominator in tables or missing chart labels and units.

Is SAT Data Analysis easier than algebra?

For many students, SAT Data Analysis is easier because all required information is provided in charts and tables, but it requires strong reading and interpretation skills.

How can I improve my SAT Data Analysis score?

Focus on reading questions carefully, understanding chart labels, using the correct denominator, and practicing the common question types like tables, statistics, and probability.

SAT Data Analysis: Charts, Tables & Statistics - Complete Strategy Guide

Q: How many data analysis questions are on the SAT?

There are typically 13 to 15 data analysis questions in the SAT Math section, making it one of the most important scoring areas.

Edu Shaale
4 days ago
28 min read

White "SAT" text on a bright blue background, with a registered trademark symbol. Simple, bold, and clear design. — *SAT coaching designed to help you reach a 1500+ score*

Serious About Your SAT Score? Let’s Get You There

Whether you're starting your prep or aiming for a top score, EduShaale’s SAT coaching is built for results — with personalised strategy, small batches, and proven score improvement methods.

Book A Free Demo Session

10 Question Types · Bar Charts · Scatter Plots · Two-Way Tables · Statistics · Probability · Every Type Mastered

Published: May 2026 | Updated: May 2026 | ~14 min read

~30%

Problem Solving & Data Analysis: share of SAT Math

15-17

Data Analysis questions per 44-question SAT Math section

Distinct question types in SAT Data Analysis covered here

No formula

No single formula covers all 10 types -- strategy is everything

Bar/Line

Most common display: bar charts and line graphs

Scatter

Scatter plots + line of best fit: 2-3 questions per exam

2-Way

Two-way tables: tested in almost every administration

Stats

Mean, median, mode, range, SD: tested every exam

Laptop displaying colorful charts and graphs on a desk, next to a red mug with the number 7. Bright, modern office setting.

Why Data Analysis Is the Most Consistently Tested SAT Math Domain
The SAT Math Scoring Framework: Where Data Analysis Lives
Quick Reference: All 10 Data Analysis Question Types
The Universal 5-Step Data Analysis Strategy
Type 1: Reading Bar Charts and Histograms
Type 2: Reading Line Graphs
Type 3: Scatter Plots and Line of Best Fit
Type 4: Two-Way Tables (Categorical Data)
Type 5: Pie Charts and Part-to-Whole
Type 6: Mean, Median, Mode, and Range
Type 7: Standard Deviation and Spread
Type 8: Probability (Single and Conditional)
Type 9: Data Displays With Context (Inference Questions)
Type 10: Comparing Data Sets
The Desmos Advantage for Data Analysis Questions
Data Analysis in the R&W Section: What Students Miss
The 5 Most Common Data Analysis Errors
Data Analysis Practice Plan
Frequently Asked Questions (12 FAQs)
EduShaale -- Expert SAT Coaching
References & Resources

1. Why Data Analysis Is the Most Consistently Tested SAT Math Domain

Property	Why It Makes Data Analysis Uniquely Learnable
All information is provided on the page	Every Data Analysis question contains the chart, table, or graph needed to answer it. There are no 'trick' questions where the necessary data is missing. The skill is reading what is there accurately.
No advanced mathematics required	Data Analysis questions rarely require algebra, calculus, or advanced formulas. The mathematics involved is typically arithmetic (percentages, ratios, averages) applied to data correctly identified from the display.
Consistent question types	The same 10 question types in this guide appear on every SAT administration. Unlike Advanced Math questions that can vary widely, Data Analysis follows predictable patterns that students can specifically prepare for.
Questions test reasoning, not recall	Data Analysis questions test whether students can interpret what data means -- not whether they have memorised formulas. A student who has not memorised a single formula can answer most Data Analysis questions if they read carefully.
The most common error is misreading -- not miscalculating	Students lose Data Analysis points not because they cannot do the arithmetic, but because they misidentify which row, column, or data point the question is asking about. This error is preventable through a systematic reading approach.

The Strategic Implication: Students preparing for the SAT should prioritise Data Analysis because it is the domain with the highest ratio of questions to preparation effort required. A student who systematically learns the 10 question types and the 5-step reading strategy can expect to answer 12-15 of 15-17 Data Analysis questions correctly. No other SAT Math domain offers this return on preparation investment.

2. The SAT Math Scoring Framework: Where Data Analysis Lives

Domain	Approximate Weight	Questions per Exam	What It Tests	This Guide
Algebra	35%	~15-16 questions	Linear equations, systems, functions, inequalities	See EduShaale Algebra guide
Problem Solving & Data Analysis	30%	~13-15 questions	Charts, tables, statistics, probability, data interpretation	THIS GUIDE -- complete coverage
Advanced Math	25%	~11-12 questions	Quadratics, polynomials, exponentials, rational expressions	See EduShaale Advanced Math guide
Geometry & Trigonometry	10%	~4-5 questions	Area, volume, angles, trig ratios	See EduShaale Geometry guide

Digital SAT Data Analysis Format In the Digital SAT (Bluebook, 2024+), Data Analysis questions appear in both Module 1 (no calculator) and Module 2 (calculator with Desmos). The majority of PSDA questions appear in the calculator module where Desmos can assist with ratio calculations and regression line evaluation. However, some straightforward PSDA questions appear in Module 1 and must be solved without a calculator -- typically simple percentage, proportion, or basic statistics questions.

3. Quick Reference: All 10 Data Analysis Question Types

#	Question Type	Frequency	Display Type	Core Skill	Time Budget
1	Bar Charts and Histograms	Very High	Bar chart, histogram	Read specific bar values; compare bars; identify categories	45-60 sec
2	Line Graphs	High	Line graph, time series	Track trends; read specific points; calculate rates of change	45-60 sec
3	Scatter Plots + Line of Best Fit	High	Scatter plot with trend line	Read trend direction; estimate from line; identify outliers	60-75 sec
4	Two-Way Tables	Very High	Table with row and column categories	Read specific cells; calculate row/column totals; compute percentages from table	45-75 sec
5	Pie Charts / Part-to-Whole	Moderate	Pie chart, percentage display	Convert between percentage and count; calculate from proportions	30-45 sec
6	Mean, Median, Mode, Range	Very High	Any data display or list	Calculate each measure; understand when each is appropriate	45-90 sec
7	Standard Deviation and Spread	Moderate	Distribution descriptions or data sets	Compare variability; understand what SD measures	30-45 sec
8	Probability (Single + Conditional)	High	Two-way tables, sample descriptions	Identify the sample space correctly; apply P = favourable/total	45-75 sec
9	Data + Context Inference	High	Any display with passage description	Identify what can and cannot be concluded from the data	60-90 sec
10	Comparing Two Data Sets	Moderate	Two displays side by side, or described sets	Apply comparison language correctly (greater, smaller, more variable)	45-75 sec

4. The Universal 5-Step Data Analysis Strategy

This strategy applies to all 10 question types. Master it once and it guides every Data Analysis question.

Read the Question Stem FIRST -- Before the Chart
The stem tells you exactly what to find. Read it before looking at the data. Ask: What specific number, category, or comparison is being requested? Is this asking about a row total, a column percentage, a trend, an average? The stem defines your search -- the data is where you find it.
Read All Labels on the Data Display
Title, axes labels (including units), legend, footnotes. These 4-5 elements contain the information needed to interpret every data point correctly. A student who misses the unit on the y-axis ('thousands of dollars' not 'dollars') will calculate an answer off by a factor of 1,000. Take 15 seconds to read all labels before reading any data values.
Locate the Specific Data Point(s) the Question References
Using the stem's specific reference (a particular year, category, group, or variable), find the exact data point(s). On two-way tables: trace to the specific row AND column intersection. On bar charts: read the bar height for the specific category. On scatter plots: locate the specific point or the trend line at the specific x-value.
Perform the Required Calculation -- and Check the Units
Most calculations are simple arithmetic: percentage = part/total, average = sum/count, ratio = one value/another value. Perform the calculation carefully. Verify the units in your answer match what the question asks for. If the question asks for 'thousands of dollars,' ensure your answer is in thousands.
Verify Against the Answer Choices -- Eliminate Then Select
After calculating, compare your result to the answer choices. If your answer exactly matches one choice, select it. If not: check which of your 5 steps may have an error. Re-read the stem to confirm you answered the right question. Check your calculation. The error is almost always in Step 1 (misread what was asked) or Step 3 (used wrong data point).

⚠️ The Most Expensive Step to Skip: Step 2 (reading all labels). Students who skip label-reading and go directly to data values consistently make errors on units, categories, and time periods. Spending 15 seconds on labels before reading data saves 60-90 seconds of re-checking after a wrong calculation.

5. Type 1: Reading Bar Charts and Histograms

Type 1: Bar Charts and Histograms | Frequency: Very High (3-4 per exam)

Recognise by stem: 'According to the graph, which year had the highest...?' or 'By approximately how much did X exceed Y in [category]?'

Read first: Title (what is being measured) + y-axis label (units) + x-axis label (categories or time) + scale intervals

✅ Winning approach: Identify the specific bar(s) the question references. Read the bar height against the y-axis scale. If comparing bars: read both heights, then subtract. If finding a proportion: calculate the ratio of the specific bar to the total of all bars.

⚠️ Classic trap: Reading the bar height at the wrong scale interval. Bar charts with y-axes that start at a non-zero value (like 50,000 instead of 0) -- students read the bar as if it started from 0, dramatically misestimating values. Always verify: where does the y-axis start?

Worked example: Bar chart shows monthly sales: January $45,000, February $62,000, March $38,000. Question: 'By approximately how much did February sales exceed January sales?' Read February bar: $62,000. Read January bar: $45,000. Subtract: $17,000.

Histogram vs Bar Chart: The Critical Difference

Feature	Bar Chart	Histogram
Data type	Categorical (separate, distinct groups: years, countries, products)	Continuous (ranges: age groups 20-30, 30-40, heights)
Bars	Separate -- gaps between bars	Adjacent -- no gaps (continuous data)
X-axis	Named categories	Numerical ranges (intervals)
What the height means	Frequency or value for that specific category	Frequency of data falling within that continuous range
Most common SAT error	Reading the wrong category's bar	Using the midpoint of the interval rather than the full interval in calculations

6. Type 2: Reading Line Graphs

Type 2: Line Graphs | Frequency: High (2-3 per exam)

Recognise by stem: 'What was the value of X at year Y?' or 'Between which two periods did X increase the most?' or 'What is the approximate rate of change between [year A] and [year B]?'

Read first: Title + y-axis label (units) + x-axis label (time or variable) + legend (if multiple lines) + scale on both axes

✅ Winning approach: For specific value: trace from the x-axis value up to the line, then horizontally to the y-axis value. For trend questions: identify whether the line is rising, falling, or flat in the relevant section. For rate of change between two points: calculate (y2 - y1) / (x2 - x1) -- this is the slope of the line between those points.

⚠️ Classic trap: Misidentifying which line the question references when multiple lines are displayed. With 2-3 lines on one graph (common in SAT comparative line graphs), rushing through and reading the wrong line is the most expensive error. Always confirm the line colour/label matches what the question asks about before reading any value.

Worked example: Graph shows temperature over 12 months for two cities. Question: 'In June, by how many degrees Fahrenheit did City A exceed City B?' Step 1: identify City A line and City B line from the legend. Step 2: read each line's value at June. Step 3: subtract. Do not reverse the subtraction.

Line Graph Question Type	What It Asks	Calculation Required
Single point value	What was [variable] in [time period]?	Read directly from graph at that x-value
Maximum/minimum	When was [variable] highest/lowest?	Visually identify the peak/trough; confirm x-value (time)
Change between two points	How much did [variable] change from [t1] to [t2]?	(Value at t2) - (Value at t1); positive = increase, negative = decrease
Rate of change	What was the average rate of change from [t1] to [t2]?	[(y2 - y1) / (x2 - x1)] -- the slope between two points on the line
Comparison (two lines)	By how much did X exceed Y in [period]?	Read both lines at the same x-value; subtract the smaller from the larger
Trend description	Over the entire period shown, did [variable] generally...?	Identify overall direction (up/down/flat/mixed) across the full x-range shown

7. Type 3: Scatter Plots and Line of Best Fit

Type 3: Scatter Plots + Line of Best Fit | Frequency: High (2-3 per exam)

Recognise by stem: 'Which of the following best describes the relationship between X and Y?' or 'Based on the line of best fit, what is the predicted value of Y when X = [value]?' or 'Which point is farthest from the line of best fit?'

Read first: Title + axes labels (both x and y, with units) + whether a trend line is drawn + the equation of the trend line (if given)

✅ Winning approach: For trend questions: identify positive (upward left-to-right), negative (downward), or no correlation. For best-fit line prediction: use the line's position at the given x-value (not the actual data points -- use the LINE). For outlier identification: find the point farthest from the line (largest vertical distance from the line).

⚠️ Classic trap: Using actual data points for prediction questions instead of the line of best fit. If the question asks 'based on the line of best fit, predict Y when X = 8,' the answer comes from the LINE at x=8 -- not from any individual data point that happens to be near x=8.

Worked example: Scatter plot shows hours of study (x) vs test score (y). Line of best fit: y = 5x + 60. Question: 'Based on the line of best fit, what score would be predicted for a student who studies 7 hours?' Substitute x=7: y = 5(7) + 60 = 35 + 60 = 95. Do not average nearby data points.

Scatter Plot Concept	What It Means	How to Identify on SAT
Positive correlation	As x increases, y tends to increase	Points generally slope upward from left to right
Negative correlation	As x increases, y tends to decrease	Points generally slope downward from left to right
No correlation	x and y show no consistent relationship	Points scattered randomly with no directional pattern
Strong correlation	Points cluster tightly around the trend line	Narrow scatter around the line
Weak correlation	Points are spread widely around the trend line	Wide scatter around the line; trend line still present but imprecise
Outlier in scatter plot	A data point far from the main cluster or the trend line	A point visually isolated from the cluster or far from the line
Line of best fit (regression line)	The line that minimises average squared vertical distance from all data points	Usually drawn as a straight line through the scatter -- use this for predictions

8. Type 4: Two-Way Tables (Categorical Data)

Type 4: Two-Way Tables | Frequency: Very High (3-4 per exam)

Recognise by stem: 'What fraction of [group A] had [characteristic B]?' or 'Of those who [condition], what percentage also [second condition]?' or 'What is the probability that a randomly selected [group] is [category]?'

Read first: Row headers (first categorical variable) + column headers (second categorical variable) + row totals + column totals + grand total (bottom-right cell). Read ALL before answering.

✅ Winning approach: Identify EXACTLY which sub-group the question refers to. Read the question twice: 'Of the [row group], what fraction [column condition]?' = specific row total as denominator, specific cell as numerator. 'Of those who [column condition], what fraction [row group]?' = specific column total as denominator -- completely different calculation from the same table.

⚠️ Classic trap: Using the grand total as the denominator when the question specifies a sub-group. 'Of the females surveyed, what fraction preferred option A?' denominator = total females (row total), not the grand total. Using the grand total here produces the wrong answer -- the question restricts the sample to females only.

Worked example: Two-way table: 120 students surveyed. 50 female, 70 male. Of females: 30 prefer coffee, 20 prefer tea. Question: 'Of the females surveyed, what fraction preferred tea?' Denominator: 50 (total females). Numerator: 20. Answer: 20/50 = 2/5. NOT 20/120.

The 3-Question Framework for Any Two-Way Table

Before answering any two-way table question, answer these 3 questions:

Q1: What is the DENOMINATOR? (Is it the grand total, a row total, or a column total?)

Q2: What is the NUMERATOR? (Which specific cell, or combination of cells, is being counted?)

Q3: Is the question asking for fraction, percentage, probability, or count? (Format the answer accordingly.)

Phrasing in Question Stem	Denominator	Numerator
'What fraction of all [people/students/respondents] had [X]?'	Grand total (bottom-right cell)	Specific cell for [X]
'Of the [row group], what fraction had [column property]?'	Row total for [row group]	Cell at intersection of [row group] and [column property]
'Of those who [column property], what fraction were [row group]?'	Column total for [column property]	Cell at intersection of [row group] and [column property]
'What percentage of [row group] did NOT have [column property]?'	Row total for [row group]	Sum of all cells in [row group] except [column property]
'What is the probability that a randomly selected person from [row group] has [column property]?'	Row total for [row group]	Cell at intersection -- this is conditional probability

9. Type 5: Pie Charts and Part-to-Whole

Type 5: Pie Charts and Part-to-Whole | Frequency: Moderate (1-2 per exam)

Recognise by stem: 'If the total number of [items] is [N], how many [items] fall in the [category] sector?' or 'What percentage of the total is [category]?'

Read first: All sector labels (category names) + percentage of each sector + total count (usually given in the question or passage, not on the chart itself)

✅ Winning approach: For count from percentage: Multiply (percentage / 100) by the total count. For percentage from count: Divide specific count by total count, multiply by 100. Always confirm whether the total is stated in the chart or must be inferred from the question stem.

⚠️ Classic trap: Treating the percentage labels on a pie chart as counts. If the pie chart shows '35%' for category A and the total is 200, then 35% is NOT the count -- you must calculate 0.35 * 200 = 70. Answering 35 as if the label is a count is one of the most common pie chart errors.

Worked example: Pie chart: Transportation survey. 40% drive, 35% take public transit, 25% walk. Total respondents: 300. Question: 'How many respondents take public transit?' Calculation: 0.35 * 300 = 105 respondents. Do not answer '35.'

10. Type 6: Mean, Median, Mode, and Range

Type 6: Mean, Median, Mode, and Range | Frequency: Very High (2-3 per exam)

Recognise by stem: 'What is the mean score of...?' or 'If one value is added/removed, how does the median change?' or 'Which data set has a greater range?' or 'What value would be the mode?'

Read first: The complete data set (may be listed directly or displayed in a table/chart) + which specific group is being asked about

✅ Winning approach: Mean = sum of all values / count of values. Median = middle value when data is sorted (or average of two middle values for even-count sets). Mode = most frequent value. Range = maximum value - minimum value. For 'effect on mean/median when value is added or removed': recalculate after the change.

⚠️ Classic trap: Confusing mean and median when the question specifically asks for one. The SAT frequently presents data where mean and median differ significantly (skewed data), then asks about one or the other. A high-value outlier increases the mean dramatically but moves the median only slightly. Read the question: which specific measure is asked for?

Worked example: Data set: 12, 15, 18, 20, 25, 90. Mean = (12+15+18+20+25+90)/6 = 180/6 = 30. Median (even count: 6 values) = average of 3rd and 4th values = (18+20)/2 = 19. Mode = no mode (no repeated values). Range = 90-12 = 78. The mean (30) and median (19) differ greatly due to the outlier 90.

Measure	Definition	When SAT Tests It	Effect of Outlier
Mean (average)	Sum of all values divided by the count	When data is listed or displayed; when adding/removing a value	LARGE effect -- outliers pull mean toward them significantly
Median	Middle value when data is sorted (or avg of two middle values)	When SAT asks which measure is 'most representative' of skewed data	SMALL effect -- outliers shift median by at most one position
Mode	Most frequently occurring value	When SAT provides repeated values and asks which is most common	No mathematical effect -- outliers do not change mode unless they repeat
Range	Maximum value minus minimum value	When SAT asks about spread or asks for the 'difference between highest and lowest'	LARGE effect -- outliers directly change range if they are the new max or min
When to use median instead of mean	When data is skewed by outliers or when the question asks which measure is 'most appropriate' for a skewed distribution	SAT frequently asks 'which measure best represents the typical value' for a data set with an outlier -- answer: median	Median is preferred for skewed data; mean is preferred for symmetric data

11. Type 7: Standard Deviation and Spread

Type 7: Standard Deviation and Spread | Frequency: Moderate (1-2 per exam)

Recognise by stem: 'Which data set has greater standard deviation?' or 'Which of the following would decrease the standard deviation?' or 'Student A's score is 1.5 standard deviations above the mean. What is the score?'

Read first: The data descriptions, or the summary statistics (mean and SD), or the data distribution displayed

✅ Winning approach: SD measures how spread out values are from the mean. HIGHER SD = data more spread out. LOWER SD = data more clustered around mean. For 'which has greater SD' questions: compare the visual spread or the stated SD values. For score calculation: score = mean + (number of SDs * SD value). For effect questions: adding a value NEAR the mean DECREASES SD; adding a value FAR from the mean INCREASES SD.

⚠️ Classic trap: Confusing standard deviation with range. Range is the distance between max and min. SD measures how spread the entire distribution is relative to the mean. A data set can have a large range but small SD if most values cluster near the mean with a few extreme outliers. The SAT specifically tests whether students know this distinction.

Worked example: Data set A: {10, 10, 10, 10, 10}. Mean = 10, SD = 0 (no spread). Data set B: {1, 5, 10, 15, 19}. Mean = 10, SD is much larger (significant spread). Both have the same mean but very different SDs. SAT question type: 'Which set has greater standard deviation?' -- answer B.

SD Concept	Plain Language	SAT Application
Low SD (close to 0)	Values are clustered tightly around the mean	A class where almost everyone scored between 78-82 on a test has low SD
High SD	Values are spread widely from the mean	A class where scores ranged from 40 to 100 has high SD
Adding a value near the mean	This decreases SD (makes the distribution more concentrated)	SAT asks: 'If a student scoring exactly the mean joined the group, how does SD change?' Answer: decreases
Adding a value far from the mean	This increases SD (makes the distribution more spread)	SAT asks: 'If an outlier is added, how does SD change?' Answer: increases
Removing an outlier	This decreases SD (distribution becomes more concentrated)	Common SAT question: 'If the highest value is removed, how does SD change?' Answer: decreases (if it was far from mean)

12. Type 8: Probability (Single and Conditional)

Type 8: Probability -- Single and Conditional | Frequency: High (2-3 per exam)

Recognise by stem: 'What is the probability that a randomly selected [subject] has [property]?' or 'Given that [condition is true], what is the probability of [event]?'

Read first: The complete sample description or two-way table providing all group counts + the specific condition or restriction in the question

✅ Winning approach: P(event) = (number of favourable outcomes) / (total outcomes in sample space). For CONDITIONAL probability (given that...): the denominator changes to the restricted sample. P(A given B) = P(A and B) / P(B) = (count of A and B) / (count of B). Always identify whether the question is restricting the sample before calculating.

⚠️ Classic trap: Using the total sample size as the denominator for a conditional probability question. 'Given that the student is a junior, what is the probability they prefer online learning?' denominator = total juniors (NOT total students). Misidentifying the denominator is the most common probability error on the SAT.

Worked example: Two-way table: 200 students. 80 juniors, 120 seniors. Of 80 juniors: 50 prefer online, 30 prefer in-person. Question: 'What is the probability that a randomly selected junior prefers online learning?' P = 50/80 = 5/8. Denominator: 80 (total juniors), NOT 200 (total students).

Probability Question Type	Denominator	Numerator	Example
Single event from full sample	Total sample size (grand total)	Count of favourable outcomes	P(randomly selected student is female) = total females / all students
Conditional: given group A, what is P(property B)?	Total count of group A	Count with BOTH group A AND property B	P(online \| junior) = juniors who prefer online / all juniors
Both A and B	Total sample	Count with BOTH properties simultaneously	P(female AND prefers online) = females preferring online / all students
At least one of...	Total sample or total trials	Count where at least one condition is satisfied (easier: 1 - P(none)	P(at least one head in 2 flips) = 1 - P(all tails) = 1 - 1/4 = 3/4

13. Type 9: Data Displays With Context (Inference Questions)

Type 9: Data + Context Inference | Frequency: High (2-3 per exam)

Recognise by stem: 'Based on the data, which of the following conclusions is best supported?' or 'Which statement about [variable] is supported by the data?' or 'The data in the table best supports which claim?'

Read first: The full passage describing the study methodology + the data display + answer choices (which contain both supportable and unsupportable conclusions)

✅ Winning approach: Accept conclusions that are directly stated or directly calculable from the data. Reject conclusions that: (a) go beyond the scope of the sample, (b) imply causation from correlation, (c) generalise beyond the study population, (d) make predictions the data cannot support. The correct answer is the one that is exactly and only what the data shows -- not an interesting extension of it.

⚠️ Classic trap: Accepting a conclusion that implies causation from a correlation. If data shows that students who sleep more score higher on tests, the data SUPPORTS the correlation between sleep and test scores -- it does NOT support 'sleeping more CAUSES higher test scores.' The SAT specifically tests this distinction. Causation requires experimental design (randomised control); correlation is shown by observational data.

Worked example: Study: survey of 500 college students; those who exercise 3+ days/week report higher GPA. Conclusion A: 'Exercise causes higher GPA among college students.' WRONG -- correlation, not causation. Conclusion B: 'In this sample, students who exercised more frequently reported higher GPAs.' CORRECT -- directly reflects the data without implying causation.

Conclusion Type	Supportable by Data?	Key Signal Words
What the sample shows, in the sample	YES -- always supportable	'In the sample surveyed...', 'Among the participants...', 'Based on the study...'
Correlation between two variables	YES -- if the data displays a relationship between them	'Associated with', 'related to', 'tended to', 'on average'
Causation from observational study	NO -- correlation does not equal causation	'Caused by', 'results in', 'leads to', 'because of' -- not supportable from observation
Generalisation beyond the study group	NO -- if the study used a specific sample	'All students', 'people in general', 'always' -- too broad unless study was a random sample of that population
Prediction beyond data range	NO -- extrapolation beyond displayed data is not directly supported	'By 2050, the trend will continue...' -- unsupportable without stated model
What data does NOT show	YES -- a conclusion can be supported by noting what is absent	'The data does not support the claim that...' -- supportable

14. Type 10: Comparing Two Data Sets

Type 10: Comparing Two Data Sets | Frequency: Moderate (1-2 per exam)

Recognise by stem: 'Compared to Group A, Group B has a [greater/smaller/equal] [mean/median/range/SD]?' or 'Which of the following statements correctly compares the two data sets?'

Read first: Both data sets or displays being compared + what specific measure is being compared

✅ Winning approach: Calculate or estimate the comparison measure for BOTH groups. Then apply the correct comparison language: 'greater mean' means one average is numerically higher. 'Greater variability' means one SD or range is numerically larger. When comparing on multiple dimensions (e.g., 'Group A has greater median AND greater SD'), verify each independently.

⚠️ Classic trap: Confusing 'greater mean' with 'greater median' or 'greater variability' with 'greater range.' These are all different measures. Answer choices on comparison questions often mix and match these measures correctly and incorrectly. Eliminate choices that misstate any single comparison, even if other parts of the choice are correct.

Worked example: Group A test scores: mean 75, median 72, SD 8. Group B: mean 71, median 74, SD 12. Which statement is correct? A: 'Group A has higher mean AND higher median.' FALSE -- A has higher mean (75>71) but LOWER median (72<74). B: 'Group B has greater variability.' TRUE -- Group B's SD (12) exceeds Group A's SD (8). Answer: B.

15. The Desmos Advantage for Data Analysis Questions

Data Analysis Situation	Desmos Use	Time Saved
Scatter plot with regression line equation given	Enter the equation, evaluate at a specific x-value by typing it	15-20 seconds vs manual substitution
Calculating mean from a large data set	Type all values into a list: mean([1,2,3,4,...]) -- Desmos computes it instantly	30-60 seconds vs manual sum + division
Checking percentage calculations	Use Desmos as a precise calculator for part/whole percentage	5-10 seconds
Verifying probability fractions	Convert fraction to decimal in Desmos to match with percentage answer choices	5-10 seconds
Comparing two regression lines	Enter both equations graphically and compare slopes and y-intercepts visually	20-30 seconds vs algebraic comparison

Desmos -- Scientific/Graphing Calculator (same as Bluebook)

✅ Desmos Mean Calculation: For data analysis questions involving mean calculation from a table or chart, type the values as a list in Desmos: mean([12, 15, 18, 22, 30]) and it computes instantly. This eliminates arithmetic errors on multi-value mean calculations and takes 10 seconds rather than 60 seconds of manual addition and division.

16. Data Analysis in the R&W Section: What Students Miss

Data Analysis is not limited to the SAT Math section. The Reading and Writing (R&W) section contains data displays -- bar charts, tables, or line graphs -- paired with short passages, and questions asking students to describe what the data shows or how it relates to the passage's claims. These questions are distinct from R&W reading comprehension questions and require the same data-reading skills as Math PSDA questions.

Feature	R&W Data Questions	Math PSDA Questions
Where they appear	Reading & Writing section (both modules)	Math section (both modules)
Data display type	Usually a single simple bar chart, table, or line graph	All 10 types including scatter plots, two-way tables, and complex displays
What the question asks	'Which choice most accurately describes the data shown?' or 'What data from the chart supports the claim in the passage?'	Numerical calculations, probability, statistical measures, inference
Calculation required	Rarely -- usually direct reading and comparison	Often -- percentages, averages, probability
Strategy	Read passage claim first, then find data that confirms or contradicts it	5-step universal strategy: stem first, labels, locate data, calculate, verify
How many per exam	3-5 R&W questions involve data displays	15-17 Math PSDA questions

R&W Data Question Trap The most common R&W data question error: choosing an answer that accurately describes what the passage CLAIMS rather than what the DATA SHOWS. These may be different. The question asks about the data. An answer that echoes the passage's argument but contradicts the chart is wrong even if it sounds like the most logical conclusion.

17. The 5 Most Common Data Analysis Errors

Error	How It Happens	Score Cost	Prevention
Wrong denominator in table/probability questions	Student uses grand total instead of row or column total for conditional questions	1-2 questions per exam	Apply the 3-question framework before every two-way table: What is the denominator? What is the numerator? What format does the answer need?
Reading chart labels after data values	Student jumps to the data before reading axes, units, and legend	1-2 questions per exam (unit errors compound)	Mandatory 15-second label-reading step before reading any data value on any chart
Using data points instead of the trend line for prediction	Student reads a nearby data point rather than the line of best fit value for prediction questions	1 question per exam	'Based on the line of best fit' always means use the LINE, not any point
Confusing causation and correlation in inference questions	Student accepts a causal conclusion from observational data	1-2 questions per exam	Memorise: observational studies show correlation only. Experiments (randomised assignment) can show causation. Reject any conclusion using 'causes' from survey/observational data.
Computing mean when question asks for median (or vice versa)	Student defaults to mean without reading which measure is requested	1 question per exam	Read the question twice: 'What is the MEAN?' or 'What is the MEDIAN?' Underline the specific measure before calculating.

18. Data Analysis Practice Plan

Week	Focus	Daily Practice	Milestone
Week 1	Master Types 1-4 (Bar, Line, Scatter, Two-Way Table)	10 questions per type; apply 5-step strategy on every question; write the denominator before calculating on every table question	All 4 types consistently correct; 3-question framework automatic for two-way tables
Week 2	Master Types 5-8 (Pie, Statistics, SD, Probability)	10 mean/median/mode questions; 5 SD comparison questions; 10 probability questions (mix single and conditional)	Mean vs median distinction automatic; conditional probability denominator correctly identified
Week 3	Master Types 9-10 + Inference and Comparison	10 inference questions: practice eliminating causation conclusions from observation; 5 two-set comparison questions	Causation vs correlation distinction applied automatically; no comparison errors
Week 4	R&W data questions + full section timed	5 R&W data questions from official materials; one full timed Math section with all 10 types integrated	R&W data questions reliable; PSDA questions averaging under 60 seconds each
Week 5+	Error analysis + official past exams	After each practice section: categorise every wrong data analysis answer by error type (wrong denominator, wrong measure, causation trap, label misread)	Error pattern identified and targeted; consistent 90%+ on PSDA questions

College Board -- Official SAT Practice Tests (Bluebook)

Khan Academy -- SAT Problem Solving and Data Analysis Practice

College Board -- SAT Math Domain: Problem Solving & Data Analysis

Ready to Start Your SAT Journey?

EduShaale's Digital SAT program is built for students targeting 1400+. Small batches, adaptive mocks, personalised mentorship, and a curriculum fully aligned to the 2026 Digital SAT format.

📞 Book a Free Demo Class: +91 90195 25923

🌐 www.edushaale.com/sat-coaching-bangalore

🧪 Free Mock Test: testprep.edushaale.com

✉️ info@edushaale.com

Book A Free Demo Session

19. Frequently Asked Questions (12 FAQs)

Based on Digital SAT specifications and common student questions about data analysis questions.

What is the Data Analysis section of the SAT?

Problem Solving and Data Analysis (PSDA) is one of the four domains of SAT Math, accounting for approximately 30% of the 44-question Math section -- roughly 13-15 questions per exam. PSDA questions test a student's ability to read and interpret data from charts (bar charts, line graphs, scatter plots), tables (especially two-way tables with categorical data), and statistical summaries (mean, median, standard deviation, probability). Unlike Algebra questions, PSDA questions provide all necessary information in the display -- the primary skill is reading and interpreting that information accurately, not performing complex calculations.

How many data analysis questions are on the SAT?

Approximately 13-15 of the 44 SAT Math questions come from the Problem Solving and Data Analysis domain -- about 30% of the Math section. Additionally, the Reading and Writing section contains 3-5 questions paired with data displays (charts or tables), bringing the total number of data-related questions on the full SAT to approximately 16-20. PSDA is the largest single domain in SAT Math, making it the single most impactful domain for overall Math score improvement.

How do I read a two-way table on the SAT?

Two-way tables show two categorical variables simultaneously. The key strategy: before calculating, identify the correct denominator by reading the question carefully. If the question asks 'what fraction of all respondents had X' -- denominator is the grand total (bottom-right cell). If it asks 'of the females, what fraction had X' -- denominator is the female row total. If it asks 'of those who had X, what fraction were female' -- denominator is the X column total. The numerator is always the specific cell at the intersection of the row and column conditions specified. Apply the 3-question framework: What is the denominator? What is the numerator? What format is needed (fraction, percentage, count)?

What is the difference between correlation and causation on the SAT?

Correlation means two variables tend to change together -- as one increases, the other tends to increase (positive) or decrease (negative). Causation means one variable directly causes the other to change. The critical SAT distinction: observational studies (surveys, retrospective data collection) can demonstrate correlation only -- they cannot establish causation. Only randomised controlled experiments (where subjects are randomly assigned to conditions) can support causal conclusions. SAT inference questions frequently offer answer choices that imply causation from observational data -- these are always wrong. Reject any answer choice that says one variable 'causes,' 'results in,' or 'leads to' another if the data came from a survey or observational study.

What is a line of best fit and how do I use it on the SAT?

A line of best fit (regression line) is a straight line drawn through a scatter plot that best represents the overall trend of the data points. On SAT questions, the line of best fit is used for predictions: when asked 'based on the line of best fit, what is the predicted value of Y when X = [value]?' -- substitute the x-value into the line's equation (if given) or read directly from the line at that x-value (if only the graph is shown). Critical: use the LINE, not any individual data point. A data point at exactly x=8 and y=45 does not mean the predicted y-value at x=8 is 45 -- the line might pass through y=43 at x=8. Always use the line for prediction questions.

How do mean, median, and mode differ on the SAT?

Mean (average) = sum of all values divided by the count. Median = the middle value when sorted (or the average of the two middle values for even-count sets). Mode = the most frequently occurring value. The SAT tests when each measure is most appropriate: for data with significant outliers, the median is usually the 'better representation' of typical values because outliers heavily distort the mean but move the median only slightly. A single very large value can increase the mean dramatically while the median barely changes. SAT questions frequently ask which measure 'best represents the typical value' for a skewed data set -- the answer is almost always median.

What is standard deviation and how is it tested on the SAT?

Standard deviation (SD) measures how spread out data values are from the mean. A low SD means values cluster tightly around the mean. A high SD means values are spread widely. The SAT does not ask students to calculate SD manually. Instead, it tests conceptual understanding: which data set has greater SD (compare visual or described spread); how does SD change when a value is added (adding near the mean decreases SD; adding far from the mean increases SD); and what does a specific SD tell us about the distribution ('the mean is 70 and SD is 10; a score of 90 is 2 standard deviations above the mean'). No formula computation is required.

How do I calculate probability on the SAT?

Basic probability: P(event) = number of favourable outcomes / total outcomes in the sample space. The most important SAT probability skill is correctly identifying the sample space (denominator). For simple probability: denominator = total sample size. For conditional probability ('given that [condition], what is the probability of [event]?'): denominator = the restricted group size, not the total sample. For two-way table probability questions: find the cell at the intersection of the two conditions (numerator) and divide by the appropriate total -- grand total for unconditional, row or column total for conditional.

Are data analysis questions harder than algebra on the SAT?

For most students, Data Analysis questions are easier than Algebra or Advanced Math questions because all information is provided on the page and the calculations are simpler (arithmetic, percentages, averages) rather than algebraic manipulation. The difficulty in Data Analysis comes from careful reading -- misidentifying which row or column a question refers to, or confusing the denominator in conditional probability questions. Students who are strong in algebra but careless readers sometimes perform better on Algebra than on Data Analysis. The most effective preparation approach: treat every Data Analysis question as a reading problem first and a calculation problem second.

What types of charts appear on the SAT?

The SAT includes bar charts, histograms, line graphs, scatter plots (with and without a line of best fit), two-way tables, pie charts/donut charts, and occasionally box plots or dot plots. Bar charts and two-way tables are the most frequently appearing data displays. Scatter plots appear on 2-3 questions per exam. Pie charts appear less frequently. Box plots and dot plots are rare but are tested. The core skill is the same for all display types: read the labels first, identify the specific data the question references, and calculate from there.

How do I improve my SAT Data Analysis score quickly?

The fastest improvements come from fixing the 5 common errors in this guide: (1) use the correct denominator for table and probability questions -- always identify the restricted sample first; (2) read all chart labels before reading any data values; (3) use the trend line (not data points) for scatter plot predictions; (4) reject causation conclusions from observational data; (5) identify which statistical measure the question asks for before calculating. These five fixes can be implemented in the next practice test without learning any new content. For most students, systematic error analysis after each practice test produces 3-5 additional correct PSDA answers within 2 weeks.

Does the SAT test statistics like normal distribution or z-scores?

The Digital SAT does not test formal statistical concepts like z-scores, normal distribution probabilities, or hypothesis testing at a college statistics level. The statistics content on the SAT is limited to: measures of central tendency (mean, median, mode), measures of spread (range, standard deviation conceptually), basic probability (single and conditional), data interpretation from displays, and making inferences about what data does and does not support. Students do not need college statistics preparation -- the required statistical concepts are all at the descriptive statistics level and are fully covered in this guide.

20. EduShaale -- Expert SAT Coaching

EduShaale builds SAT Data Analysis mastery through systematic type-by-type strategy instruction and targeted error analysis using official Bluebook practice materials.

Type-by-Type Strategy Instruction: We teach all 10 Data Analysis question types with their specific strategies in sequence, building from the most common types (bar charts, two-way tables) to the more nuanced (inference, SD comparison). Each type is drilled to the point where the strategy triggers automatically.
Two-Way Table 3-Question Framework: The two-way table denominator error is the most expensive recurring error in PSDA. We build the 3-question framework (denominator? numerator? format?) as a pre-calculation reflex that eliminates this error entirely within 2-3 practice sessions.
Causation vs Correlation Drilling: Inference questions are among the most commonly missed PSDA questions despite having the simplest strategy: reject causation from observation. We drill this distinction until it triggers automatically on any inference question.
R&W Data Questions Included: We cover the 3-5 R&W data questions per exam explicitly -- most SAT prep programmes focus only on Math PSDA. Including R&W data questions in preparation adds 3-5 additional reliable points.

📋 Free Digital SAT Diagnostic — test under real timed conditions at testprep.edushaale.com

📅 Free Consultation — personalised study plan based on your diagnostic timing data

🎓 Live Online Expert Coaching — Bluebook-format mocks, pacing training, content mastery

💬 WhatsApp +91 9019525923 | edushaale.com | info@edushaale.com

EduShaale's finding: Data Analysis is the SAT Math domain where students most consistently leave points behind -- not because the content is hard, but because the strategy is not taught explicitly. Students who learn the 10 type-specific strategies and the 5-step universal approach add an average of 4-7 Data Analysis questions to their correct-answer count within 3 weeks. That translates to 40-80 scaled score points from a single domain.

21. References & Resources

Official College Board Resources

SAT Data Analysis Strategy Guides

EduShaale SAT Resources

SAT and Bluebook are registered trademarks of the College Board. All Digital SAT content domain information based on College Board specifications as of May 2026. This guide is for educational purposes only.