College Board ยท Chief Reader

AP Psychology Chief Reader ReportsWhat Examiners Actually Want

The post exam reports from Chief Reader Elliott Hammer explaining how Readers scored the AAQ and EBQ, what cost points, and the structural rules that govern every response.

AP Psychology Chief Reader Report archive (2022 to 2025)

Type
Year

4 of 4 resources

2025

1 file
  • 2025 AP Psychology Chief Reader Report (new format: AAQ plus EBQ)

    Chief Reader Report

    Open PDF

2024

1 file
  • 2024 AP Psychology Chief Reader Report (previous exam format, pre-2025 restructure)

    Chief Reader Report

    Open PDF

2023

1 file
  • 2023 AP Psychology Chief Reader Report (previous exam format, pre-2025 restructure)

    Chief Reader Report

    Open PDF

2022 and earlier

1 file
  • AP Psychology Chief Reader Reports archive (2022 and earlier)

    Chief Reader Report ยท official archive

    Open PDF

Post exam analysis of student FRQ responses written by the Chief Reader

What it is

Elliott Hammer, Professor of Psychology, Xavier University of Louisiana

Chief Reader

Late summer after the May exam administration

Published

334,960 students scored; 1,005 Readers; global mean 3.20

2025 exam scope

May 2025 (Article Analysis Question plus Evidence Based Question)

New format since

2025 (new format); 2024 and 2023 (previous format, labeled)

Reports synthesized here

What do AP Psychology Chief Reader Reports reveal?

The AP Psychology Chief Reader Report describes the structural scoring rules Readers applied to the AAQ and EBQ, the examiner perspective account of which response patterns earned or lost points, and the consistent failure modes observed across hundreds of thousands of responses in the 2025 administration.

After each May administration, the Chief Reader publishes a report describing how student responses to every free response question were evaluated: what successful responses contained, what the common failure patterns looked like, and what teachers should reinforce. The report is written for teachers, but for a student preparing for the exam it is the most direct public account of how the scoring rules were applied across hundreds of thousands of real responses. It tells you not just what the rubric required, but how students fell short of it and why. Reading it alongside that year's question booklet and scoring guidelines provides the full picture: the prompt, the rubric, and the examiner's own description of what separated responses that earned points from those that did not.

Multi year synthesis: the persistent themes

An important framing note: the AP Psychology exam was restructured for May 2025. The Article Analysis Question (AAQ) and Evidence Based Question (EBQ) replaced the previous format's Concept Application and Research Design questions. Only one Chief Reader Report, the 2025 report, describes the current exam. The 2023 and 2024 reports describe the previous format and are labeled accordingly throughout this page. A genuine multi year synthesis of the new format will be possible only from the 2026 report onward, when two administrations of AAQ and EBQ data exist. What follows is a detailed reading of the first new format report, authored by Chief Reader Elliott Hammer of Xavier University of Louisiana, covering 334,960 students scored by 1,005 Readers with a global exam mean of 3.20. The 2025 report centers on two structural principles the Chief Reader states emphatically. The first is strict task verb adherence. Chief Reader Hammer directs students and teachers to the task verb definitions in the AP Psychology Course and Exam Description at page 150: Identify, State, Describe, and Explain each require a different level of response, and responses that treat all four as interchangeable lose points. The second is the no contradiction rule, which the report states explicitly in the context of both the AAQ and the EBQ. On Part A of the AAQ, naming multiple research methods (for example, experiment plus correlation plus meta analysis) alongside the correct answer is treated by Readers as self contradictory and does not earn the point. On Part D of the AAQ, naming ethical guidelines not explicitly presented in the source summary (such as confidentiality, debriefing, or IRB) is treated as contradictory and does not earn the point. The principle is consistent: one correct, specific, sourced answer earns the point; hedging or adding alternatives voids it. On the AAQ as a whole, the 2025 report draws a sharp distinction between operational specificity and conceptual paraphrase. Part B of the AAQ (Research Variable, 69% success rate, the lowest scoring part of Question 1) required students to state the measurable, quantified definition used by the researchers: that high misinformation meant 80% of the sentences in the narrative, or 32 out of 40, contained incorrect information. Responses that described the misinformation effect in general terms, or named the topic of the study, did not earn the point. The Chief Reader is explicit: the operational definition is present in the source summary and must be read precisely, not paraphrased. Similarly, Part C of the AAQ (Statistic Interpretation, 65% success) required stating what the difference in means indicated, including the direction of the difference, not merely restating the numbers or defining the word mean. On the EBQ, the 2025 report identifies the reasoning parts as the highest difficulty scoring areas on the exam. Part B(ii) (Reasoning with concept integration, 2 points) was earned by only 22% of students in full. Part C(ii) (Reasoning with a different concept, 2 points) was earned in full by only 14% of students, making it the single hardest scoring area of the entire exam. The Chief Reader states clearly that providing a definition of a psychological concept is not sufficient to earn these points. The concept must be integrated into the explanation of how the evidence supports the claim, not merely named or defined alongside it. For Part C specifically, the concept used must also be different from the one used in Part B, and the evidence must come from a different source than the one used in Part B(i). Collapsing B and C into a single conceptual thread is the most common failure pattern on the EBQ. For historical context, both labeled as previous exam format: on the 2024 exam, the Concept Application question (Q1) had a mean of 3.65 and the Research Design question (Q2) had a mean of 3.32, with a global exam mean of 2.97. On the 2023 exam, the Concept Application question (Q1) had a mean of 2.31 and the Research Design question (Q2) had a mean of 3.24, with a global exam mean of 2.89. The 2025 new format produced a global mean of 3.20 with AAQ mean 4.97 of 7 and EBQ mean 4.07 of 7. Because the exam structure changed fundamentally, these figures reflect a structural shift, not a change in exam difficulty.

Top student errors documented in recent reports

  1. 01

    Hedging by naming multiple answers on single answer parts

    The 2025 Chief Reader Report identifies the no contradiction rule as a governing scoring principle. On AAQ Part A, Readers saw responses that named several research methods alongside the correct one (experiment) and withheld the point. On AAQ Part D, responses that named ethical guidelines not in the source summary were treated as contradictory. The examiner perspective is that one precisely correct answer earns the point; adding alternatives signals uncertainty and voids it. This is a structural rule stated in the Scoring Guidelines General Considerations, not a content knowledge problem.

    AP Psychology Chief Reader Report 2025; AP Psychology Scoring Guidelines 2025, General Considerations

  2. 02

    Paraphrasing instead of citing the operational definition precisely

    Part B of the 2025 AAQ (Research Variable) was the lowest scoring part of Question 1, earned by only 69% of students. The Chief Reader notes that successful responses stated the measurable, quantified definition exactly as the researchers used it in the source: 80% of sentences, or 32 of 40, containing incorrect information. Responses that described the misinformation effect in general, or named the topic of the study rather than the operational measure, did not earn the point. Readers reward operational, quantifiable specificity; conceptual paraphrase does not substitute for it.

    AP Psychology Chief Reader Report 2025, Question 1 Part B

  3. 03

    Restating statistics without describing what they indicate

    Part C of the 2025 AAQ (Statistic Interpretation, 65% success) required responses to describe what the difference in group means indicated in the context of the study, including the direction of the difference. Readers saw three failure patterns: merely reporting the means as numbers, reversing the direction of the difference, and defining the word mean rather than interpreting what the mean difference showed about the groups. The examiner's standard is that a statistic has meaning only when placed in the context of the study's finding, not when reported in isolation.

    AP Psychology Chief Reader Report 2025, Question 1 Part C

  4. 04

    Defining a psychological concept rather than integrating it into reasoning

    The 2025 Chief Reader Report identifies the reasoning parts of the EBQ (Parts B(ii) and C(ii)) as the hardest scoring areas of the exam. Chief Reader Hammer states explicitly that providing only a definition of a psychological concept does not earn the second reasoning point. Students must integrate the concept into their explanation of how the evidence supports the claim. Only 22% of students earned both points on B(ii) and only 14% on C(ii), making C(ii) the single hardest scoring unit on the entire exam. The distinction between defining and integrating is the threshold that separates average from high scoring responses on the EBQ.

    AP Psychology Chief Reader Report 2025, Question 2 Parts B(ii) and C(ii)

  5. 05

    Collapsing EBQ Parts B and C into the same source and concept

    The EBQ scoring rules require that Part C(i) draw on a different source than Part B(i), and that Part C(ii) use a different psychological concept than Part B(ii). The 2025 report identifies this as a common failure pattern: students develop a coherent argument around one source and one concept and then use the same source and concept in both B and C. This collapses the two point opportunities into one and forfeits the second pair entirely. The Chief Reader advises reading all three sources before beginning and noting which results and which psychological concepts each source suggests.

    AP Psychology Chief Reader Report 2025, Question 2 Parts C(i) and C(ii)

  6. 06

    Providing evidence about method rather than a cited result

    On EBQ Part B(i) and Part C(i), the Chief Reader notes that responses describing how a study was conducted, or who the participants were, rather than citing a specific, accurate, relevant result from the study, did not earn the evidence point. The examiner distinction is between method description and result citation. A result is what the study found; a method is how the study was conducted. Both parts require citing a result, and the citation must name the specific source. Responses without a source citation did not earn the point.

    AP Psychology Chief Reader Report 2025, Question 2 Parts B(i) and C(i)

What do AP Psychology Readers consistently reward?

AP Psychology Readers reward precision over breadth: one correct, source grounded, specifically worded answer that matches the task verb's required depth, cites a result rather than a method, and on the EBQ integrates a psychological concept into the causal reasoning rather than defining it alongside the evidence.

The 2025 Chief Reader Report describes high scoring responses with four recurring characteristics. First, they match the response to the task verb: an Identify response names the item; a State response gives the specific value or definition; a Describe response explains what something is; an Explain response draws a causal connection. Chief Reader Hammer directs teachers explicitly to the task verb definitions in the CED at page 150. Second, they cite results rather than methods or topic descriptions when evidence is required. Third, they integrate psychological concepts into reasoning rather than defining those concepts alongside the argument. Fourth, on the EBQ, they draw on two genuinely different sources and two genuinely different psychological concepts, which is what the scoring rules require and what most students fail to do on the harder reasoning parts. The gap between a score of 3 and a score of 5 on the AP Psychology exam is not primarily a content knowledge gap; it is a precision gap between responses that state and those that demonstrate.

How should current students use the AP Psychology Chief Reader Reports?

Read the 2025 AP Psychology Chief Reader Report alongside that year's question booklet and scoring guidelines to understand the structural scoring rules, the task verb requirements, and the specific response patterns that earned or lost points on each part of the AAQ and EBQ.

The most productive way to use the 2025 AP Psychology Chief Reader Report is as an annotated scoring guide. Start with the AAQ and EBQ from the 2025 exam. Read the scoring guidelines for each part to understand the rubric requirement. Then read the Chief Reader Report's section on each part to understand what common responses looked like and why they did or did not earn the point. Pay particular attention to the misconceptions table, which shows side by side examples of responses that failed and responses that succeeded. For the EBQ, the Chief Reader's advice on Parts B(ii) and C(ii) is especially high yield: integrating a concept means weaving it into the causal explanation, not appending a definition. Because the 2025 report is the only available new format report, also read the scoring guidelines directly. The CED task verb definitions at page 150 apply to both the AAQ and the EBQ and are worth reviewing alongside the report.

The Chief Reader checklist

  1. 1

    Look up the task verb before writing: Identify, State, Describe, and Explain each require a different response depth. The CED defines all four at page 150.

  2. 2

    On single answer parts of the AAQ (research method, ethical guideline), commit to one answer and do not list alternatives. Naming multiple options is treated as self contradiction and voids the point.

  3. 3

    When a part asks for an operational definition, find and quote the specific, quantifiable measure the researchers used in the source summary. Do not paraphrase or describe the concept generally.

  4. 4

    When interpreting a statistic, state what the number indicates in the context of the study, including the direction of any difference. Do not restate the number alone or define the statistical term.

  5. 5

    On the EBQ, read all three sources before writing. Note which results and which psychological concepts each source suggests. Parts B and C require different sources and different concepts.

  6. 6

    To earn the second point on EBQ B(ii) and C(ii), integrate the psychological concept into the causal chain of your reasoning. Stating that a concept is relevant, or defining it separately, earns no points; the concept must explain why the evidence supports the claim.

  7. 7

    On the EBQ, evidence must be a cited result from a named source, not a description of how the study was conducted or who the participants were. Results answer the question of what the study found.

  8. 8

    After drafting any FRQ response, verify that you have addressed only what the task verb requires for each part and that you have not contradicted yourself by naming alternatives on single answer parts.

AP Psychology Chief Reader Report FAQ

What is the AP Psychology Chief Reader Report?

After each May exam, the Chief Reader publishes a report describing how Readers evaluated student responses to every free response question: what successful responses contained, what the common failure patterns were, and what teachers should reinforce. The 2025 report was authored by Elliott Hammer, Professor of Psychology at Xavier University of Louisiana, and covers 334,960 students scored by 1,005 Readers. It is the most direct public account of how the AAQ and EBQ scoring rules were applied in practice.

Where can I read AP Psychology Chief Reader Reports?

This page links directly to College Board's hosted reports for 2023, 2024, and 2025. The 2025 report describes the current AAQ plus EBQ exam format. The 2023 and 2024 reports describe the previous exam format and are labeled as such. Reports for 2022 and earlier are available through College Board's official past exam questions archive.

What does the 2025 AP Psychology Chief Reader Report say about the AAQ?

Chief Reader Hammer emphasizes two structural rules on the AAQ. First, single answer parts require exactly one correct answer; naming multiple methods or multiple ethical guidelines is treated as self contradiction and voids the point. Second, operational definitions and statistics must be cited with the specific, quantifiable language from the source summary, not paraphrased. Part B (Research Variable) was the lowest scoring part of the AAQ, earned by 69% of students, primarily because responses described the misinformation effect generally rather than citing the 80% or 32 of 40 sentence operational definition.

Why are EBQ Parts B(ii) and C(ii) so hard?

The 2025 report shows that only 22% of students earned both points on Part B(ii) and only 14% on Part C(ii), making C(ii) the single hardest scoring area on the exam. Chief Reader Hammer identifies the reason: students provide a definition of a psychological concept rather than integrating it into their explanation of how the evidence supports the claim. The second point on each reasoning part requires the concept to function as part of the causal argument, not as a definition appended to the evidence.

What does it mean to integrate a psychological concept in the EBQ?

Integration means using the concept to explain why the evidence supports the claim, not defining the concept separately. A response that states 'social facilitation is the tendency to perform better in front of others' and then presents evidence has defined the concept. A response that states 'the finding that participants completed a well learned task faster with an audience is an example of social facilitation, because the arousal generated by an audience enhances performance on tasks that are already mastered' has integrated the concept into the reasoning. The Chief Reader Report 2025 makes this distinction explicit.

Does the AP Psychology exam have a curve?

AP Psychology uses criterion referenced scoring, not a norm referenced curve. The composite to AP score boundaries are set annually through standard setting rather than through a fixed percentage cutoff. The 2025 pass rate of 70.5% and global mean of 3.20 on the new format are not directly comparable to the 2023 and 2024 pass rates on the previous format, because the exam structure changed. Plan for a demanding standard, not a lenient curve.

How are the 2023 and 2024 Chief Reader Reports useful for the current exam?

The 2023 and 2024 reports describe the previous exam format (Concept Application and Research Design questions) and are not directly applicable to the current AAQ and EBQ structure. However, both reports reflect Dr. Hammer's consistent emphasis on applying psychological concepts precisely to a specific context rather than defining them in the abstract, and on using the correct operational language from the prompt. That underlying principle carries forward into the new format. Label any reference to 2023 and 2024 findings as applying to the previous format when discussing exam preparation.

What is the no contradiction rule in AP Psychology FRQ scoring?

The no contradiction rule, stated in the 2025 AP Psychology Scoring Guidelines General Considerations and reinforced in the Chief Reader Report, holds that responses that provide the correct answer plus one or more incorrect alternatives do not earn the point. For AAQ Part A (research method) and Part D (ethical guideline), naming additional options alongside the correct one is treated as self contradiction. The examiner's reasoning is that a response naming both correct and incorrect answers does not demonstrate confident command of the knowledge.

When will multi year AP Psychology Chief Reader Report synthesis be available?

Genuine year over year synthesis of the new AAQ and EBQ format will be possible from the 2026 Chief Reader Report onward, when two administrations of new format data exist. The 2025 report is the only available new format report. This page will be updated when the 2026 report is published. Until then, the single year 2025 findings represent the most authoritative publicly available examiner perspective on the current exam.

What task verbs appear on the AP Psychology FRQ and how are they different?

The AP Psychology CED defines four task verbs used on the free response section (CED page 150). Identify requires naming the item without explanation. State requires giving a specific value or definition precisely as used in the source. Describe requires explaining what something is or how it works. Explain requires making a causal argument for why or how. The Chief Reader Report 2025 explicitly directs students to these definitions as a first step in planning any response, because a response at the wrong depth level typically does not earn the point even if the content is accurate.

More AP Psychology resources

Train on what AP Psychology examiners actually reward

An AI tutor that works through released AAQ and EBQ questions with you and scores your responses against College Board's official rubrics and the Chief Reader's structural rules.

Start free with Tutorioo