AP Biology Unit 6: Gene Expression and Regulation
AP Biology

AP Biology Unit 6: Gene Expression and Regulation

By JonasJune 9, 202611 min read
Key Takeaways
AP Biology Unit 6 covers 12-16% of the exam, making gene expression one of the two or three highest-weight topics in the course.
The Central Dogma sequence (DNA transcription to mRNA, mRNA translation to protein) is the foundation for almost every Unit 6 FRQ.
Four mutation types tested every year: silent, missense, nonsense, and frameshift. Frameshift mutations are the most disruptive.
The lac operon is the key prokaryotic regulation model; eukaryotes use transcription factors and chromatin remodeling instead.
Connecting Unit 6 to Unit 5: regulation explains why identical genotypes can produce different phenotypes depending on which genes are expressed.

Reading through five years of released AP Biology free-response questionswhile building Tutorioo's Unit 6 practice modules, one pattern became clear immediately: most students who lose points on gene expression FRQs do not lack the vocabulary. They lose points because they cannot trace a DNA sequence step by step through transcription and translation to predict how a specific mutation changes the final protein. That process-tracing skill is exactly what the exam tests, and it rewards students who practice the steps in order rather than those who memorize the definitions.

Unit 6 carries 12-16% of the exam, tied with Unit 3 as one of the heaviest individual units in AP Biology. The vocabulary feels memorization-heavy at first (promoter, RNA polymerase, ribosome, tRNA, anticodon, lac operon). But the actual FRQ points go to students who can apply those terms to a new scenario: given this mutation, what happens to the protein? Given this change in the operator, does the gene turn on or stay off?

This walkthrough traces every concept step by step, with worked examples for the two FRQ patterns that appear almost every year.

What Does AP Biology Unit 6 Cover?

AP Biology Unit 6 covers gene expression and regulation, the molecular mechanisms that convert genetic information stored in DNA into functional proteins. According to the College Board Course and Exam Description, Unit 6 spans seven core topics: DNA structure and replication (brief review), transcription, RNA processing, translation, mutations, prokaryotic gene regulation, and eukaryotic gene regulation including epigenetics.

Unit 6 Exam Weight: 12-16%

12-16%
of AP Biology exam questions
Unit 6 is tied with Unit 3 (Cellular Energetics) as one of the highest-weight units. Expect 8-11 multiple-choice questions per exam.
Unit 6 TopicTranscription
What the Exam TestsTrace DNA template to mRNA; identify steps of initiation, elongation, termination
FrequencyEvery year
Unit 6 TopicRNA Processing
What the Exam TestsExplain splicing, 5' cap, poly-A tail; understand introns vs exons
FrequencyMost years
Unit 6 TopicTranslation
What the Exam TestsUse genetic code to convert codons to amino acids; ribosome A/P/E sites
FrequencyEvery year
Unit 6 TopicMutations
What the Exam TestsPredict protein change from silent, missense, nonsense, or frameshift mutation
FrequencyEvery year
Unit 6 TopicLac Operon
What the Exam TestsPredict gene expression given lactose presence/absence or repressor mutation
FrequencyMost years
Unit 6 TopicEukaryotic Regulation
What the Exam TestsExplain role of transcription factors, enhancers, and epigenetic marks
FrequencyFrequently

Source: AP Biology Course and Exam Description. Topics marked Every Year have appeared in FRQs in 4 or more of the last 5 released exam sets.

How Does Transcription Work Step by Step?

Transcription converts a segment of DNA into a complementary messenger RNA strand. RNA polymerase reads the DNA template strand in the 3' to 5' direction and builds the mRNA strand in the 5' to 3' direction. RNA nucleotides pair with DNA nucleotides using the same rules as DNA replication, with one critical difference: uracil (U) replaces thymine (T) in the RNA strand.

Initiation, Elongation, and Termination

1

Initiation

RNA polymerase binds to the promoter sequence, a specific DNA region upstream of the gene. In prokaryotes, the sigma factor helps RNA polymerase recognize the promoter. The double helix unwinds locally, forming a transcription bubble that exposes the template strand.

2

Elongation

RNA polymerase moves along the template strand 3' to 5', adding complementary RNA nucleotides to the 3' end of the growing mRNA chain. The mRNA grows 5' to 3'. Where the DNA template has an A, the mRNA gets a U; where the template has T, the mRNA gets A; G pairs with C and C pairs with G.

3

Termination

In prokaryotes, RNA polymerase reaches a terminator sequence. The mRNA folds back on itself, forming a hairpin structure that stalls polymerase and causes the mRNA to detach. In eukaryotes, a cleavage signal sequence triggers release of the transcript for further processing.

Central Dogma: Transcription and TranslationThree-stage animated flow showing DNA double helix transcribed into mRNA, then mRNA translated into a protein chain at the ribosome, illustrating the central dogma of molecular biologyThe Central Dogma of Molecular BiologyDNATemplate strand · 3'→5'TranscriptionRNA polymerasemRNAAUGCCCGAAUUASingle strand · 5'→3'TranslationRibosomePROTEINMetProGluLeuPolypeptide chain
DNA transcription produces mRNA; mRNA translation at the ribosome produces a protein. Both steps must occur in order for gene expression to produce a functional product.

RNA Processing: From Pre-mRNA to mRNA

In eukaryotes, the initial transcript is a pre-mRNA that requires three processing steps before it can leave the nucleus and be translated. Prokaryotes skip this step entirely, which is why bacterial genes can be translated almost immediately as they are transcribed.

The Three RNA Processing Steps

5' capping:a modified guanosine cap is added to the 5' end, protecting the mRNA from degradation and helping ribosomes recognize where to start translating.

3' polyadenylation:a poly-A tail (150-200 adenine nucleotides) is added to the 3' end, further protecting the mRNA and signaling export from the nucleus.

Splicing: introns (non-coding sequences) are removed by protein complexes called spliceosomes, and exons (coding sequences) are joined together into the mature mRNA that reaches the ribosome.

The AP exam occasionally tests alternative splicing, where the same pre-mRNA can be spliced in different ways to produce different proteins from the same gene. This explains why humans have roughly 20,000 protein-coding genes but produce well over 100,000 distinct proteins.

How Does Translation Convert mRNA to Protein?

Translation reads the mRNA sequence three nucleotides at a time, with each three-nucleotide codon specifying one amino acid. The ribosome catalyzes the formation of peptide bonds between amino acids, building a polypeptide chain from the N-terminus (amino terminus) toward the C-terminus (carboxyl terminus). Translation begins at the start codon AUG (which codes for methionine) and ends at one of three stop codons: UAA, UAG, or UGA.

Reading the Genetic Code

The genetic code has three key properties that the AP exam tests directly. First, it is degenerate: most amino acids are coded by more than one codon (for example, leucine has six codons, glycine has four). This degeneracy is why some mutations are silent. Second, the code is non-overlapping: each nucleotide belongs to exactly one codon. Third, the code is nearly universal: the same codon specifies the same amino acid in nearly every organism, which is evidence for the common ancestry of all life.

The ribosome carries three binding sites for tRNAs. The A site (aminoacyl site) holds the incoming tRNA carrying the next amino acid. The P site (peptidyl site) holds the tRNA attached to the growing polypeptide. The E site (exit site) holds the empty tRNA about to leave the ribosome. During elongation, the ribosome shifts one codon at a time, cycling each tRNA from A to P to E as the chain grows.

Ribosome A, P, and E Sites During TranslationCross-section view of ribosome showing mRNA threading through the ribosome, with three tRNA sites: E site (exit, empty tRNA), P site (peptidyl, growing polypeptide attached), and A site (aminoacyl, incoming charged tRNA). Labels show codon-anticodon pairing and growing protein chain.Large Subunit (60S)Catalyzes peptide bond formationSmall Subunit (40S)E site · exitP site · peptidylA site · aminoacylGAACCCUUA5'3'→ ribosome movesemptyAlaGrowing chainLeu
The ribosome's three sites cycle tRNAs through the translation process. Each shift moves the ribosome one codon along the mRNA, ejecting the used tRNA at the E site and accepting a new charged tRNA at the A site.

Worked Example: DNA Sequence to Protein

FRQ Process: How to Go from DNA to Protein in Three Steps

The exam question gives: a DNA template strand and asks for the amino acid sequence.

Step 1 (Transcription):Write the mRNA by replacing each DNA base with its RNA complement. Template A pairs with mRNA U; template T pairs with mRNA A; template G pairs with mRNA C; template C pairs with mRNA G. The mRNA grows 5' to 3' as the template is read 3' to 5'.

Step 2 (Identify codons): Divide the mRNA into three-nucleotide groups starting from the AUG start codon.

Step 3 (Translate): Use the codon chart to identify each amino acid. Write the protein sequence from N-terminus to C-terminus. Stop when you reach UAA, UAG, or UGA.

Worked example. The DNA template strand reads (3' to 5'): TAC - GGG - CTT - AAT - ACT

Step 1: Transcribe to mRNA (5' to 3'): AUG - CCC - GAA - UUA - UGA

Step 2: Identify codons: AUG | CCC | GAA | UUA | UGA

Step 3: Translate: AUG = Met (start), CCC = Pro, GAA = Glu, UUA = Leu, UGA = Stop

Protein: Met - Pro - Glu - Leu (four amino acids, then termination)

Now apply a point mutation: change the second T in CTT (template position 10) to A. The template becomes TAC - GGG - CAT - AAT - ACT. The new mRNA reads AUG - CCC - GUA - UUA - UGA. GUA codes for valine, not glutamic acid. The protein becomes Met - Pro - Val - Leu. That is a missense mutation: one amino acid changes, and if that position is in the protein's active site, function changes too.

What Are the Types of Mutations and Their Effects?

Four mutation types appear on the AP Biology exam every year. Each has a distinct effect on the mRNA and protein, and the exam tests whether you can predict which type a given sequence change represents.

Silent, Missense, Nonsense, and Frameshift Mutations

AP Biology Mutation Types ComparisonFour rows showing original DNA codon, the mutated codon, and the resulting effect on protein structure for silent, missense, nonsense, and frameshift mutationsMUTATION TYPEORIGINAL → MUTANTPROTEIN EFFECTSilent(Synonymous)CCC → CCGBoth codons = ProNo changeCodon degeneracy absorbs the changeMissense(Nonsynonymous)GAA → GUAGlu → ValOne amino acid changesMay alter protein shape and functionNonsense(Premature stop)GAA → UAAGlu → STOPProtein truncated earlyTruncated, usually nonfunctionalFrameshift(Insertion/Deletion)GAA UUA → GXA AUU…All downstream codons shiftEntire protein scrambledMost disruptive; usually nonfunctional
Frameshift mutations (bottom row) are typically the most damaging because they alter every codon downstream of the insertion or deletion point.

Predicting Mutation Effects on Protein Function

The AP exam does not stop at identifying mutation type. FRQs ask you to predict whether a mutation actually changes protein function. The answer depends on where in the protein the change occurs.

A missense mutation at the active site of an enzyme usually eliminates enzyme function, because the active site's shape determines substrate binding. The same missense mutation in a structurally unimportant region may have no detectable effect on function. Nonsense mutations always produce a shorter protein; whether that protein retains any function depends on how much of the protein is truncated.

Common FRQ Mistake: Assuming All Mutations Are Harmful

Silent mutations produce no amino acid change and no protein change. A missense mutation that substitutes one amino acid for another with similar chemical properties (for example, one nonpolar amino acid for another nonpolar amino acid) may not alter protein folding or function at all. The exam rewards students who connect mutation type to structural and functional consequence, not those who assume every mutation destroys the protein.

How Does Gene Expression Get Regulated?

Gene regulation controls when and how much of a protein a cell produces. Having a gene does not mean it is always expressed; cells regulate transcription constantly in response to nutrients, signals, and developmental cues. The AP Biology exam tests two major regulatory systems: the prokaryotic operon model and the eukaryotic transcription factor system.

Prokaryotic Regulation: The Lac Operon

The lac operon in E. coli controls the production of enzymes for lactose digestion. It consists of a promoter, an operator region, and three structural genes (lacZ, lacY, lacA). A separate regulatory gene encodes the lac repressor protein that binds the operator.

Lac Operon Regulation DiagramTwo-panel diagram showing the lac operon in E. coli. Left panel: lactose absent, repressor bound to operator, transcription blocked. Right panel: lactose present, allolactose binds repressor, repressor released, RNA polymerase transcribes structural genes.LACTOSE ABSENTGene expression: OFFPromoterOperatorStructural genesRNA PolREPRESSOR(bound)No mRNA producedNo lactase enzyme synthesizedRegulatory gene → active repressorRepressor binds operator tightlyLACTOSE PRESENTGene expression: ONPromoterOperatorStructural genesRNA PolRepressor(inactive)allomRNA ✓mRNA produced from structural genesLactase enzyme synthesizedCell can now metabolize lactoseE. coli lac operon (negative control by repressor protein)
The lac operon switches on only when lactose is available. Allolactose (a lactose derivative) binds the repressor, removes it from the operator, and allows RNA polymerase to transcribe the structural genes.

The AP exam tests the lac operon through mutations. A mutation that inactivates the repressor gene produces a constitutively expressed operon: the structural genes are transcribed all the time, whether lactose is present or not. A mutation that prevents allolactose from binding the repressor locks the repressor on the operator permanently, so the genes never turn on even when lactose is present. Knowing which component each mutation affects, and what that predicts about gene expression, is the heart of most operon FRQ questions.

Eukaryotic Regulation: Transcription Factors

Eukaryotic gene regulation operates through a more complex system than the prokaryotic operon. Transcription factors are proteins that bind to specific DNA sequences near or far from a gene and either activate or repress transcription. Activator proteins recruit RNA polymerase to the promoter; repressor proteins block RNA polymerase access.

Prokaryotic Regulation

  • Operons: one promoter controls several related genes at once
  • Repressor proteins bind operator to block RNA polymerase directly
  • Responds rapidly to environmental nutrients (e.g., lactose, tryptophan)
  • No nucleus: transcription and translation occur simultaneously
  • Example: lac operon (negative control), trp operon (repressible)

Eukaryotic Regulation

  • Each gene has its own promoter region plus distant enhancers and silencers
  • Transcription factors assemble a complex that recruits RNA polymerase
  • Chromatin remodeling (histone acetylation/deacetylation) controls DNA accessibility
  • Nucleus separates transcription from translation; mRNA requires processing first
  • Alternative splicing adds another regulation layer after transcription

What Is Epigenetics and Is It Tested?

Epigenetics refers to heritable changes in gene expression that do not involve changes to the DNA sequence itself. Two epigenetic mechanisms appear on the AP Biology exam: DNA methylation and histone modification.

DNA methylation adds methyl groups to cytosine bases, generally in regions called CpG islands near gene promoters. Methylated promoters are associated with gene silencing. Genes that are heavily methylated are transcribed less or not at all, even though the sequence is intact.

Histone modification changes the structure of chromatin, the DNA-protein complex that packages DNA in the nucleus. Histone acetylation adds acetyl groups to lysine residues on histone tails, reducing the positive charge that makes histones bind DNA tightly. Loosened chromatin (euchromatin) is accessible to transcription factors and RNA polymerase. Histone deacetylation removes acetyl groups, tightening the histone-DNA interaction (heterochromatin) and silencing gene expression.

What the AP Exam Actually Asks About Epigenetics

The College Board does not expect students to memorize specific enzyme names for histone modification or methylation. FRQs in this area typically ask: given that a gene is heavily methylated, predict whether it will be expressed at high, low, or zero levels. Or: given that a histone is acetylated near a gene, predict how chromatin structure changes and whether transcription increases or decreases. Connect the mechanism to the outcome.

How Is Unit 6 Tested on the AP Exam?

Unit 6 content appears in both the multiple-choice section and the free-response section of the AP Biology exam. Multiple-choice questions typically give a scenario (a mutated gene, an operon with a specific change, a cell with unusual gene expression) and ask you to predict the outcome. The free-response questions demand step-by-step process tracing.

High-Yield FRQ Patterns

FRQ PatternDNA sequence to protein
What You Must DoTranscribe template to mRNA, divide into codons, use codon chart to identify amino acids, note start and stop codons
Common ErrorReading the coding strand instead of the template strand, or forgetting that mRNA uses U not T
FRQ PatternPredict mutation effect
What You Must DoIdentify mutation type (silent/missense/nonsense/frameshift), then predict protein change, then predict functional consequence
Common ErrorStopping at "the amino acid changes" without connecting to protein shape or enzyme activity
FRQ PatternOperon regulation scenario
What You Must DoDetermine which component is mutated (repressor gene, operator, structural genes), then predict whether expression is always ON, always OFF, or normal
Common ErrorConfusing a nonfunctional repressor (always ON) with a mutated operator (also always ON, but for a different reason)
FRQ PatternRegulatory gene mutation in eukaryotes
What You Must DoIdentify whether a transcription factor is an activator or repressor, then predict whether a loss-of-function mutation increases or decreases gene expression
Common ErrorAssuming all regulatory proteins are repressors; activator mutations silence the gene, not turn it on

These four FRQ patterns cover the majority of Unit 6 questions in released AP Biology exams from 2019 to 2024.

The College Board also connects Unit 6 to its Science Practice requirement for experimental design. A common format gives students the results of a gel electrophoresis experiment showing mRNA size before and after a splicing mutation, asking them to interpret the data and predict the effect on protein length. Practice reading data from this type of scenario alongside the conceptual content.

How Does Unit 6 Connect to the Rest of AP Biology?

Unit 6 sits at the molecular center of AP Biology, connecting backward to genetics and forward to evolution. In Unit 5 (Heredity), you learned that organisms inherit alleles from their parents. Unit 6 explains what those alleles actually do: an allele is a specific version of a gene sequence, and transcription and translation are how that sequence becomes a protein that causes a phenotype.

The connection to Unit 4 (Cell Communication) runs through gene regulation. Signal transduction cascades often terminate by activating or deactivating transcription factors, which then change which genes get expressed. A hormonal signal received at the cell surface can ultimately turn on or off dozens of genes by modifying transcription factor activity.

Looking forward, every mutation discussed in Unit 6 feeds Unit 7 (Natural Selection). Frameshift mutations that produce nonfunctional proteins are usually lethal or strongly selected against. Missense mutations in surface proteins (like sickle cell hemoglobin) can confer heterozygote advantage under specific environmental pressures. Unit 6 builds the molecular alphabet; Unit 7 asks which letters get passed on. For more on the AP Biology exam as a whole, the AP Biology resources hub collects all unit walkthroughs and exam strategy guides in one place.

The unit-to-unit connections extend even to Unit 1 (Chemistry of Life): the molecular properties of amino acids that you memorized in Unit 1 (polar, nonpolar, acidic, basic) directly determine how a protein folds in three dimensions, which determines function. A missense mutation that substitutes a nonpolar amino acid for a charged one in the interior of a protein disrupts folding because the interior of most globular proteins is hydrophobic. That linkage spans four units of AP Biology, and the exam rewards students who see it.

Estimate Your AP Biology Score

Once you have completed your Unit 6 review and worked through the practice problems, it helps to estimate where you stand across the full AP Biology exam before May. The AP Score Predictor at Tutorioo uses your Unit 6 readiness alongside your performance in other high-weight units to generate a score estimate and flag which areas still need the most practice time.

AP Score Predictor

Estimate your AP Biology score based on your current readiness across Units 3, 6, and 7 (the three highest-weight units) and identify which topics to prioritize before the May exam.

Predict My AP Biology Score

Key Takeaways

  1. Unit 6 covers 12-16% of the AP Biology exam, making it one of the highest-weighted units alongside Unit 3 and Unit 7. Treat it as a priority unit, not a secondary review.
  2. The Central Dogma (DNA transcription to mRNA, then mRNA translation to protein at the ribosome) is the backbone of every Unit 6 FRQ. Practice the three-step process until it is automatic: write mRNA, divide into codons, use the codon chart.
  3. Eukaryotes require RNA processing (5' cap, 3' poly-A tail, splicing of introns) before mRNA leaves the nucleus. Prokaryotes skip this step entirely.
  4. Four mutation types matter for the exam: silent (no amino acid change), missense (different amino acid), nonsense (premature stop codon, truncated protein), frameshift (all downstream codons shift, most disruptive).
  5. The lac operon is the model for negative prokaryotic regulation: when lactose is absent, a repressor blocks transcription; when lactose is present, allolactose removes the repressor and the gene turns on.
  6. Eukaryotic gene regulation operates through transcription factors and chromatin remodeling (histone acetylation opens chromatin; methylation closes it). The AP exam tests the conceptual connection between chromatin state and transcription level, not enzyme names.
  7. Unit 6 mutations are the raw material for natural selection in Unit 7. Students who understand how a specific mutation changes protein function can predict its fitness consequences without memorizing separate evolution content.

Related articles

Try a free AI tutoring session