TOPIC
Gene Expression, Transcription and translationMY PROGRESS
Pug Score
0%
Getting Started
"Let's build your foundation!"
Best Streak
0 in a row
Study Points
+0
Overview
Practice
Read
Quiz
Next Steps
Get Started
Get unlimited access to all videos, practice problems, and study tools.
Back to Menu
Topic Progress
Pug Score
0%
Getting Started
"Let's build your foundation!"
Best Practice
No score
Read
Not viewed
Best Quiz
No attempts
Best Streak
0 in a row
Study Points
+0
Overview
Practice
Read
Quiz
Next Steps
Read
Gene Expression: How DNA Becomes Protein Through Transcription and Translation
Gene expression describes how the instructions stored in DNA are used to produce proteins through the sequential processes of transcription and translation, forming the molecular basis of heredity and cellular function.
What Is Gene Expression?
Gene expression is the process by which the information encoded in a gene is used to produce a functional product, typically a protein. This process involves two major stages: transcription and translation, which together form the foundation of the central dogma of molecular biology.
Understanding gene expression builds directly on knowledge of Molecular Structure, DNA Components and Organization and DNA Structure and the Molecular Basis of Heredity, both of which establish the structural framework that makes gene expression possible.
Transcription: From DNA to mRNA
Transcription occurs in the nucleus of eukaryotic cells, where DNA is stored. The enzyme RNA polymerase binds to a specific DNA sequence called the promoter, unwinds the double helix, and reads the template strand (also called the antisense strand) in the 35 direction.
As RNA polymerase moves along the template strand, it assembles a complementary mRNA molecule by adding ribonucleotides according to base-pairing rules. Crucially, uracil replaces thymine in RNA, so adenine on the DNA template pairs with uracil in the mRNA. Transcription ends when RNA polymerase reaches the terminator sequence.
The resulting mRNA strand is complementary to the template strand and identical in sequence to the coding strand (sense strand), except with uracil replacing thymine.
mRNA Processing in Eukaryotes
In eukaryotic cells, the newly made pre-mRNA undergoes processing before leaving the nucleus. A 5 cap and a poly-A tail are added, and non-coding sequences called introns are spliced out by the spliceosome complex. The remaining coding sequences, called exons, are joined together to form mature mRNA, which is then exported to the cytoplasm.
Translation: From mRNA to Protein
Translation takes place at ribosomes, found in the cytoplasm or on the rough endoplasmic reticulum. The ribosome reads the mRNA sequence in groups of three nucleotides called codons. Each codon specifies a particular amino acid or signals the start or end of the polypeptide chain.
Translation begins at the start codon (AUG), which codes for the amino acid methionine. Transfer RNA (tRNA) molecules carry specific amino acids to the ribosome. Each tRNA has an anticodon a three-base sequence complementary to the mRNA codon ensuring the correct amino acid is incorporated. The incoming tRNA first enters the A site (aminoacyl site) of the ribosome, while the growing polypeptide is held at the P site, and empty tRNA molecules exit at the E site.
Translation ends when the ribosome encounters a stop codon (UAA, UAG, or UGA), which does not code for any amino acid. The completed polypeptide chain is released and folds into a functional protein.
The Genetic Code
The genetic code is the set of rules mapping mRNA codons to specific amino acids. It is nearly universal shared by almost all living organisms suggesting a common evolutionary ancestor. The code is also described as degenerate (or redundant), meaning multiple different codons can specify the same amino acid, which helps protect against the harmful effects of some mutations.
A DNA template strand reading 3-TACGGA-5 would produce the mRNA sequence 5-AUGCCU-3, demonstrating how base-pairing rules govern the transcription process.
Key Terms & Definitions
Gene: A specific segment of DNA that contains the instructions to produce a particular protein or functional RNA molecule.
Gene Expression: The process by which the information in a gene is used to produce a functional product, typically a protein, through transcription and translation.
Central Dogma: The fundamental principle of molecular biology stating that genetic information flows from DNA to RNA (transcription) and from RNA to protein (translation).
Transcription: The process by which RNA polymerase reads the DNA template strand and synthesizes a complementary mRNA molecule in the nucleus.
Translation: The process by which ribosomes decode the mRNA sequence to assemble amino acids into a polypeptide chain.
RNA Polymerase: The enzyme that binds to the promoter, unwinds the DNA double helix, and synthesizes mRNA during transcription by reading the template strand in the 35 direction.
Promoter: The specific DNA sequence where RNA polymerase binds to initiate transcription.
Terminator: The DNA sequence that signals the end of transcription, causing RNA polymerase to stop and release the mRNA.
Template Strand (Antisense Strand): The strand of DNA read by RNA polymerase during transcription; it runs 3 to 5 and is complementary to the mRNA produced.
Coding Strand (Sense Strand): The DNA strand that has the same sequence as the mRNA (with thymine instead of uracil); it is not directly read during transcription.
mRNA (Messenger RNA): A single-stranded RNA molecule that carries the genetic code from DNA in the nucleus to the ribosome in the cytoplasm, where it serves as the template for translation.
tRNA (Transfer RNA): RNA molecules that carry specific amino acids to the ribosome during translation; each tRNA has an anticodon that matches a specific mRNA codon.
rRNA (Ribosomal RNA): RNA that forms part of the structural and functional components of the ribosome.
Ribosome: The cellular structure (made of rRNA and proteins) where translation occurs; it reads mRNA codons and catalyzes the formation of peptide bonds between amino acids.
Codon: A sequence of three nucleotides on mRNA that specifies a particular amino acid or signals the start or stop of translation.
Anticodon: A three-base sequence on a tRNA molecule that is complementary to a specific mRNA codon, ensuring the correct amino acid is added during translation.
Start Codon: The codon AUG on mRNA that initiates translation and codes for the amino acid methionine.
Stop Codon: One of three mRNA codons (UAA, UAG, UGA) that signal the ribosome to end translation; they do not code for any amino acid.
Polypeptide: A chain of amino acids joined by peptide bonds, produced during translation, which folds into a functional protein.
Protein: A functional molecule made of one or more folded polypeptide chains; the final product of gene expression.
Genetic Code: The complete set of rules by which mRNA codons are translated into specific amino acids; it is nearly universal across all living organisms.
Introns: Non-coding sequences within a gene that are transcribed into pre-mRNA but spliced out during RNA processing and do not appear in the mature mRNA.
Exons: Coding sequences within a gene that are retained in the mature mRNA after RNA processing and are ultimately translated into protein.
Nucleotide: The basic building block of DNA and RNA, consisting of a sugar (deoxyribose in DNA, ribose in RNA), a phosphate group, and a nitrogenous base.
Nitrogenous Bases (DNA): The four bases found in DNA adenine (A), thymine (T), guanine (G), and cytosine (C) which pair according to Chargaff's rules (A-T and G-C).
Uracil: The nitrogenous base found in RNA that replaces thymine; uracil pairs with adenine during transcription and translation.
Double Helix: The three-dimensional structure of DNA in which two complementary strands wind around each other in a spiral shape, held together by hydrogen bonds between base pairs.
Hydrogen Bonds: The relatively weak bonds that hold the two strands of DNA together by linking complementary base pairs (A-T and G-C).
Deoxyribose: The five-carbon sugar found in DNA nucleotides; it has one fewer oxygen atom than ribose, the sugar found in RNA.
A Site (Aminoacyl Site): The ribosomal site where incoming aminoacyl-tRNA molecules (tRNA carrying their amino acid) first enter during translation.
P Site (Peptidyl Site): The ribosomal site that holds the tRNA attached to the growing polypeptide chain during translation.
E Site (Exit Site): The ribosomal site from which empty tRNA molecules exit after delivering their amino acid.
Missense Mutation: A point mutation in which a single base change causes the codon to specify a different amino acid, potentially altering protein function.
Nonsense Mutation: A point mutation that creates a premature stop codon, resulting in a shortened, typically non-functional protein.
Frameshift Mutation: A mutation caused by the insertion or deletion of nucleotides in a number not divisible by three, shifting the reading frame for all codons downstream of the mutation.
Silent Mutation: A point mutation that changes a codon but still codes for the same amino acid due to the degeneracy of the genetic code, resulting in no change in the protein.
Degenerate (Redundant) Genetic Code: The property of the genetic code whereby multiple different codons can specify the same amino acid.
Applying Gene Expression Concepts
Learners can strengthen their understanding by practicing mRNA synthesis from a given DNA template strand, applying base-pairing rules and substituting uracil for thymine. For example, given the DNA template 3-TACGGA-5, students should be able to derive the mRNA sequence 5-AUGCCU-3.
Students can also explore how mutations affect protein function by tracing how a single base change in DNA alters the mRNA codon and potentially the amino acid incorporated connecting directly to Mutations, Types and Effects. These skills are also foundational for understanding Biotechnology and Current Applications, where gene expression is deliberately manipulated.
Prerequisite Knowledge
Before studying gene expression, learners should be comfortable with the topics that establish its molecular and cellular foundations. DNA Structure and the Molecular Basis of Heredity and Gene Expression and Protein Synthesis provide the essential background for understanding how DNA encodes proteins.
Knowledge of Mitosis, Process and Stages, Cell Cycle, Growth and Regulation, and Meiosis and Gamete Formation explains how genetic information is preserved and distributed during cell division. Genetic Variation, Sources of Diversity, and Cell Reproduction, along with Mendelian Genetics and Basic Inheritance Patterns and Modern Genetics and Complex Inheritance, provide the broader genetic context within which gene expression operates.
Related Topics & Connections
Gene expression sits at the center of a network of interconnected biological concepts. Molecular Structure, DNA Components and Organization provides the structural detail of DNA that makes transcription possible, while Genetic Patterns and Complex Inheritance Models shows how gene expression outcomes are inherited across generations.
Changes in the DNA sequence that alter gene expression are studied in Mutations, Types and Effects. At the population level, altered gene expression through mutation connects to Natural Selection and Selection Pressures and Genetic Drift and Population Changes, both of which drive evolutionary change. The evidence for these evolutionary processes is examined in Evolutionary Evidence and Multiple Lines of Evidence.
The practical manipulation of gene expression is central to Biotechnology and Current Applications, where understanding transcription and translation enables technologies such as recombinant protein production. The ethical dimensions of such applications are addressed in Research Ethics and Ethical Considerations and Scientific Integrity, Data Handling and Reporting.