Protein synthesis

The central dogma of molecular biology describes the flow of genetic information within a biological system. This process is fundamental to understanding how proteins are made using the genetic instructions in DNA. Understanding protein synthesis is key to developing therapies in medicine, enhancing crop yields in agriculture and creating new biotechnologies.

The central dogma

The synthesis of proteins is explained using the central dogma (“central dogma” just means it is a fundamental principle or concept). It tells us that DNA is copied into messenger RNA (mRNA) through a process called transcription and then mRNA is used to make proteins in a process called translation.

You will soon learn that this process is very complex, with a number of modifications that occur to the mRNA and the protein before they are ready for use by the body. An error in any of these processes could lead to a malfunctioning protein.

Transcription

Transcription (“trans” means “across” and “scribe” means “write”) is the process in which a specific segment of DNA is copied into mRNA. This mRNA strand contains the genetic blueprints for making a protein.
There are three steps to transcribing DNA, which occur in the nucleus of eukaryotic cells: initiation, elongation and termination.

Initiation

During initiation, an enzyme called RNA polymerase binds to a region of the DNA known as the promoter region. This is the part of a gene that signals the DNA to unwind from its double helix structure, exposing the nucleotides so that they can be read by the enzyme.

A segment of double-stranded DNA with RNA polymerase attached to the promoter region.

Elongation

During elongation, the RNA polymerase moves along the segment of unwound DNA and adds nucleotides in a chain to build the mRNA strand. The mRNA strand is complementary to the DNA strand it is copied from, following the complementary base-pairing rules (which are consistent with Chargaff’s rules for the proportions of bases in DNA).

Adenine (A) in the DNA strand pairs with uracil (U) in the RNA.
Thymine (T) in the DNA strand pairs with adenine (A) in the RNA.
Cytosine (C) in the DNA strand pairs with guanine (G) in the RNA.
Guanine (G) in the DNA strand pairs with cytosine (C) in the RNA.
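
The pairing rules above can be sketched in a few lines of Python. This is a simplified illustration, not a model of how RNA polymerase works: the function name transcribe and the example sequence are made up for this sketch, and the DNA template strand is assumed to be written 3’ to 5’ so that the resulting mRNA reads 5’ to 3’.

```python
# Pairing rules from the list above: DNA template base -> RNA base.
DNA_TO_RNA = {"A": "U", "T": "A", "C": "G", "G": "C"}


def transcribe(template_strand: str) -> str:
    """Return the mRNA that is complementary to a DNA template strand.

    Simplification: bases are paired one position at a time.
    A base other than A, T, C or G raises a KeyError.
    """
    return "".join(DNA_TO_RNA[base] for base in template_strand.upper())


# Hypothetical example sequence, chosen only for illustration.
print(transcribe("TACGGCATT"))  # AUGCCGUAA
```

Note that every A in the template produces a U in the mRNA, never a T.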

U and T are similar in structure. The presence of U instead of T indicates that the molecule is RNA, not DNA. Thymine helps keep DNA stable, while uracil is more useful in RNA as it requires less energy to produce.

A segment of double-stranded DNA that is partially unwound from its double helix structure. RNA polymerase is attached to one strand and has moved along the unwound section. A complementary strand of mRNA is formed.

Termination

When the RNA polymerase reaches a stop signal (or termination sequence) in the gene, transcription ends. During termination, the RNA polymerase detaches from the DNA and releases the mRNA strand, which is then free to carry the blueprints for the protein to a ribosome in the cytosol or on the endoplasmic reticulum, where it can be translated into a protein.

A segment of double-stranded DNA that is partially unwound from its double helix structure. RNA polymerase has reached the end of the unwound section of DNA, labelled the stop signal, and has detached from the DNA strand. A single strand of pre-mRNA is released.

Watch this video to see transcription in action.

What you are about to see is DNA's most extraordinary secret — how a simple code is turned into flesh and blood. It begins with a bundle of factors assembling at the start of a gene. A gene is simply a length of DNA instructions stretching away to the left. The assembled factors trigger the first phase of the process, reading off the information that will be needed to make the protein. Everything is ready to roll: three, two, one, GO! The blue molecule racing along the DNA is reading the gene. It's unzipping the double helix, and copying one of the two strands. The yellow chain snaking out of the top is a copy of the genetic message and it's made of a close chemical cousin of DNA called RNA. The building blocks to make the RNA enter through an intake hole. They are matched to the DNA - letter by letter - to copy the As, Cs, Ts and Gs of the gene. The only difference is that in the RNA copy, the letter T is replaced with a closely related building block known as "U". You are watching this process - called transcription - in real time. It's happening right now in almost every cell in your body.

Post-transcriptional modifications

In eukaryotes, transcription forms pre-mRNA, which needs to undergo additional modifications such as 5’ capping, polyadenylation and splicing to prepare it for translation.

5’ capping – to improve the stability of the mRNA transcript and protect it from degradation, a cap can be added to the 5’ end via a phosphate linkage
polyadenylation – just like 5’ capping, polyadenylation helps to stabilise the mRNA transcript; it involves adding a poly-A tail consisting of about 200 adenine nucleotides
splicing – segments of genetic code called exons code for functional proteins, but the pre-mRNA transcribed from DNA also contains segments called introns, which do not code for protein; the introns are cut out of the pre-mRNA and the exons are joined together again

Three types of post-transcriptional modification of pre-mRNA.
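
The three modifications can be pictured as simple string operations. The sketch below is a toy model under stated assumptions: the function name process_pre_mrna, the "[cap]" placeholder, the example sequence and the intron position are all hypothetical, and in a real cell the introns are recognised by the splicing machinery rather than supplied as a list of positions.

```python
from typing import Sequence, Tuple


def process_pre_mrna(
    pre_mrna: str,
    introns: Sequence[Tuple[int, int]],
    tail_length: int = 200,
) -> str:
    """Toy model of splicing, 5' capping and polyadenylation.

    introns: (start, end) positions of each intron, end exclusive,
    given in ascending order and not overlapping (an assumption of
    this sketch).
    """
    exons = []
    position = 0
    for start, end in introns:
        exons.append(pre_mrna[position:start])  # keep the exon before the intron
        position = end                          # skip over the intron
    exons.append(pre_mrna[position:])           # keep the final exon
    spliced = "".join(exons)                    # splicing: exons joined together

    # "[cap]" is only a placeholder for the 5' cap; the poly-A tail is
    # a run of adenine nucleotides added to the other end.
    return "[cap]" + spliced + "A" * tail_length


# Hypothetical pre-mRNA: exon AUGGCC, intron GUACAG, exon UGGUAA.
# A short tail of 5 is used here so the output fits on one line.
print(process_pre_mrna("AUGGCCGUACAGUGGUAA", [(6, 12)], tail_length=5))
# [cap]AUGGCCUGGUAAAAAAA
```

The default tail_length of 200 matches the approximate poly-A tail length given above.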

Translation

Translation is the conversion of the genetic code from the mRNA into a specific sequence of amino acids, forming a polypeptide or protein. It occurs within ribosomes, which consist of two subunits. The small subunit binds the mRNA transcript and the large subunit binds transfer RNA (tRNA).

Ribosome by MajoraMaster on Sketchfab, licensed under CC BY 4.0

A three-dimensional model of a single ribosome with the two subunits shown in different colours.

Ribosome model

Small subunit (pink): binds the mRNA and decodes the genetic message it carries.
Large subunit (purple): joins the amino acids delivered by the tRNA molecules together to form the polypeptide chain.

tRNA molecules read the mRNA template in sets of three nucleotides, called codons, with each codon corresponding to a specific amino acid. They bring the appropriate amino acids to the ribosome so they can be added to the polypeptide chain.

The codon chart is shown.
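
Reading an mRNA strand codon by codon can be sketched as a table lookup. This is a minimal illustration, assuming a hypothetical function name translate and a made-up example sequence. The table holds only the few codons the example needs (the full codon chart has 64 entries), and the sketch assumes that reading starts at the first AUG codon and ends at a stop codon.

```python
# Partial codon table: only the codons needed for this example.
CODON_TABLE = {
    "AUG": "Met",  # methionine, also the start codon
    "CCG": "Pro",
    "GGC": "Gly",
    "UUU": "Phe",
    "UGG": "Trp",
    "UAA": "Stop",
    "UAG": "Stop",
    "UGA": "Stop",
}


def translate(mrna: str) -> list:
    """Return the amino acids coded by an mRNA, read three bases at a time."""
    start = mrna.find("AUG")  # assumption: begin at the first start codon
    if start == -1:
        return []             # no start codon, so nothing is translated

    amino_acids = []
    # Step through the strand one codon (three nucleotides) at a time.
    for i in range(start, len(mrna) - 2, 3):
        codon = mrna[i:i + 3]
        residue = CODON_TABLE[codon]  # KeyError if the codon is not in this partial table
        if residue == "Stop":
            break                     # a stop codon ends the chain
        amino_acids.append(residue)
    return amino_acids


# Hypothetical example sequence: AUG CCG GGC UUU UAA
print(translate("AUGCCGGGCUUUUAA"))  # ['Met', 'Pro', 'Gly', 'Phe']
```

Each three-letter key stands for one codon, and the order of the amino acids in the result follows the order of the codons in the mRNA.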

Like transcription, translation involves three steps: initiation, elongation and termination.