Achievable logoAchievable logo
MCAT
Sign in
Sign up
Purchase
Textbook
Practice exams
Support
How it works
Exam catalog
Mountain with a flag at the peak
Textbook
Introduction
1. CARS
2. Psych/soc
3. Bio/biochem
4. Chem/phys
4.1 Translational motion, forces, work, energy, and equilibrium
4.2 Fluids in circulation of blood, gas movement, and gas exchange
4.3 Electrochemistry and electrical circuits and their elements
4.4 How light and sound interact with matter
4.5 Atoms, nuclear decay, electronic structure, and atomic chemical behavior
4.6 Unique nature of water and its solutions
4.7 Nature of molecules and intermolecular interaction
4.8 Separation and purification methods
4.9 Structure, function, and reactivity of bio-relevant molecules
4.9.1 Alcohols and carboxylic acids
4.9.2 Protein structure, non-enz protein function, lipids
4.9.3 Nucleic acids, amino acids, proteins
4.9.4 Carbohydrates, aldehydes and ketones
4.9.5 Acid derivatives, phenols, polycyclic and heterocyclic aromatics
4.10 Principles of chemical thermodynamics and kinetics, enzymes
Wrapping up
Achievable logoAchievable logo
4.9.3 Nucleic acids, amino acids, proteins
Achievable MCAT
4. Chem/phys
4.9. Structure, function, and reactivity of bio-relevant molecules

Nucleic acids, amino acids, proteins

7 min read
Font
Discuss
Share
Feedback

Nucleotides and nucleosides

Nucleic acids (DNA and RNA) are macromolecules that store and transmit genetic information. Their basic building blocks are nucleotides. Each nucleotide contains:

  • A five-carbon sugar (ribose in RNA, deoxyribose in DNA)
  • A phosphate group (which contributes acidity)
  • A nitrogenous base

If the sugar and base are bonded without a phosphate group, the compound is called a nucleoside.

  • For example, adenosine (base + sugar) is a nucleoside; adding a phosphate group transforms it into a nucleotide.

Sugar-phosphate backbone

Nucleotides link together through phosphodiester bonds between the phosphate of one nucleotide and the sugar of the next. This creates the sugar-phosphate backbone, which:

  • Provides the structural framework of both DNA and RNA
  • Positions the bases so they project inward, allowing specific base pairing

Pyrimidine, purine residues

The nitrogenous bases fall into two broad categories:

  • Purines: adenine (A), guanine (G), which feature a double-ring structure
  • Pyrimidines: cytosine ©, thymine (T), and uracil (U), which contain a single ring
    DNA uses thymine in place of RNA’s uracil.

Deoxyribonucleic acid (DNA): double helix

DNA (deoxyribonucleic acid) lacks an oxygen at the 2′ position on its sugar (hence “deoxy”). It is typically double-stranded:

  • According to the Watson-Crick model (building on Rosalind Franklin’s X-ray images), DNA forms a double helix with two complementary strands spiraling around a common axis.
  • Base pairing helps ensure replication accuracy:
    • Adenine pairs with thymine
    • Guanine pairs with cytosine

Chemistry

  • DNA’s sugar is deoxyribose, missing the 2′−OH group found in RNA’s ribose.
  • The phosphate group imparts acidic properties to both DNA and RNA.
  • RNA generally remains single-stranded and includes uracil in place of thymine.

Other functions
In addition to storing genetic information:

  • Some nucleotides (e.g., ATP) serve as energy currency in cells.
  • Certain RNA molecules (like rRNA, tRNA, and mRNA) play key roles in protein synthesis.
  • Modified nucleotides can act as cofactors in enzymatic reactions or participate in signaling pathways.

Amino acids

Amino acids are organic compounds and the fundamental building blocks of proteins.
Each amino acid contains:

An amino group (−NH2​)
A carboxyl group (−COOH)
A hydrogen atom (−H)
A variable side chain (−R)

Most amino acids (except glycine) have a chiral alpha carbon, which means they can exist in different stereochemical forms.

Absolute configuration at the α position
The absolute configuration describes the three-dimensional arrangement of substituents around the amino acid’s alpha carbon. It is designated R or S using the Cahn-Ingold-Prelog rules. Most naturally occurring amino acids are in the L form, and they commonly correspond to the S configuration - except for cysteine, which is L but R by CIP priority (because sulfur has a higher atomic number).

Key points

  • Glycine is achiral (its side chain is a second hydrogen).
  • Cysteine is typically R since sulfur outranks the carboxyl group in priority.
  • D and L notation is based on comparison with the reference molecule glyceraldehyde and does not necessarily match S or R assignments.

Dipolar ions (zwitterions)

Amino acids often exist as zwitterions (dipolar ions) at or near physiological pH. In this form, the amino group is protonated (−NH3+​) and the carboxyl group is deprotonated (−COO−). This dual charge affects:

  • Solubility in water
  • Interactions with other charged molecules
  • Protein folding and stability

The pH at which this dipolar form dominates is the amino acid’s isoelectric point (pI). It is determined by the average of the relevant pKa values.

Classification: acidic or basic

  • Acidic amino acids (e.g., aspartic acid, glutamic acid) have side chains that can donate protons and carry a negative charge at physiological pH.
  • Basic amino acids (e.g., lysine, arginine, histidine) contain side chains that accept protons, often carrying a positive charge at or near physiological conditions.
    These properties allow amino acids to buffer pH changes in biological systems.

Classification: hydrophilic or hydrophobic

  • Hydrophobic amino acids possess nonpolar side chains (e.g., valine, leucine) that typically avoid water.
  • Hydrophilic amino acids have polar or charged side chains (e.g., serine, lysine) that readily form hydrogen bonds or electrostatic interactions with water.
    This polarity distinction strongly influences protein folding: hydrophobic side chains tend to cluster inward, while hydrophilic side chains orient toward the aqueous environment.

Synthesis of α-amino acids

Amino acids can be synthesized in the laboratory by several methods, most notably:

  • Strecker synthesis: Involves the reaction of an aldehyde with ammonium chloride (NH4​Cl) and potassium cyanide (KCN), forming an alpha-aminonitrile intermediate, which is then hydrolyzed to yield the amino acid.
  • Gabriel synthesis: Utilizes phthalimide as a nitrogen source, which is alkylated and subsequently hydrolyzed to form the free amino acid.

These synthetic methods are often used to obtain racemic mixtures, which can then be resolved to isolate specific enantiomers if desired.

Peptides and proteins: reactions

  1. Sulfur linkage for cysteine and cystine

    • Disulfide bonds: Two cysteine residues can form a covalent disulfide bond (−S−S−) when their sulfhydryl (−SH) groups come into proximity, creating cystine.
    • Importance: These linkages stabilize a protein’s tertiary or quaternary structure by connecting distant parts of a single chain or linking multiple polypeptide chains.
  2. Peptide linkage: polypeptides and proteins

    • Formation: Peptide bonds arise from a condensation reaction between the carboxyl group (−COOH) of one amino acid and the amino group (−NH2​) of another, releasing a water molecule.
    • Polypeptides and proteins: Short chains are known as polypeptides, while longer, often more complex chains are referred to as proteins once they fold into functional shapes.
  3. Hydrolysis

    • Definition: Hydrolysis breaks peptide bonds, turning polypeptides into smaller fragments or individual amino acids through the addition of water.
    • Methods: This can occur via laboratory treatments (acid or base) or through proteolytic enzymes in biological contexts, essential for protein digestion and turnover.

General principles

  1. Primary structure of proteins

    • Linear sequence: The primary structure is the exact order of amino acids in the polypeptide chain, read from the N-terminus to the C-terminus.
    • Importance: This sequence dictates how the chain will fold into higher-level structures.
  2. Secondary structure of proteins

    • Localized folding: Secondary structure refers to regional conformations stabilized by hydrogen bonds among backbone NH and C=O groups.
    • Common motifs:
      • α helices: Right-handed spirals with side chains oriented outward.
      • β pleated sheets: Extended strands with alternating side chains pointing above or below the sheet plane.
  3. Tertiary structure of proteins

    • Three-dimensional folding: Tertiary structure emerges from side-chain interactions such as hydrophobic clustering, ionic bonds, hydrogen bonds, and disulfide linkages.
    • Key components:
      • Proline: Its cyclic side chain introduces bends, disrupting regular helices or sheets.
      • Cystine: Formed by disulfide bonds between cysteine residues, adding robust stability.
      • Hydrophobic bonding: Nonpolar residues cluster inward, shielding themselves from water and driving the overall fold.
  4. Isoelectric point

    • Definition: The pH at which a protein or amino acid carries no net charge, often resulting in minimal solubility.
    • Behavior: Around this pH, separation techniques like isoelectric focusing exploit small differences in pI values to differentiate proteins.
GFP (PDB 1B9C) charge at different pH
GFP (PDB 1B9C) charge at different pH

Nucleotides and nucleosides

  • Nucleotides: sugar (ribose or deoxyribose), phosphate group, nitrogenous base
  • Nucleoside: sugar + base (no phosphate)
  • Nucleotides form nucleic acids (DNA, RNA); nucleosides do not

Sugar-phosphate backbone

  • Nucleotides linked by phosphodiester bonds
  • Forms structural framework of DNA/RNA
  • Bases project inward for specific pairing

Pyrimidine, purine residues

  • Purines: adenine (A), guanine (G); double-ring
  • Pyrimidines: cytosine ©, thymine (T, DNA), uracil (U, RNA); single-ring
  • DNA uses thymine; RNA uses uracil

Deoxyribonucleic acid (DNA): double helix

  • DNA: double-stranded, deoxyribose sugar (lacks 2′-OH)
  • Double helix structure (Watson-Crick model)
  • Base pairing: A-T, G-C

Chemistry

  • DNA: deoxyribose; RNA: ribose (has 2′-OH)
  • Phosphate group gives acidity
  • RNA: usually single-stranded, contains uracil

Other functions

  • Nucleotides (e.g., ATP): cellular energy currency
  • RNA types (rRNA, tRNA, mRNA): protein synthesis roles
  • Modified nucleotides: enzyme cofactors, signaling

Amino acids

  • Structure: amino group (−NH2​), carboxyl group (−COOH), hydrogen, variable −R side chain
  • Most have chiral alpha carbon (except glycine)
  • L-form predominates in nature; S configuration except cysteine (L, but R by CIP rules)

Absolute configuration at the α position

  • L and D notation: based on glyceraldehyde reference
  • S/R assignment: most L-amino acids are S; cysteine is L but R

Dipolar ions (zwitterions)

  • Amino acids exist as zwitterions at physiological pH
    • Amino group protonated (−NH3+​), carboxyl deprotonated (−COO−)
  • Isoelectric point (pI): pH where net charge is zero

Classification: acidic or basic

  • Acidic: aspartic acid, glutamic acid (negative charge at pH 7)
  • Basic: lysine, arginine, histidine (positive charge at pH 7)
  • Buffering capacity due to side chain ionization

Classification: hydrophilic or hydrophobic

  • Hydrophobic: nonpolar side chains (e.g., valine, leucine)
  • Hydrophilic: polar/charged side chains (e.g., serine, lysine)
  • Drives protein folding: hydrophobic inside, hydrophilic outside

Synthesis of α-amino acids

  • Strecker synthesis: aldehyde + NH4​Cl + KCN → aminonitrile → amino acid (hydrolysis)
  • Gabriel synthesis: phthalimide alkylation, hydrolysis yields amino acid
  • Lab syntheses often give racemic mixtures

Peptides and proteins: reactions

  • Disulfide bonds: cysteine residues form covalent −S−S− (cystine), stabilizing structure
  • Peptide bonds: condensation of −COOH and −NH2​ groups, forming polypeptides/proteins
  • Hydrolysis: breaks peptide bonds (acid/base or enzymes), releases amino acids

Primary structure of proteins

  • Linear amino acid sequence (N- to C-terminus)
  • Determines higher-level folding

Secondary structure of proteins

  • Local folding via hydrogen bonds (backbone)
  • α helices: right-handed coils; β sheets: extended, pleated strands

Tertiary structure of proteins

  • 3D folding from side chain interactions: hydrophobic, ionic, hydrogen, disulfide
    • Proline: introduces bends
    • Cystine: disulfide bonds add stability
    • Hydrophobic residues cluster inward

Isoelectric point

  • pH where protein/amino acid has no net charge
  • Used for separation (e.g., isoelectric focusing)

Sign up for free to take 7 quiz questions on this topic

All rights reserved ©2016 - 2026 Achievable, Inc.

Nucleic acids, amino acids, proteins

Nucleotides and nucleosides

Nucleic acids (DNA and RNA) are macromolecules that store and transmit genetic information. Their basic building blocks are nucleotides. Each nucleotide contains:

  • A five-carbon sugar (ribose in RNA, deoxyribose in DNA)
  • A phosphate group (which contributes acidity)
  • A nitrogenous base

If the sugar and base are bonded without a phosphate group, the compound is called a nucleoside.

  • For example, adenosine (base + sugar) is a nucleoside; adding a phosphate group transforms it into a nucleotide.

Sugar-phosphate backbone

Nucleotides link together through phosphodiester bonds between the phosphate of one nucleotide and the sugar of the next. This creates the sugar-phosphate backbone, which:

  • Provides the structural framework of both DNA and RNA
  • Positions the bases so they project inward, allowing specific base pairing

Pyrimidine, purine residues

The nitrogenous bases fall into two broad categories:

  • Purines: adenine (A), guanine (G), which feature a double-ring structure
  • Pyrimidines: cytosine ©, thymine (T), and uracil (U), which contain a single ring
    DNA uses thymine in place of RNA’s uracil.

Deoxyribonucleic acid (DNA): double helix

DNA (deoxyribonucleic acid) lacks an oxygen at the 2′ position on its sugar (hence “deoxy”). It is typically double-stranded:

  • According to the Watson-Crick model (building on Rosalind Franklin’s X-ray images), DNA forms a double helix with two complementary strands spiraling around a common axis.
  • Base pairing helps ensure replication accuracy:
    • Adenine pairs with thymine
    • Guanine pairs with cytosine

Chemistry

  • DNA’s sugar is deoxyribose, missing the 2′−OH group found in RNA’s ribose.
  • The phosphate group imparts acidic properties to both DNA and RNA.
  • RNA generally remains single-stranded and includes uracil in place of thymine.

Other functions
In addition to storing genetic information:

  • Some nucleotides (e.g., ATP) serve as energy currency in cells.
  • Certain RNA molecules (like rRNA, tRNA, and mRNA) play key roles in protein synthesis.
  • Modified nucleotides can act as cofactors in enzymatic reactions or participate in signaling pathways.

Amino acids

Amino acids are organic compounds and the fundamental building blocks of proteins.
Each amino acid contains:

An amino group (−NH2​)
A carboxyl group (−COOH)
A hydrogen atom (−H)
A variable side chain (−R)

Most amino acids (except glycine) have a chiral alpha carbon, which means they can exist in different stereochemical forms.

Absolute configuration at the α position
The absolute configuration describes the three-dimensional arrangement of substituents around the amino acid’s alpha carbon. It is designated R or S using the Cahn-Ingold-Prelog rules. Most naturally occurring amino acids are in the L form, and they commonly correspond to the S configuration - except for cysteine, which is L but R by CIP priority (because sulfur has a higher atomic number).

Key points

  • Glycine is achiral (its side chain is a second hydrogen).
  • Cysteine is typically R since sulfur outranks the carboxyl group in priority.
  • D and L notation is based on comparison with the reference molecule glyceraldehyde and does not necessarily match S or R assignments.

Dipolar ions (zwitterions)

Amino acids often exist as zwitterions (dipolar ions) at or near physiological pH. In this form, the amino group is protonated (−NH3+​) and the carboxyl group is deprotonated (−COO−). This dual charge affects:

  • Solubility in water
  • Interactions with other charged molecules
  • Protein folding and stability

The pH at which this dipolar form dominates is the amino acid’s isoelectric point (pI). It is determined by the average of the relevant pKa values.

Classification: acidic or basic

  • Acidic amino acids (e.g., aspartic acid, glutamic acid) have side chains that can donate protons and carry a negative charge at physiological pH.
  • Basic amino acids (e.g., lysine, arginine, histidine) contain side chains that accept protons, often carrying a positive charge at or near physiological conditions.
    These properties allow amino acids to buffer pH changes in biological systems.

Classification: hydrophilic or hydrophobic

  • Hydrophobic amino acids possess nonpolar side chains (e.g., valine, leucine) that typically avoid water.
  • Hydrophilic amino acids have polar or charged side chains (e.g., serine, lysine) that readily form hydrogen bonds or electrostatic interactions with water.
    This polarity distinction strongly influences protein folding: hydrophobic side chains tend to cluster inward, while hydrophilic side chains orient toward the aqueous environment.

Synthesis of α-amino acids

Amino acids can be synthesized in the laboratory by several methods, most notably:

  • Strecker synthesis: Involves the reaction of an aldehyde with ammonium chloride (NH4​Cl) and potassium cyanide (KCN), forming an alpha-aminonitrile intermediate, which is then hydrolyzed to yield the amino acid.
  • Gabriel synthesis: Utilizes phthalimide as a nitrogen source, which is alkylated and subsequently hydrolyzed to form the free amino acid.

These synthetic methods are often used to obtain racemic mixtures, which can then be resolved to isolate specific enantiomers if desired.

Peptides and proteins: reactions

  1. Sulfur linkage for cysteine and cystine

    • Disulfide bonds: Two cysteine residues can form a covalent disulfide bond (−S−S−) when their sulfhydryl (−SH) groups come into proximity, creating cystine.
    • Importance: These linkages stabilize a protein’s tertiary or quaternary structure by connecting distant parts of a single chain or linking multiple polypeptide chains.
  2. Peptide linkage: polypeptides and proteins

    • Formation: Peptide bonds arise from a condensation reaction between the carboxyl group (−COOH) of one amino acid and the amino group (−NH2​) of another, releasing a water molecule.
    • Polypeptides and proteins: Short chains are known as polypeptides, while longer, often more complex chains are referred to as proteins once they fold into functional shapes.
  3. Hydrolysis

    • Definition: Hydrolysis breaks peptide bonds, turning polypeptides into smaller fragments or individual amino acids through the addition of water.
    • Methods: This can occur via laboratory treatments (acid or base) or through proteolytic enzymes in biological contexts, essential for protein digestion and turnover.

General principles

  1. Primary structure of proteins

    • Linear sequence: The primary structure is the exact order of amino acids in the polypeptide chain, read from the N-terminus to the C-terminus.
    • Importance: This sequence dictates how the chain will fold into higher-level structures.
  2. Secondary structure of proteins

    • Localized folding: Secondary structure refers to regional conformations stabilized by hydrogen bonds among backbone NH and C=O groups.
    • Common motifs:
      • α helices: Right-handed spirals with side chains oriented outward.
      • β pleated sheets: Extended strands with alternating side chains pointing above or below the sheet plane.
  3. Tertiary structure of proteins

    • Three-dimensional folding: Tertiary structure emerges from side-chain interactions such as hydrophobic clustering, ionic bonds, hydrogen bonds, and disulfide linkages.
    • Key components:
      • Proline: Its cyclic side chain introduces bends, disrupting regular helices or sheets.
      • Cystine: Formed by disulfide bonds between cysteine residues, adding robust stability.
      • Hydrophobic bonding: Nonpolar residues cluster inward, shielding themselves from water and driving the overall fold.
  4. Isoelectric point

    • Definition: The pH at which a protein or amino acid carries no net charge, often resulting in minimal solubility.
    • Behavior: Around this pH, separation techniques like isoelectric focusing exploit small differences in pI values to differentiate proteins.
Key points

Nucleotides and nucleosides

  • Nucleotides: sugar (ribose or deoxyribose), phosphate group, nitrogenous base
  • Nucleoside: sugar + base (no phosphate)
  • Nucleotides form nucleic acids (DNA, RNA); nucleosides do not

Sugar-phosphate backbone

  • Nucleotides linked by phosphodiester bonds
  • Forms structural framework of DNA/RNA
  • Bases project inward for specific pairing

Pyrimidine, purine residues

  • Purines: adenine (A), guanine (G); double-ring
  • Pyrimidines: cytosine ©, thymine (T, DNA), uracil (U, RNA); single-ring
  • DNA uses thymine; RNA uses uracil

Deoxyribonucleic acid (DNA): double helix

  • DNA: double-stranded, deoxyribose sugar (lacks 2′-OH)
  • Double helix structure (Watson-Crick model)
  • Base pairing: A-T, G-C

Chemistry

  • DNA: deoxyribose; RNA: ribose (has 2′-OH)
  • Phosphate group gives acidity
  • RNA: usually single-stranded, contains uracil

Other functions

  • Nucleotides (e.g., ATP): cellular energy currency
  • RNA types (rRNA, tRNA, mRNA): protein synthesis roles
  • Modified nucleotides: enzyme cofactors, signaling

Amino acids

  • Structure: amino group (−NH2​), carboxyl group (−COOH), hydrogen, variable −R side chain
  • Most have chiral alpha carbon (except glycine)
  • L-form predominates in nature; S configuration except cysteine (L, but R by CIP rules)

Absolute configuration at the α position

  • L and D notation: based on glyceraldehyde reference
  • S/R assignment: most L-amino acids are S; cysteine is L but R

Dipolar ions (zwitterions)

  • Amino acids exist as zwitterions at physiological pH
    • Amino group protonated (−NH3+​), carboxyl deprotonated (−COO−)
  • Isoelectric point (pI): pH where net charge is zero

Classification: acidic or basic

  • Acidic: aspartic acid, glutamic acid (negative charge at pH 7)
  • Basic: lysine, arginine, histidine (positive charge at pH 7)
  • Buffering capacity due to side chain ionization

Classification: hydrophilic or hydrophobic

  • Hydrophobic: nonpolar side chains (e.g., valine, leucine)
  • Hydrophilic: polar/charged side chains (e.g., serine, lysine)
  • Drives protein folding: hydrophobic inside, hydrophilic outside

Synthesis of α-amino acids

  • Strecker synthesis: aldehyde + NH4​Cl + KCN → aminonitrile → amino acid (hydrolysis)
  • Gabriel synthesis: phthalimide alkylation, hydrolysis yields amino acid
  • Lab syntheses often give racemic mixtures

Peptides and proteins: reactions

  • Disulfide bonds: cysteine residues form covalent −S−S− (cystine), stabilizing structure
  • Peptide bonds: condensation of −COOH and −NH2​ groups, forming polypeptides/proteins
  • Hydrolysis: breaks peptide bonds (acid/base or enzymes), releases amino acids

Primary structure of proteins

  • Linear amino acid sequence (N- to C-terminus)
  • Determines higher-level folding

Secondary structure of proteins

  • Local folding via hydrogen bonds (backbone)
  • α helices: right-handed coils; β sheets: extended, pleated strands

Tertiary structure of proteins

  • 3D folding from side chain interactions: hydrophobic, ionic, hydrogen, disulfide
    • Proline: introduces bends
    • Cystine: disulfide bonds add stability
    • Hydrophobic residues cluster inward

Isoelectric point

  • pH where protein/amino acid has no net charge
  • Used for separation (e.g., isoelectric focusing)