Nucleic acids, amino acids, proteins | Structure, function, and reactivity of bio-relevant molecules | Chem/phys

Nucleotides and nucleosides

Nucleic acids (DNA and RNA) are macromolecules essential for storing and transmitting genetic information. Their building blocks are nucleotides, each containing:

A five-carbon sugar (ribose in RNA, deoxyribose in DNA)
A phosphate group (which contributes acidity)
A nitrogenous base

When a sugar and base are bonded without a phosphate, the compound is called a nucleoside.

For example, adenosine (base + sugar) is a nucleoside; adding a phosphate group transforms it into a nucleotide.

Sugar-Phosphate Backbone Nucleotides link together via phosphodiester bonds between the phosphate of one nucleotide and the sugar of the next, forming the sugar-phosphate backbone. This backbone:

Provides the structural framework of both DNA and RNA
Positions the bases so they project inward, allowing specific base pairing

Pyrimidine, Purine Residues

The nitrogenous bases fall into two broad categories:

Purines: adenine (A), guanine (G), which feature a double-ring structure
Pyrimidines: cytosine (C ), thymine (T), and uracil (U), which contain a single ring
DNA uses thymine in place of RNA’s uracil.

Deoxyribonucleic Acid (DNA): Double Helix DNA (deoxyribonucleic acid) lacks an oxygen at the 2′ position on its sugar (hence “deoxy”). It is typically double-stranded:

According to the Watson–Crick model (building on Rosalind Franklin’s X-ray images), DNA forms a double helix with two complementary strands spiraling around a common axis.
Base pairing ensures replication accuracy:
- Adenine pairs with thymine
- Guanine pairs with cytosine

Chemistry

DNA’s sugar is deoxyribose, missing the 2′ $- O H$ group found in RNA’s ribose.
The phosphate group imparts acidic properties to both DNA and RNA.
RNA generally remains single-stranded and includes uracil in place of thymine.

Other Functions
Beyond storing genetic information:

Some nucleotides (e.g., ATP) serve as energy currency in cells.
Certain RNA molecules (like rRNA, tRNA, and mRNA) play key roles in protein synthesis.
Modified nucleotides can act as cofactors in enzymatic reactions or participate in signaling pathways.

Amino Acids

Amino acids are organic compounds that serve as the fundamental building blocks of proteins.
They each contain:

An amino group ( $- N H_{2}$ )
A carboxyl group ( $- COO H$ )
A hydrogen atom ( $- H$ )
A variable side chain ( $- R$ )

Most amino acids (except glycine) have a chiral alpha carbon, allowing for different stereochemical forms.

Absolute Configuration at the α Position
The absolute configuration refers to the three-dimensional arrangement of substituents around the amino acid’s alpha carbon, designated by R or S using the Cahn–Ingold–Prelog rules. Although most naturally occurring amino acids are in the L form, they commonly correspond to the S configuration—except for cysteine, which is L but R by CIP priority (due to sulfur’s higher atomic number).

Key Points

Glycine is achiral (its side chain is a second hydrogen).
Cysteine is typically R since sulfur outranks the carboxyl group in priority.
D and L notation is based on comparison with the reference molecule glyceraldehyde and does not necessarily match S or R assignments.

Dipolar Ions (Zwitterions) Amino acids frequently exist as zwitterions (dipolar ions) at or near physiological pH. Their amino group is protonated ( $- N H_{3}^{+}$ ), while their carboxyl group is deprotonated ( $- CO O^{-}$ ). This dual charge influences:

Solubility in water
Interactions with other charged molecules
Protein folding and stability

The exact pH value where this dipolar form dominates is the amino acid’s isoelectric point (pI), determined by the average of relevant pKa values.

Classification: Acidic or Basic

Acidic amino acids (e.g., aspartic acid, glutamic acid) have side chains that can donate protons and carry a negative charge at physiological pH.
Basic amino acids (e.g., lysine, arginine, histidine) contain side chains that accept protons, often carrying a positive charge at or near physiological conditions.
These properties allow amino acids to buffer pH changes in biological systems.

Classification: Hydrophilic or Hydrophobic

Hydrophobic amino acids possess nonpolar side chains (e.g., valine, leucine) that typically avoid water.
Hydrophilic amino acids have polar or charged side chains (e.g., serine, lysine) that readily form hydrogen bonds or electrostatic interactions with water…
This polarity distinction strongly influences protein folding, as hydrophobic side chains tend to cluster inward, while hydrophilic side chains orient toward the aqueous environment.

Synthesis of $α$ -Amino Acids Amino acids can be synthesized in the laboratory by several methods, most notably:

Strecker Synthesis: Involves the reaction of an aldehyde with ammonium chloride ( $N H_{4} Cl$ ) and potassium cyanide ( $K CN$ ), forming an alpha-aminonitrile intermediate, which is then hydrolyzed to yield the amino acid.
Gabriel Synthesis: Utilizes phthalimide as a nitrogen source, which is alkylated and subsequently hydrolyzed to form the free amino acid.

These synthetic methods are often employed to obtain racemic mixtures, which can then be resolved to isolate specific enantiomers if desired.

Peptides and Proteins: Reactions

Sulfur Linkage for Cysteine and Cystine
- Disulfide Bonds: Two cysteine residues can form a covalent disulfide bond ( $- S - S -$ ) when their sulfhydryl ( $- S H$ ) groups come into proximity, creating cystine.
- Importance: These linkages stabilize a protein’s tertiary or quaternary structure by connecting distant parts of a single chain or linking multiple polypeptide chains.
Peptide Linkage: Polypeptides and Proteins
- Formation: Peptide bonds arise from a condensation reaction between the carboxyl group ( $- COO H$ ) of one amino acid and the amino group ( $- N H_{2}$ ) of another, releasing a water molecule.
- Polypeptides and Proteins: Short chains are known as polypeptides, while longer, often more complex chains are referred to as proteins once they fold into functional shapes.
Hydrolysis
- Definition: Hydrolysis breaks peptide bonds, turning polypeptides into smaller fragments or individual amino acids through the addition of water.
- Methods: This can occur via laboratory treatments (acid or base) or through proteolytic enzymes in biological contexts, essential for protein digestion and turnover.

General Principles

Primary Structure of Proteins
- Linear Sequence: The primary structure is the exact order of amino acids in the polypeptide chain, read from the N-terminus to the C-terminus.
- Importance: This sequence dictates how the chain will fold into higher-level structures.
Secondary Structure of Proteins
- Localized Folding: Secondary structure refers to regional conformations stabilized by hydrogen bonds among backbone $N H$ and $C = O$ groups.
- Common Motifs:
  - $α$ helices: Right-handed spirals with side chains oriented outward.
  - $β$ pleated sheets: Extended strands with alternating side chains pointing above or below the sheet plane.
Tertiary Structure of Proteins
- Three-Dimensional Folding: Tertiary structure emerges from side-chain interactions such as hydrophobic clustering, ionic bonds, hydrogen bonds, and disulfide linkages.
- Key Components:
  - Proline: Its cyclic side chain introduces bends, disrupting regular helices or sheets.
  - Cystine: Formed by disulfide bonds between cysteine residues, adding robust stability.
  - Hydrophobic Bonding: Nonpolar residues cluster inward, shielding themselves from water and driving the overall fold.
Isoelectric Point
- Definition: The pH at which a protein or amino acid carries no net charge, often resulting in minimal solubility.
- Behavior: Around this pH, separation techniques like isoelectric focusing exploit small differences in pI values to differentiate proteins.