Achievable logoAchievable logo
USMLE/1
Sign in
Sign up
Purchase
Textbook
Support
How it works
Resources
Exam catalog
Mountain with a flag at the peak
Textbook
Introduction
1. Anatomy
2. Microbiology
3. Physiology
4. Pathology
5. Pharmacology
6. Immunology
7. Biochemistry
8. Cell and molecular biology
9. Biostatistics and epidemiology
10. Genetics
10.1 Fundamentals
10.2 Inheritance patterns
10.3 Trinucleotide repeat disorders
10.4 Chromosomal disorders
10.5 Population genetics
10.6 Additional information
11. Behavioral science
Wrapping up
Achievable logoAchievable logo
10.5 Population genetics
Achievable USMLE/1
10. Genetics

Population genetics

4 min read
Font
Discuss
Share
Feedback

For a large population with random mating, the Hardy-Weinberg law lets you calculate the frequency of each of the three genotypes from the frequency of the individual alleles (and vice versa).

The Hardy-Weinberg equation is:

p^2+2pq+q^2 = 1

p+q = 1, and p^2+2pq+q^2 = 1

Here, p and q are the frequencies of two alleles of a gene:

  • p = frequency of the normal allele
  • q = frequency of the disease-causing allele

The genotype frequencies are:

  • p^2 = frequency of homozygous normal
  • 2pq = frequency of heterozygotes (carriers in AR diseases)
  • q^2 = frequency of homozygous disease state

In autosomal recessive (AR) diseases, q^2 equals the incidence of the disease.

For example, assume the frequency of the recessive allele for cystic fibrosis (an AR disease) is 1/50. That means q = 1/50. Applying the Hardy-Weinberg law:

  • Incidence of cystic fibrosis = q^2 = (1/50)^2 = 1/2500
  • Carrier frequency = 2pq

Since you know q and p+q = 1:

  • p = 1 − 1/50 = 49/50 = 0.98 (approximately 1)

So:

  • 2pq = 2 × 1 × 1/50 = 1/25

Another example: if the frequency of Tay-Sachs disease (an AR disease) in the Ashkenazi Jewish population is 1/400, that is another way of saying q^2 = 1/400. Therefore:

  • q = 1/20

So the carrier frequency is:

  • 2pq = 2 × 19/20 × 1/20 ≈ 1/10

When a couple presents for genetic counselling, the risk of an inherited disease in their offspring depends on:

  • the inheritance pattern (AR, AD, XR, XD, or mitochondrial)
  • whether each parent is a carrier or affected

If one or both parents have no family history of the inherited disorder in question, their risk is similar to that of the general population.

The equation needs to be interpreted differently for X-linked disorders. Because males are hemizygous for X-linked genes, the frequency (incidence) of an X-linked disorder in males is directly equal to the frequency of the disease allele in the population, which is q. In contrast, the frequency of affected females is q^2, because females must inherit two disease alleles (two affected X chromosomes) to be affected.

For example, if the frequency of the allele for color blindness is 0.08 (8/100):

  • 8/100 males will have color blindness (frequency = q)
  • (8/100)^2 = 0.0064 = 64/10,000 females will be color blind (frequency = q^2)

In X-linked recessive disorders, phenotypically normal females may be either homozygous normal or heterozygous (carriers). So, the frequency of phenotypically normal females is:

  • p^2 + 2pq

In the above example:

  • p = 1 − 0.08 = 0.02

In X-linked dominant conditions, q equals the disease frequency in males, because males have only one X chromosome.

LOD or logarithm of the odds score: It is a statistical estimate of whether two genes, or a gene and a disease, are linked to one another. Genes are said to be linked when they are located close together on a chromosome, so they tend to be inherited together because the frequency of recombination between them is quite low. LOD score is used for linkage analysis. Positive LOD scores favour the presence of linkage, whereas negative LOD scores indicate that linkage is less likely.

An LOD score > 3.0 is considered evidence for linkage. A LOD score of 3 means the odds are a thousand to one that the two genes are linked and therefore inherited together. On the other hand, a LOD score < −2.0 is considered evidence to exclude linkage.

Linkage equilibrium and disequilibrium: Two or more alleles are said to be in linkage equilibrium when they occur randomly in a population. Conversely, alleles are in linkage disequilibrium when they do not occur randomly with respect to each other. Positive linkage disequilibrium exists when two alleles occur together on the same haplotype more often than expected, and negative linkage disequilibrium exists when alleles occur together on the same haplotype less often than expected.

In other words, if you multiply the frequencies of each allele occurring independently (for example, “p×q”), then:

  • positive linkage disequilibrium is implied when observed frequency > p×q
  • negative linkage disequilibrium is implied when observed frequency < p×q

Sign up for free to take 3 quiz questions on this topic

All rights reserved ©2016 - 2026 Achievable, Inc.