The Hardy-Weinberg principle and equations
and . Frequencies stay constant when no evolution occurs.
The Hardy-Weinberg principle (G.H. Hardy, mathematician, and W. Weinberg, physician, both 1908) states that in an idealised population, the allele and genotype frequencies remain constant from generation to generation. The principle provides a baseline — a null hypothesis — against which to test whether a real population is evolving.
The two equations.
For a gene with two alleles in the population, let:
- = frequency of the dominant allele
- = frequency of the recessive allele
Equation 1 — allele frequencies: (because every copy of the gene is either dominant or recessive, so they must sum to 1).
Equation 2 — genotype frequencies (after one generation of random mating): where:
- = frequency of homozygous dominant (e.g. AA)
- = frequency of heterozygous (e.g. Aa) — the factor of 2 is because Aa and aA are equivalent
- = frequency of homozygous recessive (e.g. aa)
Derivation (intuitive). If alleles are sampled at random from the gene pool for fertilisation, the probability of an AA offspring is ; aa is ; and Aa or aA is . The three probabilities sum to .
Why is the most useful quantity. Many genetic diseases (cystic fibrosis, phenylketonuria, sickle-cell anaemia) are caused by homozygous recessive genotypes (aa). The observed disease frequency therefore directly gives . From there:
- (take the square root).
- .
- Carrier (heterozygous) frequency = .
This calculation is widely used in genetic counselling to estimate the chance that an apparently healthy person carries a disease allele.
Worked example — phenylketonuria (PKU). PKU affects 1 in 10,000 newborns in some populations (autosomal recessive).
- (PKU allele frequency = 1%)
- (carrier frequency = 2%, or about 1 in 50 people)
Note that carriers vastly outnumber affected individuals (2% vs 0.01% = 200× more carriers). This is why recessive disease alleles persist — most copies hide in heterozygotes who are themselves healthy.
- (allele frequencies sum to 1).
- (genotype frequencies sum to 1).
- = homozygous dominant; = heterozygous; = homozygous recessive.
- Disease frequency ; ; carriers .
- Cystic fibrosis: 1/2500 → = 0.02 → carriers ~1/25.
- Carriers always far outnumber affected for rare recessive diseases.