In Part 1, you learned how the Human Genome Project and Next-Generation Sequencing made it possible to read entire genomes—transforming genetics from an observational science into a data science. We can now directly see the DNA sequences that Mendel could only infer from breeding patterns.
But having sequencing data raises new questions: Which variants cause disease? How are they inherited? Why does one mutant copy sometimes cause disease while two copies are needed for others?
This chapter explores how Mendel’s 150-year-old principles apply to modern genomic data. The laws of segregation and independent assortment still govern inheritance, but now we observe them at the molecular level—tracking specific DNA variants through families, understanding why alleles are dominant or recessive based on protein function, and predicting pathogenicity from population data.
The shift is profound: Mendel saw 3:1 ratios in flower colors and inferred invisible factors. We sequence families, see the exact DNA variants, and watch them segregate according to Mendel’s laws—validating his insights while revealing the molecular mechanisms he couldn’t observe.
This chapter has seven interconnected sections that explain how we interpret variants in the NGS era:
What types of DNA changes exist (SNVs, indels, structural variants)? How do we distinguish disease-causing mutations from the millions of benign differences using databases like gnomAD and ClinVar? What tools predict pathogenicity when variants are novel?
How do we track variants from parents to children in sequencing data? What do Mendel’s laws look like when observed directly in sequence reads? How do we identify de novo mutations that weren’t inherited from either parent?
Why does one mutant copy cause disease in some cases? We’ll explore three mechanisms through clinical examples:
Why are some genes sensitive to losing one copy while others tolerate it? How do pLI and LOEUF scores measure constraint using population-scale data? Why does natural selection eliminate loss-of-function variants in critical genes?
What happens when both gene copies are disrupted? How does marriage between relatives increase autozygosity, revealing recessive alleles? How has studying consanguineous populations accelerated disease gene discovery?
What about large DNA rearrangements—deletions of entire genes, duplications causing dosage imbalance, inversions disrupting gene function? How do they form and why are they harder to detect than SNVs?
How do we combine all these concepts—variant type, inheritance pattern, gene constraint, population frequency, functional prediction—to determine whether a variant explains a patient’s disease?
Consider cystic fibrosis, a recessive disorder. The traditional genetic view:
The NGS molecular view:
This is the same inheritance pattern Mendel described, but now we see:
This chapter focuses on interpretation: once you have variant calls from NGS data, how do you determine which variants matter and why they cause disease?
Don’t memorize that “KMT2D causes Kabuki syndrome.” Instead, understand:
Interpreting variants requires integrating:
No single piece of evidence is sufficient. Pathogenicity is a weight-of-evidence determination.
The field is evolving rapidly. What’s uncertain today may be resolved tomorrow as databases grow.
Classical Approach | NGS-Era Approach |
---|---|
Observe phenotype ratios in crosses | Read actual DNA sequences in families |
Infer genotypes from breeding patterns | Directly see genotypes in sequence data |
Map genes to chromosomes via linkage | Identify causal variants in all genes simultaneously |
“This gene is dominant” (observed) | “This variant causes haploinsufficiency” (mechanistic) |
Study common Mendelian diseases | Solve rare diseases, understand complex traits |
Years per gene | Weeks for all genes |
Mendel’s laws remain valid—we’ve just moved from observing their effects to understanding their molecular basis.
By the end of this chapter, you should be able to:
Classify genetic variants by type (SNV, indel, structural variant) and explain detection/annotation approaches for each
Trace variant transmission through families using NGS data, identifying inherited vs. de novo patterns
Explain three dominant mechanisms (haploinsufficiency, gain-of-function, dominant-negative) with disease examples and predict pathogenicity based on mechanism
Interpret gene constraint metrics (pLI, LOEUF) and understand how natural selection shapes allele frequencies
Recognize recessive inheritance patterns in sequencing data, explain how consanguinity reveals recessive alleles, and understand biallelic requirements
Describe structural variant mechanisms (NAHR, mobile elements) and explain their phenotypic impacts
Integrate evidence (frequency, constraint, inheritance, function, phenotype) to assess variant pathogenicity
Apply Mendelian principles to genomic data, connecting abstract genetic rules to concrete molecular observations
Part 1 showed you how to read genomes. Part 2 shows you how to interpret them—how to look at millions of variants and determine which ones explain disease, how they were inherited, and why they have the effects they do.
Mendel discovered the rules by counting pea plants. We validate those same rules by reading DNA sequences. The principles haven’t changed. The resolution has.
Let’s see how Mendel’s laws manifest in modern genomic data.