Molecular Genetics

profileyjysupergirl
SlideSet8-12.pdf

Back to the basics: Pedigrees and inheritance patterns

8

Autosomal dominant • affects either sex • transmitted by either sex • 50% chance of transmission * (*) for affected x unaffected with affected being heterozygous

Autosomal recessive • affects either sex • affected people often have unaffected

parents (asymptomatic carriers) • 25% chance of being affected

Achondroplasia (dwarfism) Type 1 Waardenburg syndrome (including deafness) Huntington’s disease (adult onset neurological deterioration)

only need one chromosome copy to have disease

Equally

very rare to mate with someone with same autosomal dominant disease

Unborn baby

= related by blood

Not sex-linked

I

I

II

X-linked recessive • affects mainly males • mother unaffected carrier • no male to male transmission • 50% transmission in males

X-linked dominant • affects either sex (females>males) • males transmit to daughters only (100%) • females transmit to 50% of offspring • females more mildly or variably affected

Y-linked • affects only males • all affected have affected father (no known human character besides maleness itself)

Red-green color blindness Hemophilia Duchenne muscular dystrophy

since its X linked it cant be transmitted through a Y chromosome

due to X chromosome inactivation

more likely to be lethal in males, so that's why it is seen higher rate in females

no known disease that looks like this!

:-. -

Locus heterogeneity: same clinical phenotype can result from mutations at several independent loci (example: congenital deafness). This can lead to genetic complementation.

Allelic heterogeneity: mutations in one gene can lead to different clinical manifestations (example: Duchenne vs. Becker muscular dystrophy).

Duchenne MD = deletions of multiple exons leading to out-of-frame mutations Little to no functional DMD protein is produced.

Becker MD = point mutations or deletions that restore frame Some level of functional DMD protein is produced. Severity of symptoms varies according to protein levels and functionality

same clinical manifestation produced by different genes

ex) BRCA1/2 phenotype of mutations within these genes have very different phenotypes for severity of breast cancer more disfunction will cause worse diseaes

mutations in gene A or gene B can cause the same disease

Ex aaBB and AAbb can both have the disease if they mate, their children will be hers so none of them will have the disease most human traits are controlled by many genes not only one gene

both deaf affected people

none of the children are deaf

they are all carriers for mutant alleles not do not show phenotype

mutations in same gene can cause different diseases

Dosage effect genes... proteins are usually not a monomer, they are usually multiple subunits. If 3/6 subunits are mutated, it will not function if its a dominant mutation even though 3 of the subunits are functional.

less severe than Duchenne MD

Mutate many differentgenes and result in same disease

Different mutation in SAME gene

-

-

- - - -

r o

- -

C l

-

Further complications to basic Mendelian pedigrees

Common recessive condition leads to appearance of pseudo-dominant inheritance patterns

Cystic Fibrosis (1 / 2,000 individual is a carrier)

High frequency of recessive alleles?

Geographical distribution of endemic malaria and mutant hemoglobin alleles in Africa T.E. Wellems, R.M. Fairhurst Nat Genet, 37 (2005)

heterozygote advantage?

CFTR gene- transport of chloride ions in mucous, respitory problems. mutant alleles may be common since they provide advantage against respitory infections in cold areas like Europe

High transmission rate, affects males and females...O is mutant allele married to asymptomatic carrier. There are many carriers in the populations. frequency of recessive alleles in population leads to large prevalence of recesive disease

can appear to be a dominant condition, but actually just high frequency of recessive alleles in form of carriers in population

Single cell anemia common to Africa. Hb mutation is thought to be an advantage against malaria

all affected are double O (00)

⑥ 0 @ ⑥

• O O

Incomplete penetrance

Polydactily: autosomal dominant, 70% penetrance; BRCA1: autosomal dominant, 75-85% lifetime risk

Age-related penetrance.

Huntington syndrome, a dominant form of dementia due to triplet repeat amplification.

Curve A: probability of showing the disease phenotype as a function of age; curve B probability that an unaffected individual with an affected parent carries the disease allele as a function of age.

carriers are unaffected since there is incomplete penetrance, 70% risk of getting it, 30% chance of not getting. Depends of expression of mutation, by gene expression.

ONLY FOUND IN AUTOSOMAL DOMINANT

Grandma is affected

Mom is not affected

But her child is affected??

curve A: past age 20, there is increased likelihood as age increases B: as you reach 30+, you probably don't have the disease

If you receive the mutant chromosome 70% chance that will show

(Mom was a carrier)

We all have the BRCA 1/2 gene its whether or not mutation is present

positive: 50% chance to transmit to children. can select non-mutated gene so can prevent transmit on onto children negative: Find out that this disease could kill you. Can destroy the family structure

Gene Test?

> 3bases repeated

Variable expressivity: only some of the symptoms are presented by patients

Waardenburg syndrome

New spontaneous (de novo) mutations

color part of shape if they have some of the traits.

family has no history, there may be de novo mutations that cause new disease

(dominant condition)

Inbreeding: an X-linked recessive condition shows apparent male to male transmission and an affected female

Inbreeding effects

Inbreeding: happens! (cultural, geographical, religious) concentrates rare recessive alleles in a family & results in high frequency of recessive diseases

Nonsyndromic Congenital Retinal NonAttachment (NCRNA) Ghiasvand et al. (2011) Nature Neuroscience. Affects 1% of Kurdish people living in a group of neighboring villages in North Khorasan, Iran.

heterozygosity decreases homozygosity increases

inbreeding increases homozygousity inbreeding concentrates rare recessive alleles

females are carriers .

concentration of mutation

Inc frequency of rare mutations

#

#

Inbreeding concentrates rare recessive alleles (this can be useful for mapping disease-associated genes)

Nature Genetics (2015)

• Icelandish population is recent (est. ~900 AD) • The isolation of Iceland means more consanguinity • Good health records • Best characterized population genetically (deCODE

Genetics)

Iceland- recently populated. found 7,000 LOF autosomal SNPS ( knockout) in thousands of genes. what happens when you knock out genes?

consanguinity- blood relation

Inbreeding concentrates rare recessive alleles (this can be useful for mapping disease-associated genes)

Nature (2017)

can provide insight into gene function by gene knockout in humans.

APOC3 prevents against heart disease. its a good mutation. can lower triglycerides in the blood if you have this mutant allele.

Founder effects also lead to reduced genetic diversity Nature Genetics (Sept. 2017)

The caste system in India has led to the existence of thousands of reproductively isolated populations with evidence of profound genetic bottlenecks.

founder effect- some local human poplations can be founded by a few individuals. The founders are the ancestors of a village. the founders result from a bottleneck. There is a high degree of consignuity

This study looked at S Asia, viewed as many small groups of populations. Could identify 81 unique groups within the population that descent from founder events more extreme than those in the past. High rates of recessive disease due to founder effects (not inbreeding). .

we know diseases in jewish pop, can observe for presence of disease, can do genetic testing etc

• The size of human families tend to be small and collecting large pedigrees is often impossible.

• Careful, well-preserved medical records are often not available

But there are exceptions…

Nature Genetics 38, 184 - 190 (2006) Published online: 22 January 2006; | doi:10.1038/ng1728 Spectrin mutations cause spinocerebellar ataxia type 5 Yoshio Ikeda1, 2, 10, Katherine A Dick1, 2, 10, Marcy R Weatherspoon1, 2, Dan Gincel5, Karen R Armbrust1, 2, Joline C Dalton1, 2, Giovanni Stevanin6, Alexandra Dürr6, Christine Zühlke7, Katrin Bürk8, H Brent Clark3, 4, Alexis Brice6, Jeffrey D Rothstein5, Lawrence J Schut9, John W Day2, 4 & Laura P W Ranum

Limitations of human genetics how do we know if a gene is genetic? look at inheritance patterns, look at twins

people move, die etc.

ex. intellectual disability requires very careful diagnosis etc.

ataxia- defects in walking. can be linked to seizures Concordance: 60% likely to share concordance vs 40% ex) congenital deafness: 30% genetic (made up of 40% syndrome, 60% unknown genetic causes). 40% is acquired, due to infections of ears, injury. 30% is unknown causes.

Twins: share 100% of genome Fraternal: Brothers and Sisters who happen to be twins: 50% on average but can range from 0-100% depending on meiosis outcome.

ex. twins are genetically identical. they should have the same disease frequency etc. concordance rate: rate of disease without impact from environment

Gene mapping

The gene hunter minimal tool kit:

• Genetically-determined trait (disease, physical attribute, …) with precise phenotypic description

• Families in whom the trait is segregating (large # of offspring, high # of generations, good medical records, inbreeding)

• Markers (high density, high polymorphism, easy/cost-effective to assay)

• Goal of the game: identify markers that are as closely genetically linked as possible to the unknown disease gene (i.e. F(rec) is as low as possible)

there are a lot of markers along the genome, multiple alleles of a marker

9

Genetic mapping using linkage analysis

An ancestral mutation (A) for a dominant disorder segregates through human pedigrees…

…and the chromosomes get broken down through meiotic recombination

…but sequences in the immediate vicinity of the disease mutation remain relatively constant because two closely linked sequences tend to recombine at very low frequencies

Goal of the game: identify markers that show tight genetic linkage (low or no recombination) with the (unknown) disease-causing mutation

unknown disease gene, marker A is super close to the gene

gives us info on the recombination fraction/ frequency = theta

identify the markers that are in the same haplotype with the mutant allele.

affected

To be able to correctly infer genetic linkage we need to:

- Understand what genetic markers are - Understand what the effect of DNA recombination is on the inheritance of

DNA segments across generations - Understand how patterns of recombination (crossover) operate

Genetic markers used in linkage analysis

Two key attributes of markers are density and polymorphism!

use Restriction enzymes to cut and map SNPS near RE sites

ton of alleles, highly variable can use in forensics

only 2 chromosomes- so only 2 different alleles per locus. but high density because they are everywhere

The maximal recombination fraction between two markers is 50%

Recombination Fraction is the unit for measuring distance between marker

RF is ideally 0 for linked trait and marker

R= recombinant

Total#of Never more than recombinantChr

=-

501. RF Total # of gametes

Counting the number of chiasmatas during meiosis informs about the total number of genetic exchanges, i.e. the size of the genetic map

Each chromosome undergoes 1-2 obligatory crossover at meiosis

Old modern

SNP mapping in families can help identify recombination sites

Also, sharing of chromosome regions between individuals is most often due to inheritance from an ancestor (identity by descent, IBD)

Credit Dr. Graham Coop - UCD

Effect of crossing over

\

Genetic relatedness decreases every generation

1 2

3 4

5 6

7 8

9 10 11 12

13 14

15 16

17 18

19 20

21 22

Your genome in your Mum

1 2

3 4

5 6

7 8

9 10 11 12

13 14

15 16

17 18

19 20

21 22

Your genome in your Maternal

grandma

1 2

3 4

5 6

7 8

9 10 11 12

13 14

15 16

17 18

19 20

21 22

Your genome in your Maternal

granddad

1 2

3 4

5 6

7 8

9 10 11 12

13 14

15 16

17 18

19 20

21 22

Your genome in your Dad

1 2

3 4

5 6

7 8

9 10 11 12

13 14

15 16

17 18

19 20

21 22

Your genome in your Paternal

grandma

1 2

3 4

5 6

7 8

9 10 11 12

13 14

15 16

17 18

19 20

21 22

Your genome in your Paternal

granddad

Credit Dr. Graham Coop - UCD

inherited bits and pieaces of

Patent& grandmother 1grandfather 's

gene

sister brother

SNP-based mapping of shared blocks between sibs

white- no sharing due to random segregation... one copy from mom and other from dad so likely that sibilings got opposite ones dark blue-both chromosomes are shared (2)-Both got the same chromosomes from mom&dad Light blue.-shares 1 chromosome

Average about 50 % sharing

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

Your maternal Grandmother’s Contribution to you

Your maternal Grandmother’s Contribution to your cousin

Overlap of Grandmother’s Contribution to you & cousin

Genetic relatedness decreases non-linearly along family tree

Credit Dr. Graham Coop - UCD

Genetic relatedness decreases non-linearly along family tree (now with 3rd cousins)

● ● ● ● ●

● ●

● ●

● ●

● ●

● ●

● ●

● ●

5 10 15

0 10

00 20

00 30

00 40

00 50

00

Generations ago

N um

be r o

f a nc

es to

rs

● ● ● ● ●

● ●

● ● ● ● ● ●

● ●

● ● ● ● ●

5 10 15

0. 0

0. 2

0. 4

0. 6

0. 8

1. 0

Generations ago

P ro

b. a

nc es

to r i

s ge

ne tic

a nc

es to

r

You have vast numbers of ancestors… but most of them are not genetic ancestors

Ancestors (2n)

Ancestors who contributed to your genome

Credit Dr. Graham Coop - UCD

prob Not genetically I related

, passed 11 .

I

l l

Genetic ancestors

All ancestors

Credit Dr. Graham Coop - UCD

way more ancestors than genetic ancestors

T j parent grandparent

Many Americans likely will have many identifiable 3rd and 4th cousins (10s -- 100s) in large genomic databases. Likely have vast numbers of >5th cousins but they won’t be easily identifiable.

Credit Dr. Graham Coop - UCD

Implications for genealogy hunters… Establishing genetic ancestry can be tricky...

The vast majority of Warren’s DNA — 95 percent — indicated European ancestors. But five genetic segments were identified, with 99 percent confidence, as being associated with Native American ancestry. The largest segment identified was on Chromosome 10.

“While the vast majority of the individual’s ancestry is European, the results strongly support the existence of an unadmixed Native American ancestor in the individual’s pedigree, likely in the range of 6-10 generations ago,” the report said.

https://www.washingtonpost.com/politics/2018/10/18/ just-about-everything-youve-read-warren-dna-test-is- wrong/

… and useful!

Golden state killer identified through matches with 3rd and 4th cousins in GED Match database.

What your 3rd and 4th cousins do may end up affecting you!!

Meiotic recombination is a complex multi-step process that requires a large number of proteins

DSB

Resection

Strand invasion

CO or NCO intermediate

Resolution

Baudat et al., (2014) Nat Reviews Genetics

displacement loop: blue strand pairing against the homolog trying to find exactly complementary region to the sequence

proteins polymerize on to the single strand & searchs the homolog for the complementary region of DNA

RAD 51= most Important protein,the one actually performing thehomology search

PMDR9 to open DNA

form DSB

cut back ends to make ssDNA

D loop formed by strand invasion

resolution

most are non crossovers produced by gene conversion

crossover, chiasmata

Binding of MostofDNA

PRDM 9Occurs found lat loops .

at the tip of loopsof 100ps DNA

recombination occurs at

broken the axis

exposesthe

singlestrand Rare

- Most common

Meiotic recombination is a highly regulated and orchestrated process

Baudat et al., (2014) Nat Reviews Genetics

• Meiosis initiates via the introduction of programmed DSBs • Many more DSBs are created than will become crossovers • Breaks repaired through NCO pathways might permit proper

alignment through homology sensing

To find the right alignment non-crossover can still be useful . So breaking chromosomein 100's of places, the chromosomes can interact with eachother,sensing eachotherand make sure alignment is correct.

MEI4- works with PMRD9 creates on avg 150 DSB per cell we only have 1-2 crossovers and 43 chromosomes should only have 40 breaks on avg

yH2AX- DNA damage response DMC1/RAD51 repair homologs DSB

SYP3- holds homologs together. chiasmata form

pairingsynapses

↳ gehrxomo

DNA Dama ! some

marker Double strand breaks .

Recombination hotspots are genetically determined by the PRDM9 protein binding to a 13-mer motif

Recombination hotspots vary between individuals, between sexes and between populations

so what this means people from west African ancestry have a different Variant of PRDM9 then people of European ancestry. and as a result the protein binds to slightly different different sites such that some sites are only active in people from west African and industry ancestry because they are bound to the west African version of PRDM9 and some sites are only active in the European ancestry because they are bound by the European ancestry variant of PRDM9. which really tells you that location over combination hotspot is genetically determined and controlled by this one protein called PRDM9.

PRDM9 directs recombination away from genesso genes are not broken

13 bp sequence which is a recombination hotspot

there are 2500 hotspots that are active in Africans but not Europeans, there are genetic variants ( alleles) in different people that determines where recombination will be

no Cpg islands since it is meiosis specific ( expressed only in germ cells undergoing meiosis)

more recombinations in Africans vs Eurasian ancestry on average

I

*

PRDM9 functions by opening up chromatin and allowing Double Strand DNA (DSB) break formation

Baudat et al., (2014) Nat Reviews Genetics Grey et al., (2018) PLoS Genetics

PR/SET: Histone Methyltransferase catalytic domain KRAB: Histone binding module (function unclear) C2H2 zinc finger: DNA binding module

(1) (2) (3) 1. Binding to site and

chromatin modification

2. Recruitment to chromosome axis

3. DSB formation by SPO11

has a ZF protein that binds 13 bp sequence in DNA in the form of chromatin

PR SET domain is a catalytic domain that acts as a methyl transferase to add methyl group to H3K4me3 Histone 3, lysine 4 so the chromatin becomes open. allows it to open dna to break and repair

opensup the ← additional 3 histone tail chromatin . methyl groups chemically modified(Inc accessibility)

Brings the binding

site

to the central

axis

steps for PRDMI towork

synaptonomal complex Doublestrand bleak

PRDM9 and its binding sites evolves quickly!

Detection of PRDM9-dependent sites in inter-specific strains of mice. (A) PRDM9Dom2 and PRDM9Cst differ both in the number of zinc finger domains and sequence-specific identity of individual domains. (B) Identification of PRDM9-specific chromatin binding sites over a representative 100-kb window of chromosome 1. Peaks can be unique (Boxed) or shared between the two strains of mice.

Dom. Cst.

Baker et al., (2014) Genome Research

Consensus binding sites to PRDM9 alleles from Mus domesticus (left) and Mus castaneus (right)

Carries variants of prdm9 So therefore the bin ding sites of protein will be slightly different. So the dam and meitok recombination location will be different between ppl of different ancestry

Each finger corresponds to one base -linear arrangement of fingers are different.

where DNA breaks IS NOT random, dictated by location of PRDM 9 Because PRDM9 = product of genes. The genetic recombination & hotspots ate genetically determined

amino acid codes .

I

Genetic recombination rates vary between individuals

These patterns tend to be inherited as a result of distinct PRDM9 alleles: genetic control of DNA recombination!

Different alleles of PRDM 9 slome are Not good at binding, so low recombination frequency -because of the way oocytes do meiosis (are vulnerable to aneuploidy ) A good way to buffer/balance aneuploidy is to have more crossovers. -people with more crossovers are able to have children later in life because their oocytes have chromosomes that have Not apart fallen

Much more susceptible for aneploidy and therefore much less likely to have children late.

some individuals have different recombiation rates.

high recombination is passed on

more crossovers- less likely to have NDJ in meiosis if there are more of them

what are the genes involved in controlling meiotic recombination?

each dot is a single person

aufoiiakotorrt

Genetic maps are non-random and sex-specific

Males tend to recombine next to telomeres. This may reflect the fact that the first events of chromosome synapsis occur near telomeres from yeast to vertebrates.

Females tend to recombine away from telomeres AND have higher density of crossovers. Could be due to evolutionary pressure to avoid non-disjunction during MI arrest.

We used genotypes from high-density single nucleotide polymorphism (SNP) markers of 2,315 individuals and their children from two Caucasian populations to characterize meiotic recombinations.

Chowdhury et al., 2009 PLoS Genetics

red- females blue- males. higher at telomeres both have no recombination at centromere

High density analysis of recombination rates

Females are more likely to recombine more than males. More within the chromosome (middle) instead of at tips like males.if crossover is at the tip, much more likely to dissolve over time.Who are now two or three crossovers They are less likely to fall apart

Reflects the evolutionary pressure on oocytes

peaks-_ recombination hotspots

Malestend to recombine closer to the telomeres .

<

heend of chromosome

High density analysis of Double Strand Breaks (DSBs)

PAR DSBs in males

Sex-specific pattern of DSB usage

Brick et al., (2018) Nature

female MAPOFDSBON

Both specific germ cells(sperm& oocytes )

oocytes (

sperm

male specific

Fine scale mapping of recombination events reveals “hotspots” and “deserts”

recombination is not constant across the entire genome. there are hotspots which are the spikes. deserts- not no recombination activity- genes in one unit always segregate together. they never get broken from each other. It is flanked by recombination hotspots

Segment of DNA that does not tend to be broken by recombination and is inherited as a block I >

NO Recombination :haplotype block likely inherited asablock

together

Haplotypes and Linkage Disequilibrium

• Haplotype Block: segment of DNA that is flanked by recombination hotspots and therefore tend to be inherited as a block.

SNPs within a haplotype block segregate as one unit. This implies that a particular haplotype block can be “tagged” by one SNP – simplifies genotyping

• Linkage Disequilibrium: a mathematical measure of linkage between markers. Low LD indicates that markers are often broken apart by recombination High LD indicates that markers are highly correlated (tightly linked) = co-exist on one haplotype

you dont need to genotype every SNP. if you know one SNP in the block you can infer the rest without sequencing them all

low LD- markers are often broken by recombination within a haplotype ( there is a recombination hotspot between them. high LD-The markers are highly correlated and inherited together since there is no recombination. ( never broken by recombination)

LD and recombination are inversely correlated

SNPS located on the same haplotype block tend to not recombine & if they dont recombine = linkage disequilibrium

High LD : low recombination

'

he there needs tobe a hotspotbit them.

October 27, 2005

ENr131 2q37.1 ENm014 7q31.33

Comparison of LD and recombination at two ENCODE regions

Over 1 million SNPs typed in 269 individuals

The HapMap project first enabled large-scale mapping of recombination events and haplotype blocks

derived recombination maps and found recombination hotspots and deserts and allowed mapping of haplotype blocks

Haplotype Map project -

Not just diversity

also recombination

density

(Family trios)

Relationship between recombination rates, haplotype lengths and gene locations across 19q13

Fig. 10

• No obvious relationship with gene density • Recomb rate varies with PRDM9 binding site density. • PRDM9 sites appear to have evolved from Alu/THE1 transposable elements

high LD- little recombination

short haplotype length- lots of recombination every horizontal line= different person

genelocation doesntmatter

recombine alot

PRDM bindingsite

T

Genetic recombination is directed away from functional genomic elements in mice Brick et al., (2012) Nature

Interesting given that PRDM9 has been lost from several branches of the Vertebrate tree (Birds, Crocodiles, Amphibians) and specific species (Dogs)

Baker et al., (2017) eLife

PRDM9 can read & modify patchessanity

-

Length of LD spans across the genome (HapMap project)

Fig. 15 Of note: - LD is lower at chromosome ends -> chromosome ends have higher recombination rates - Centromeric regions show high LD -> counter selected for recombination - X chromosome has highest LD; it only undergoes recombination in females - African populations have lower LD than European

telomeres- males like to recombine around telomeres

for every chromosome

lower recombination, 1/2 time there is no homolog since males so less recombination overall

recombination is suppressed

inmales

First task: Identifying recombinants vs. non-recombinants

Practical aspects of genetic linkage mapping Gene mapping = identifying markers that are genetically linked to an unknown gene of interest.

Use recombination to position genes on the genome .

In close proximity i. recombine less frequently If on the same haplotype block they won't recombine at all or rarely .

* recombination -40W

very close proximity & high linkage

> a

pad MOM

10

Identifying informative vs un-informative matings

In this example, A1 is given as linked to a dominant disease gene in first generation.

Disease gene still linked to A1

Uninformative with regards to

recombination

No information Unclear if disease gene is linked with

A1 or A2

Disease gene still linked to A1

Uninformative with regards to

recombination

Disease gene now linked to A1

Informative with regards to

recombination

eithercanbe

from mom or dad

AZ

- lostsome info

At Az D=dominant

NO D-

' -a a.mnfrtamt.at,

d

'

ti: recombination

Distance bit marker & disease gene Al - - Az The likelyhood of crossover is high • ma

chromosome has atleast IN2 crossover. centromere so further distance in chromosome

very likely to crossover. ×

Not - Max recombination ratebit 2 chromosome IS 501. → likely 501. = NObetter than random assortment

informative - The 2 markers at the end of the chromosomeD - - d will show a high frequency of recombination .

informative' ffD µA2 -

Rgemcoarnnebri ,

nation freq will be much

- Much less likely for DSM

908hitmnegsuameeunnahpiox.pe Free hai p f Istianghethifn'8Ye "

- High LD

Establishing “linkage phase”

Is the specific allele of the marker you are following physically linked to disease gene (on same chromosome)?

In this example, A1 is given as linked to dominant disease gene in first generation.

grandparents are dead so we cant genotype them! we cannot esablish this linkage phase, the diseased allele could be on A1 or A2 due since there could have been a recombination

- we can't tell who is recombinant or not

disease gene is on the A1 chromosome

there was no recombination even in III-1 since we do not see A6 marker in III

in most pedigrees it is hard to determine linkage. Makes it hard to determine RF.

Gene mapping = identifying markers that are genetically linked to an unknown gene of interest. 1ststep: establishing linkage phase

recombinant bc it is affected but disease is on A1 normally, now it is on A2 allele

lost

recombination DON't know if )freq-- 1 mutation "

m.

""

BHAmarker linked to Al

Ot Az an & Unknown gene

recombinant Is unknown. (But Notcertain)climes =Lost phase

2nd step: Determine recombination frequencies of flanking markers, obtain evidence for linkage between disease allele and markers

• This needs to be done in the context of pedigrees, following the segregation of the desired trait

• To have sufficient statistical power, collecting data from many individuals is necessary

• The larger the recombination fraction is, the more informative samples are necessary to distinguish between linkage and random assortment. more recombination could lead to being confused with random assortment.

sample sizes .4 is very close to random assortment, would need 343 people to do this signifigantly

change markers , keep measuring recombination freq

(More the better)

- -frequency

(frequency )

>

Information increases .

Logarithm of Odds Ratio (LOD) scores

Provide a mathematical formula: ➢ that helps interpret potential linkage between trait and a marker ➢ that does not depend on knowledge of linkage phase ➢ and does not depend on knowledge of the actual recombination fraction

LOD score = Z = LOG ( )Likelihood that pedigree is explained by linkage to marker Likelihood that pedigree shows

no linkage

EXTRA BONUS: * LOD scores are additive across families * LOD scores can be calculated for many markers at once, even chromosome-wide

odds non recombinant/ linkage- (1-theta)^#non recomb X # recombinant

odds no linkage- (1/2)^#non recombinant

theta is a variable that can vary between 0 and .5

way toevaluate whetherthe marker or trait is linked to the disease gene

(mayor may not be linked)

Above 1 ④ →completely random

Below 1 ① assortment

LOD score analysis from 2 point mapping

Z > 3 or Z< -2 show significant evidence for or against linkage, respectively

At low recombination freq, there is significant evidence against linkage. Around 25% recombination freq, there is significant evidence for linkage. 1. Marker IS NOT closely linked to the disease gene 2. There IS Slg evidence for linkage between the marker & disease gene around 0.25% of recombination freq

Anything above 3= thousand times likely that pedigree is explained by linkage → considered to be statistical significant evidence for linkage So close to the disease marker SO likely on the same haplotype or unlike

-.

LOD score improves as recombinant freq increase. → significant evidence against linkage → random assortment of markers → Marker is no way near the disease

Ideal outcome → Z score = above 3 → As recombination freq increases, goes down to 0 → strong evidence that pedigree is explained for linkage at low recombination freq

Meansmarkerson the same

chromosome

Ihbtt Intermediate

Not informative Not significant

[ Yet.fm#8innaotn- frequency

,

evidence

T '

X 1013=0 NOInfo

¥ § NOW -2,100 times B- rely that pedigree is0.8 Ee explained by random ± = assortment .

→statistical significant evidence against linkage

all curves converge at 0.5 because recombination at 50% is about same as being on 2 diff chromosomes

RED- for small theta, there is evidence of no linkage ( not close). at large theta from .15-.3 there is linkage not random assortment. - prob on the right chromosome but still pretty far from diseased gene

PURPLE- no linkage for small RF, no info after that

GREEN- desired result. tightly linked, small RF are significant. the marker is close.

Multipoint LOD score across a region

Candidate region LOD score nearly constant indicates little if any recombination = haplotype block

can do this genome wide. each dot is a LOD score

abrubt transitions- recombination hotspots

haplotype on avg is 100,000 bp

Region of interest

diseased gene of interest

I

Further narrowing down regions by detecting rare recombination events within haplotypes

Darier-White disease: dominant inherited skin disorder mapped to a haplotype on 12q

Disease gene locates to the interval between D12S84 and D12S129

each number is an allele

red one- it is a rare recombinant

mom is homozygous for 5 5 so we cant determine where the crossover was

marker 2 and above does not cause disease. the disease haplotype is 52622.

622 is unaffected. so 622 does not contribute to disease New diseased haplotype is 52.

autosomal dominant disease

different family has different markers.

We can assume 838 is the diseased haplotype.

2 I

¥25] 24 2×2] l l 6 5 93 25 57

↳Beabletonarrow 25 ss

interval of hotspot

f) ① ←

Dad markers

r Unknown exact crossover location

9 R # Utt From

a.EE#ed Udnaafatected mom d l

.

Please ChalhlsNot located in 622

Mapping of a gene for autosomal-recessive profound congenital deafness in a large inbred family lots of inbredding. autosomal recessive so afected people must

be homozygous

likely that flanking markers above and below (marker not involved) contain the diseased region

62 is shared among affected individuals ONLY

Identifying a candidate gene region is often the beginning of a long road…

human genome browser

1:13:48 of lecture

12

started mouse set #

Genetic mapping of complex traits

11

• Most human traits are not controlled by one gene with a large influence on the phenotype ➔ NOT Mendelian.

• Instead, most traits are controlled by many genes exerting subtle effects on overall phenotype.

• Environment also exerts a significant effect on many traits (e.g. Type II diabetes)

Identifying causal variants is hard!

Adopted Twin Studies separated at birth ↳ environment taken out and evaluate the influence of the genome

Polygenic models of inheritance can account for the distribution of complex traits in populations

Rationalize that there is genetic contribution?

⇐ I

£ I

fitness

Polygenic diseases manifest when the balance between “good” and “bad” alleles tips towards presenting a phenotype – liability threshold

Sibs of affected people tend to carry more liability alleles

relatives BY being related to affected individual we know the # ofbad allele.

good bad

balance .

e

VISK IS much

higher

more likely to

carry liability alleius

An example of a disease with sex-specific thresholds

Pyloric stenosis: - affects about 1 in 2,000 births - narrowing of the opening in the stomach - Genetic and environmental causes - affects boys 4 times more often than girls

Green: general population Blue: sibs of affected males Red: sibs of affected females

How to deal with complex traits? 1- Determine if the trait is genetically determined

Twin studies. Concordance rate is significant between MZ twins Concordance rate is higher in MZ than DZ and unrelated individuals

2- Attempt to identify families showing near-Mendelian segregation See BRCA1 mapping example below

3- Develop adequate genetic mapping methods Association studies

MOMO

MZ > Dz fraternal

Mapping the BRCA1 breast and ovarian susceptibility gene (1994)

Caveat: BRCA1 mutations are responsible for only ~20% of familial risk and even less for spontaneous cases.

GWAS studies have identified at least 100 other genetic loci associated with increased risk

First incarnation: Sib-pair mapping.

Affected sib pairs are likely to share segments of DNA closely linked to the disease allele

Drawback: shared regions are usually huge!

Association studies aim to identify shared haplotypes between affected individuals

On average, full siblings share 50% of their genome

Dominant Recessive

We’re all sibs!

On average, 2 unrelated individuals in the UK share ancestors ~22 generations ago (circa 1500)

This implies that at some low frequencies, we all share alleles that are Identical by Descent (IBD)

This enable Association Studies in all populations!

World population= ancestors

implies that at some low frequency

we might all share alleles that are

inherited inthe big humantree .

Association studies

• aim to identify markers that are shared within populations of unrelated “affected” individuals (independent of inheritance mode, genetic linkage, recombination frequency, etc…)

• rely on the fact that our genome is composed of millions of haplotype segments which are segments of DNA that are flanked by recombination hotspots and therefore tend to be inherited as blocks

• rely on the fact that even unrelated individuals tend to have shared ancestry

➢ Individuals affected by diseases tend to share the mutation-containing haplotype segment(s) compared to the general population

Limitations: • Require many, many patients to uncover common variants • Hard to ensure homogeneity of cohorts (limited reproducibility) • Only capture a limited fraction of genetic susceptibility • Often identify variants “near” genes but provide little clue as to what the variants actually do to contribute to disease

Mapping of shared, ancestral fragments within unrelated patients with Nijmegen breakage syndrome (NBS)

Common region to all patients NBN gene: chr8:90,945,564-90,996,899

Regions broken by rec. events

Nijmegen syndrome: autosomal-recessive Characterized by chromosome breakage, microcephaly, immunodeficiency, susceptibility to cancer

First step: identified a haplotype block with strong linkage with disease.

Second step: look for ancestral segments share across unrelated patients.

Looked for shared region withth haplotypes

association study :

affected individuals are more likely to share disease alleles than

controlled individuals .

Manolio et al., Nature 2009

Various types of genetic “architectures” encountered in human genetics

Strong impact on the phenotype

very rare

Latest thoughts on disease and genetic architecture in human populations

- mutation concentrated in the family - or acquired new mutations

Examples of successful gene mapping of human disease traits

Genetics of obesity as seen through Genome Wide Association Studies (GWAS)

Obesity – 2010 update These variants only explain 6-11% of the genetic susceptibility…

GIPR: Gastric inhibitory peptide receptor – defects lead to impaired insulin signaling and higher blood glucose levels (link to T2D) FTO: m6A RNA demethylase / signals to brain circuitry / circadian rythm MC4R: Melanocortin 4 receptor POMC: Pro-opiomelanocortin BDNF: Neuronal transcription factor involved in brain-specific expression and splicing NRXN3: Neurexin 3 – receptor and cell adhesion in nervous system NEGR1: Neuronal Growth regulator

Manhattan plot

All Neuronal genes

Obesity – Oct 2017 update Nature Genetics

• Confirms major involvement of Central Nervous System in obesity • Discovers new role for immune function in obesity risk

• Double # of loci associated by BMI by analyzing a distinct population

How about Alzheimer’s Disease?

APOE: Apolipoprotein E – role in cholesterol metabolism CLU: Clusterin – prevents stress-induced protein aggregation BIN1: Bridging Integrator 1 ?? PICALM: phosphatidylinositol binding clathrin assembly protein

RISK factors

role In

Chloesterol metabolism

Alzheimer’s Disease cont’d: GWAS studies can have very high resolution if sufficient SNPs are tested

highly shared .

significant

peaks should

not really be read along

shared the right

Dots -_

satistical association values

zoomed in on a small region

How about traits such as intelligence?

Sniekers et al., Nature Genetics May 2017

negatively correlates

- #ignnitticana . >

high intelligence correlates w/

Nature (2016)

Genetic correlations between EduYears and other traits

positive negative

An example of modern GWAS resource

GWAS for height

identified 100 s of genes related to height

y differs BH people

Personal Genomics: hope, hype, risks

• Knowledge of genetics variants and their functional significance is steadily increasing

• Variants have significance for disease risks but also a variety of traits and geographical origins.

• Personal genomics is on the rise and private individuals can now obtain information on their own genomes at low cost.

Where we are today:

12

Personal Genomics: hope, hype, risks

Hope: • If I am at risk for a particular disease, knowledge is power

❑ “Passive” attitude: prepare for what’s to come ❑ Active attitude: correct the mutations (in self or in offspring)

Genetics + DNA editing = End of Genetic Diseases

Risks: • Genetic discrimination (confidentiality of information)

GATTACA http://www.imdb.com/title/tt0119177/ • Human germline editing?

https://en.wikipedia.org/wiki/He_Jiankui_affair

https://www.youtube.com/watch?v=k99bMtg4zRk CRISPR-Cas9 ("Mr. Sandman" Parody) | A Capella Science

Hype: • Do we really know what we are doing?

Aaaa

a Aaaa a Aaaa

1- Avoiding Transmission of Genetic Disease – Peter Braude / King’s College, London

2- Genetic Basis of Human Disease and Implications for Germline Editing – Eric Lander / MIT

Avoiding Transmission of Genetic Disease

Professor Peter Braude Division of Women’s Health

Kings College, London

Centre for Preimplantation Genetic Diagnosis

Avoiding Transmission of Genetic Disease

WHY? • The health legacy of genetic disorders • Reproductive options for couples with history of

recurrent genetic risk HOW? • Prenatal Genetic Diagnostic (PGD) and

Prenatal Genetic Screening (PGS)

Centre for Preimplantation Genetic Diagnosis

Health legacy of genetic disorders

Sporadic early embryonic loss Recurrent pregnancy loss Anatomical abnormality Mental disability Neonatal and childhood death Chronic disease and early demise Late onset disease

Centre for Preimplantation Genetic Diagnosis

Recurrent: As a result of inherited disorders

Sporadic:

Random, often age related

Two kinds of genetic risk

Centre for Preimplantation Genetic Diagnosis

Autosomal Recessive Genetic Disease

1:4 free of disease; 1:2 carriers – but asymptomatic; 1:4 will be affected

Cystic fibrosis Spinal muscular atrophy Sickle cell disease Tay-Sachs disease Fanconi Anaemia

Autosomal Dominant Genetic Disease

Huntington’s Disease Achondroplasia Marfan Syndrome Neurofibromatosis Retinoblastoma

50:50 (1:2) chance of passing on disease

Haemophilia

Genetic disease

Women are carriers but generally don't have symptoms; only males affected

x

from mom

Reproductive options for those with serious recurrent genetic risk

Gamete donation Adoption Remain childless

Reproductive roulette

Prenatal diagnosis and termination of pregnancy

Centre for Preimplantation Genetic Diagnosis

highriskof early termination

pregnancy

needleto collectDNA

placenta

*

Preimplantation Genetic Testing

perform genetictesting 10h the 1 Cell .

Preimplantation Genetic Testing

Detection of genetic information in an embryo made by examining a representative sample taken at a preimplantation stage of development

Centre for Preimplantation Genetic Diagnosis

Early human development in vitro

Late Day 1 2-cellsDay 1 Fertilised egg Day 2 4-cells Day 3 8-cells

Day 4 Morula

Day 5 BlastocystDay 6 Hatching blastocystLate Day 6 Hatched blastocyst

ICM

ICM

not fused yet

differentiation occurred

Tissues for Preimplantation Biopsy

Polar Body Blastomere Trophectoderm

egg cleavage stage blastocyst

collect from polar area that will body become placenta / or

Recurrent: As a result of inherited disorders

Sporadic:

Random, often age related

Two kinds of genetic risk Two kinds of genetic test

PGD

To diagnose known genetic condition

PGS

To screen for random aneuploidy to improve infertility outcome

Centre for Preimplantation Genetic Diagnosis

Diagnostic

screening

The principle of PGD

Select for transfer to the patient

affected affectedaffected unaffectedunaffected

Centre for Preimplantation Genetic Diagnosis

Questions: - selection of embryos - oocyte collection / IVF

- expensive - Best

oocyte to

grow is not

clear

affordability & equity

.

(quality of

being fait )

- not coveredby

insistence .

classic IVF

A broad look at PGD / PGS: ESHRE PGD Consortium members by country

number of centres: 11

number of centres: 86

number of centres: 27

Total number of centres: 124

(June 2015)

ESHRE 2015

Indications PGD & PGS

0

1000

2000

3000

4000

5000

6000

7000

PGS

social s exing

sexing for X-linked disorders

chromosomal abnormal ities

monogenic dis ease

ESHRE 2015

54,589 cycles

ESHRE 2015 Coonen: Unpublished and still to be verified data

clearly Hsing

y down

syndrome +

aneuploidy

Monogenics

Chromosomal

Sexing X linked

Aneuploidy

Social sexing 33 741 (58.0%)

668 (1.1%)

12 885 (22.2%)

9 081 (15.6%)

ESHRE 2015 Coonen: Unpublished and still to be verified data

REASONS FOR EMBRYO BIOPSY ESHRE Consortium data I-XV

Based on 54,589 cycles

Range of PGD Cases for 2014

Centre for Preimplantation Genetic Diagnosis

191 biopsy cases

CISAC

Huntington . Fibrosis

PGS leads to over-estimation of genetic risk due to cleavage stage mosaicism

15, 490 - 491 (2009)

Ext NK e I.ee#iiEiet et et r

µ 4¥

May die

& bereplaced by

normal embryo IS NOT Uniform .

cells .

48% of cells tested normal at blastocyst stage

52% not normal

Ermanno Greco, M.D. Maria Giulia Minasi, M.Sc. Francesco Fiorentino, PhD. European Hospital & Genoma Molecular Genetics Laboratory Rome, Italy

cells w/ chromosomal

abnormalities ended up dying and embryo is populated by normal cells

In the light of recent re-evaluations of the risk benefit ratio, it seems that PGS has been oversold

“Rather than improving outcome for childless couples, PGS encourages the waste of healthy embryos which are excluded from transfer to the uterus.”

“The procedure just appears to increase costs and complexities of IVF. Its utilization, at present, should therefore be acknowledged as highly

experimental and refuted in routine IVF care.”

Number of PGS cycles reaching a plateau?

0

500

1000

1500

2000

2500

3000

3500

4000

4500

PGS cycles over time

ESHRE 2015ESHRE 2015 Coonen: Unpublished and still to be verified data

4

0

5

10

15

20

25

30

35

40

45

Per Egg Collection Per Embryo Transfer

Results of 2091 cycles in one PGD only centre (1998 -2014)

Centre for Preimplantation Genetic Diagnosis

Ongoing Pregnancy Rate

Success rate of IVF after PGD is 30-45% doesn't effect

success rate

similar to

IVF

Human Diseases and Traits

Rare, Mendelian Cystic fibrosis, Huntington Disease, Diastrophic Dysplasia …

Common, polygenic Heart disease, Alzheimer’s Schizophrenia, Height, Obesity Intelligence? . . .

Avoid all cases of severe genetic disease Eliminate disease alleles from population

Eliminate disease risk ’Enhance’ human population

Goals of editing intervention:

Eric Lander - MIT

l

Rare Mendelian Disease: Dominant

D+ ++

D+ D+ ++ ++

Heterozygous parent Half of offspring affected Half of offspring unaffected

Can use pre-implantation diagnostics (PGD) PGD+germline editing adds relatively little

DD ++

D+ D+ D+ D+

Homozygous parent All offspring affected

Germline editing would be useful Homozygotes are extremely rare For Huntington’s disease,

only dozens of cases found worldwide

t.tt/FFHtO*; Aa

aa

in

-

Rare Mendelian Disease: Recessive

m+ m+

mm m+ +m ++

Heterozygous unaffected parents One-quarter of offspring affected

To avoid affected offspring: Can use pre-implantation diagnostics (PGD) PGD+germline editing adds relatively little

To avoid most cases of devastating genetic diseases, the most important intervention would be ensuring access to genetic testing so carrier couples know they are at risk

To eliminate disease alleles from population, we’d all need to use IVF – since we all carry multiple disease genes in heterozygous state

NO need for

editing

- O

=

Rare Mendelian Disease: Recessive

m+ m+

mm m+ +m ++

Heterozygous unaffected parents One-quarter of offspring affected

To avoid affected offspring: Preimplantation diagnostics available (PGD) PGD+germline editing adds relatively little

mm mm

mm mm mm mm

Homozygous parents All offspring affected

Germline editing would be useful Very rare, unless brought together by disease E.g.: Deaf parents with mutations in same gene

Human Diseases and Traits

Rare, Mendelian Cystic fibrosis, Huntington Disease, …

Common, polygenic Heart disease, Alzheimer’s Schizophrenia, Height, Obesity Intelligence? . . .

Avoid all cases of severe genetic disease Eliminate disease alleles from pop’n

Decrease disease risk ‘Enhance’ human population

Common, Polygenic Disease

Genetic variants have modest effects Handful: 3-5-fold 99+%: <1.2-fold

Why? Selection keeps strong alleles at

low frequency Disease processes are buffered

20 '

a effect

on thephenotype

strong alleles don't

have a strongphenotype .

Common, Polygenic Disease

Genetic variants have modest effects Handful: 3-5-fold 99+%: <1.2-fold

Why? Selection keeps strong alleles at

low frequency Disease processes are buffered

Schizophrenia

Population 1% risk C4 gene 1.1% risk

Polygenic risk score • Top 100 loci (statistically significant) Top decile: ~3% risk

• Top 10,000 loci (only a fraction significant) Top decile: ~10% risk

Common, Polygenic Disease

Is there a free lunch? Genetic variants have ‘pleiotropic’ effects

and environmental interactions

Inflammatory bowel disease Lower risk of: Higher risk of:

FUT2 Norovirus Crohn's & Type 1 diabetes IFIH1 Type 1 diabetes Crohn's disease RNF186 Ulcerative colitis Chronic kidney disease

Viral infection Lower risk of: Higher risk of:

CCR5 HIV West Nile (13x higher risk for fatal cases)

NO

⑦ Are there any detrimental sideeffects,yes .

-effect multiple

diffprocesses .

Common, Polygenic Disease: Germline editing

Avoid deleterious variants? Most have very small effects

Those with large effects usually rare: treat like Mendelian

Bestow protective variants with large effects? Very few examples overall

Moreover, want common (to assess impact in homozygotes) ideally, with no downsides (undesired pleiotropic effects)

can we NO

-

mm

canyou edit the genome to allow protective variants w/ strongeffect?

-

nm

↳ downside Unknown

Common, Polygenic Disease: Germline editing

Avoid deleterious variants? Most have very small effects

Those with large effects usually rare: treat like Mendelian

Bestow protective variants with large effects? Very few examples overall

Moreover, want common (to assess impact in homozygotes) ideally, with no downsides (undesired pleiotropic effects)

Best candidates: • ApoE2, 3 vs. ApoE4 (3%) Higher Alzheimer’s risk • PCSK9 null (<2%) Lower LDL levels, heart attack risk

Still, we have incomplete knowledge about pleiotropic effects If alleles are so good, why aren’t they at higher frequency?

Bad

<

-

• remove ApoE4 In favor of APOEL ,3 Few genes to edit )

The world’s first CRISPR-edited babies was a fiasco Target: CCR5 gene – HIV cell surface receptor Rationale: individuals naturally carry truncated copies of CCR5 and are immune to HIV. Father was HIV positive. Goal was to avoid HIV transmission. Ethically: consent was given by parents but using vague language. Study was cloaked in secrecy and not adequately reviewed. Preventing HIV transmission to offspring in IVF is routinely done with sperm washing, rendering editing unnecessary. Medically: edited babies may now face shortened lifespan. In addition, they may have higher risk of infection by influenza and West Nile.

Summary and Conclusion Rare, Mendelian diseases

• Vast majority of cases can be addressed by IVF and PGD • Some cases of compelling need, although rare • If we wish to avoid devastating genetic diseases, most important intervention

is ensuring couples have access to genetic testing to know they are at risk

Common, Polygenic diseases • Thousands of genes being identified

— revealing disease processes, pointing to therapeutic hypotheses • For vast majority of variants, impact on risk is small • Currently, at most a few plausible variants for editing

Conclusion • Genetic basis of human disease is complex • We still have a lot to learn • Before making permanent changes

to the human gene pool, we should use great caution