Some Practical Consideration For Planning a Genetic
Download
Report
Transcript Some Practical Consideration For Planning a Genetic
Selecting TagSNPs in Candidate
Genes for Genetic Association Studies
Shehnaz K. Hussain, PhD, ScM
Assistant Professor
Department of Epidemiology, UCLA
[email protected]
Epidemiology 244: Cancer Epidemiology Methods
Objectives
Molecular genetics primer
Databases and tools to conduct in silico
analyses for tagSNP selection/prioritization
Central dogma
ATCG
DNA
mRNA
Protein
What are SNPs?
More than 99% of all nucleotides are the same
in all humans
1% of nucleotides are polymorphic
SNPs>> insertions-deletions
Bi-nucleotide – T (80%)
Where do SNPs occur?
Exons
Introns
Flanking regions
A (20%)
What are haplotypes?
A haplotype is the pattern of nucleotides on a
single chromosome
Two “copies” of each chromosome
The haplotype inference problem
?
T
T
?
C
G
G
T?
A
A
TA TT CG GG TA AA
?
A
T
?
G
G
?
A
A
What is linkage disequilibrium?
Linkage disequilibrium (LD) describes the nonrandom association of nucleotides on the
same chromosome in a population
One nucleotide at one position (locus) predicts the
occurrence of another nucleotide at another locus
No LD
LD
What are markers?
Disease
Phenotype
Test for association
between phenotype and
marker loci
Test for genetic
association between the
phenotype and the DSL
LD
Candidate gene
Marker loci
(SNPs)
Disease
Susceptibility
Locus
What are tagSNPs?
TagSNPs are a subset of all SNPs in a gene
that mark groups of SNPs in LD
Avoids redundant genotyping
LD
Marker loci
(SNPs)
LD
Disease
Susceptibility
Locus
The joint effect of tagSNPs in
cytokine genes and cigarette
smoking in cervical cancer risk
T-cell proliferation
IL-2
IL-2 gene
IFNγ gene
IL-2
receptor Proliferation
Proliferation
of
ofTH1-cells
TH1-cells
IFNγ
Activated T-cell
Background
Cigarette smoking ↑ 1.5- to 3-fold cancer risk
Cigarette smoking ↓ levels of IL-2 and IFNγ
(cervical and circulating)
↓ levels of IL-2 and IFNγ
HPV persistence in the cervix
Cervical neoplasia
Decreased survival from invasive cervical cancer
Model
Cigarette smoking
SNPs in IL-2,
IL-2R, and IFNG
HPV-associated
squamous cell
cervical cancer
Methods
Study design
Population-based case-only study
Subjects
308 Caucasian squamous cell cervical cancer cases
diagnosed 1986-2004
Residing in 3 western Washington counties
Data collection
Structured in–person interviews
DNA isolated from buffy coats
Multi-stage tagSNP design
Select reference panel
Re-sequence panel, identify SNPs
(many markers, few subjects)
Choose tagSNPs
Genotype tagSNPs in main study
(few markers, many subjects)
1. Select reference panel
A sample of your study population
Most representative
Samples from the Coriell Repository
Ability to integrate your data with other
resources
= Candidate gene SNPs
= HapMap SNPs
2. Re-sequence reference panel
Amplify and Sequence DNA
Gene
Phred
Phrap
(Ewing, 1998)
(Ewing, 1998)
PolyPhred
(Nickerson, 1997)
Alternatives to re-sequencing
Program for Genomic Applications (PGA)
SeattleSNPs – inflammation
NIEHS SNPs – environmental response
Innate Immunity
International HapMap Project
5 million SNPs in four ethnically distinct
populations
3. Choose tagSNPs
Option
LDSelect
Tagger
(Carlson, 2002) (de Bakker, 2005)
r2 threshold
Yes
Yes
SNP exclusions/inclusions
No
Yes
SNP design score
No
Yes
LDSelect output for IL-2
SeattleSNPs, r2≥0.80, MAF ≥0.05, Caucasian
Bin
Total Number
of Sites
1
2
2
2
TagSNPs
rs2069763
rs2069772
rs2069776
rs2069778
3
2
rs2069777
rs2069779
4
1
rs2069762
Genomic context
Exons (cSNPs)
SIFT (Ng, 2002)
PolyPhen (Ramensky, 2002)
Upstream flanking region
Intron-exon junctions
Sequence conservation
UCSC Genome Browser, PhasCons (Siepel,
Score
2005)
Repeat region
Unique region
TagSNP summary
Efficient yet comprehensive coverage of the
genetic variation in our candidate genes
Reduce costs
Preference should be given to putatively
functional variants:
Literature, gene context, sequence conservation
Thanks for your attention!
Questions?