No Slide Title

Download Report

Transcript No Slide Title

DNA Microarrays for Biomedical Research:
Wonders and Cautions
Daniel R. Salomon, M.D.
Department of Molecular and Experimental Medicine
The Scripps Research Institute
If you don’t understand anything
today - it is my fault.
This is just technology.
What is a gene?
30-35,000
Open Reading Frames
(ORFs)
Genome
Transcriptosome
Proteosome
300-500,000
Proteins
Environment: ECM, Ischemia, Stress
Growth Factors
Cytokines
Development/Differentiation Programs
Baseline/Constitutive
Activated
What happens immediately, what happens next?
Tissues are complex mixtures
of different cell types.
Infiltrating
Inflammatory
Cells
Endothelium
Epithelium
Mesenchymal Elements
How are signal pathways
regulated by transcription?
Gene Discovery vs. Custom
Arrays
• “Gene Discovery Arrays”
Affymetrix GeneChips
Incyte
Rosetta
Filter arrays
• “Custom DNA Microarrays”
Low-cost
Replicate data points
Focused sets of genes
Affymetrix Chip Technology
Affymetrix: Photolithography-based Gene Chip Printing
Affymetrix Chip System
Chip
Hybe/Wash
Station
Array
Scanner
Informatics
Work Station
Processing Microarray Image Data - Step 1
Scanning Image
Red - Cy5
Green - Cy3
Yellow - Both
Processing Microarray Image Data - Step 2
Creating an image mask
Processing Microarray Image Data - Step 3
Analyzing probe intensity at a single spot
Processing Microarray Image Data - Step 4
Comparing multiple time points or conditions
Image Analysis
• ImaGene (Biodiscovery)
• QuantArray (GSI Lumonics)
• Affymetrix GeneChip software
Data Analysis
•
•
•
•
•
Excel
GeneSpring (Silicon Genetics)
Affymetrix GeneChip software
Cluster/Treeview (Stanford)
NFUEGO, Promoter Cruncher,
Chip Annotation databases
Image files
(from Affymetrix or
Custom slide arrays)
To
Web
Server
Microarray specs
Patient/Clinical data
Database
Annotation
Database
Promoter
Cruncher
User Workstations
ImaGene, GeneSpring, Cluster/Treeview
GeneChip, NFUEGO
A
B
C
Affymetrix Chip HG U95A
X60957
Human mRNA for putative receptor tyrosine kinase
X65962
H.sapiens mRNA for cytochrome P-450
X68149
Homo sapiens BLR1 gene for Burkitt''s lymphoma receptor 1
X68149
Homo sapiens BLR1 gene for Burkitt''s lymphoma receptor 1
X68277
H.sapiens CL 100 mRNA for protein tyrosine phosphatase
X07820
Human mRNA for metalloproteinase stromelysin-2
U48705
Human receptor tyrosine kinase DDR gene, complete cds
U50648
Human interferon-inducible RNA-dependent protein kinase (Pkr) gene
U51004
Homo sapiens protein kinase C inhibitor (PKCI-1) mRNA, complete cds
Y09305
H.sapiens mRNA for protein kinase, Dyrk4, partial
U53442
Human p38Beta MAP kinase mRNA, complete cds
U70981
Human interleukin-13 receptor mRNA, complete cds
U85611
Human DNA-PK interaction protein (KIP) mRNA, complete cds
J00219
Human immune interferon (IFN-gamma) gene, complete cds
Incyte Genomics: Human UniGene I Array
(8393 unique genes or ESTs)
Clone ID
Accession #
Gene Name
1362728
1363074
1363684
1363832
1364004
1364225
1365434
1365507
1365962
1365975
1366043
1366085
1366602
1366614
1366817
1366978
1367201
1367516
1367527
1367862
1368173
1368319
1368493
1368653
AF068236
BF035921
NM_014823
NM_006405
AA778107
AW136140
AI627624
BE045743
U75898
AA595575
BE043061
NM_001129
AI416967
N68666
R58925
AF007170
AI754198
AA813998
AF072164
H39214
Y00698
AA348317
AK000005
X57548
nitric oxide synthase 2A (inducible, hepatocytes)
lymphocyte cytosolic protein 1 (L-plastin)
KIAA0344 gene product
transmembrane 9 superfamily member 1
Homo sapiens mRNA; cDNA DKFZp586O2124
Homo sapiens cDNA: FLJ23053 fis, clone LNG02858
zinc finger protein 195
LBP protein 32
heat shock 27kD protein 2
Homo sapiens cDNA: FLJ23516 fis, clone LNG04848
Homo sapiens cDNA FLJ12366 fis, clone MAMMA1002411
AE-binding protein 1
MUM2 protein
f-box and leucine-rich repeat protein 7
EST
DEME-6 protein
KIAA0076 gene product
ESTs
HsHomo sapiens HSFE-1 mRNA, partial cds
ESTs
phosphofructokinase, muscle
ESTs
FLJ00005 protein
cadherin 2, type 1, N-cadherin (neuronal)
http://www.gene.ucl.ac.uk/cgi-bin/nomenclature/
Human Gene Nomenclature Database: Genew3 Search
Search of Approved Symbols AND Literature Aliases from this page [help]
This public copy of the database was last updated on Wed Apr 11, 2001
Now containing 12892 active gene symbols
and 8700 literature aliases
and 2634 withdrawn symbols
Quick search by first letter of symbol [help]
ABCDEFGHIJKLMNOPQRSTUVWXYZ
Page 1 of 4: Cadherins
Symbol
CDH1
CDH2
CDH3
CDH4
CDH5
CDH6
CDH7
CDH8
CDH9
CDH10
CDH11
CDH12
CDH12P
CDH13
CDH15
CDH16
CDH17
CDH18
CDH19
CDH20
CDH21
CDH23
CDH24
Full Name
Cytogenetic Location
cadherin 1, type 1, E-cadherin (epithelial) 16q22.1
cadherin 2, type 1, N-cadherin (neuronal) 18q12.1
cadherin 3, type 1, P-cadherin (placental) 16q22.1
cadherin 4, type 1, R-cadherin (retinal)
20q13.3
cadherin 5, type 2, VE-cadherin
reserved
cadherin 6, type 2, K-cadherin (fetal kidney) 5p14-p15.1
cadherin 7, type 2
18q22-q23
cadherin 8, type 2
16q22.1
cadherin 9, type 2 (T1-cadherin)
reserved
cadherin 10, type 2 (T2-cadherin)
5p13-p14
cadherin 11, type 2, (osteoblast)
16q22.1
cadherin 12, type 2 (N-cadherin 2)
5p14-p13
cadherin 12 (N-cadherin 2) pseudogene
5q13
cadherin 13, H-cadherin (heart)
16q24.2
cadherin 15, M-cadherin (myotubule)
16q24.3
cadherin 16, KSP-cadherin
16q21-q22
cadherin 17, LI cadherin (liver-intestine)
8q22.2-q22.3
cadherin 18, type 2
5p15.1-p15.2
cadherin 19, type 2
18q22-q23
cadherin 20, type 2
18q22-q23
cadherin 21
reserved
cadherin related 23
10q21-q22
cadherin-like 24
reserved
PubMed ID
9925936
2384753
1427864
10191097
2059658
7743525
9615235
9615235
2059658
2059658
9615235
7731968
7731968
8673923
1427864
9721215
9615235
9030594
11090341
PubMed Reference for Cadherin 2
N-cadherin gene maps to human chromosome 18 and is
not linked to the E-cadherin gene.
Walsh FS, Barton CH, Putt W, Moore SE, Kelsell D, Spurr N, Goodfellow PN.
Department of Experimental Pathology, UMDS, Guy's Hospital, London, England.
cDNA clones encoding the human N-cadherin cell adhesion molecule have been isolated from an embryonic muscle library by screening
with an oligonucleotide probe complementary to the chick brain sequence and chick brain cDNA probe lambda N2. Comparison of the
predicted protein sequences revealed greater than 91% homology between chick brain, mouse brain, and human muscle N-cadherin cDNAs
over the 748 amino acids of the mature, processed protein. A single polyadenylation site in the chick clone was also present and duplicated
in the human muscle sequence. Immediately 3' of the recognition site in chick a poly(A) tail ensued; however, in human an additional 800 bp
of 3' untranslated sequence followed. Northern analysis identified a number of major N-cadherin mRNAs. These were of 5.2, 4.3, and 4.0 kb
in C6 glioma, 4.3 and 4.0 kb in human foetal muscle cultures, and 4.3 kb in human embryonic brain and mouse brain with minor bands of
5.2 kb in human muscle and embryonic brain. Southern analysis of a panel of somatic cell hybrids allowed the human N-cadherin gene to be
mapped to chromosome 18. This is distinct from the E-cadherin locus on chromosome 16. Therefore, it is likely that the cadherins have
evolved from a common precursor gene that has undergone duplication and migration to other chromosomal locations.
http://www.gdb.org/
Search
Genomic Segments
All Biological Data
People
Citations
by
Name/GDB ID
Keyword
DNA Sequence ID
Submit
Gene List - Accession #, Map Coordinates
Gene Map of Chromosome 8q13.3-8q24.22
Displayed by MapView 2.4
Custom Microarray Printing
• Glass slide substrate
• 500-10,000 spots/slide
• Robotic spotting device
• cDNA clones
• PCR products
• Oligonucleotides
(50-70mers)
The Mguide: A complete guide to building your own microarrayer.
http://cmgm.stanford.edu/pbrown/mguide
Oligonucleotides (50-70mers)
•
•
•
•
are sequence verified
eliminate clone library banking
eliminate PCR and PCR contamination issues
are inexpensive (500ng of purified 75mer will
print 12,000 arrays each with 3 replicate spots)
• can be directed at specific exons to detect
splice variants
• can be designed to distinguish closely related
genes
Array-based genotyping vs.
gene expression analysis
Gene Expression
Genotyping
• 1-40 markers/sample
• multiple samples/chip
• 100-10,000 genes / chip
• sample tracking
• automated sample
and reagent handling
• 2-4 color fluorescence
detection
•
•
•
•
• 1-2 samples/chip
library of array elements
high speed/high density printing
data interpretation
database management
:::::::::::::::
:::::::::::::::::::::::::
:::::::::::::::
:::::::::::::::::::::::::
:::::::::::::::
:::::::::::::::::::::::::
:::::::::::::::
:::::::::::::::::::::::::
:::::::::::::::::::::::::
Solid-phase primer extension assay:
“Mini-sequencing”, “Genetic Bit Analysis (GBA)”
1. Hybridization of template DNA to solid-phase primer
5’
X=C/T polymorphism
3’
Hybridized
template
TACGTGACTGATGCTTAGCTTCAATGAGXTGATGTAG
|||||||||||||||||||||
GACTACGAATCGAAGTTACTC
Solid-phase oligonucleotide
Glass slide
3mm
A Microarray Strategy
Gene Discovery
(large arrays, limited
numbers of samples)
Custom DNA microarrays
(small arrays, large
numbers of samples)
Genotyping (SNPs)
The challenges for chip technology I.
• Incredible amounts of “data”
•
•
•
•
An incomplete database
Technical issues involving multiple technologies
Relatively primitive tools for handling data
Major issues for statistical methodologies
Normalization
Replicates
Significant changes
The challenges for chip technology II.
• Major challenge to scientific thought and method
Big science or “organismal” biology
Hypothesis-driven vs. “Fishing Expedition”
• How is the transcriptosome regulating the proteome?
Clinical Specimens
Laser Capture Microscopy
RNA/Signal Amplification