Transcript Document
MGH-PGA
Genomic Analysis of Stress and Inflammation:
Sequence Analysis of Pseudomonas aeruginosa Strain PA14
Nicole T. Liberati, Dan G. Lee, Jacinto M. Villanueva
and Frederick M. Ausubel
Department of Molecular Biology
Massachusetts General Hospital
Kate Montgomery, Wade Brown, David Sarracino and
Raju Kucherlapati
Harvard-Parners Center for Genetics and Genomics
PA14 Genomic Sequence
PAO1*
vs.
PA14
• (Less pathogenic)
• (More pathogenic)
• 6.26 MB
• 6.54 MB
• 5,570 genes
• ~5,800 genes
*Stover et al, 2000, Nature 406: 959-964.
Comparison of PA14 and PAO1 Genomes
% killing in a murine sepsis model compared to 100% for PA14
50
38 25
100
100 75 63 56
67 0 50 38
38
insertions from non-redundant library in unknown ORFS
PAPI-1
(108 Kb)
PA14
(6.54 Mb)
PAO1
(6.26 Mb)
• 96.3% of the PAO1 genome is present in PA14; 92.4% of PA14 is present in PAO1.
• regions of PA14 inverted relative to those in PAO1 are shown as black vertical lines.
• regions of either genome absent in the other are indicated by violet vertical lines highlighted by arrows.
• 313 PA14-specific ORFS; 145 PA01-Specific ORFS.
• Several large blocks of PA14 genes (pathogenicity islands) not present in PA01.
• magnified view of the largest PA14 pathogenicity island (PAPI-1) shown at top. Arrows indicate positions of hypothetical ORFs for
which a transposon insertion strain in the ORF has been tested in mice. The percent mortality is indicated above each arrow (wildtype PA14 results in 100% mortality).
A Second Example of a PA14-Specific ORF Cluster
PAO1
PA14
No hits
6 contiguous hypothetical proteins
from Yersinia pestis strain KIM
• PA14 has a 7-ORF insertion in between genes corresponding to PA2422 and PA2423 of
PAO1.
• 1 ORF (5998) has no homology to any entry in the NCBI nr database.
• 6 ORFs (5999-6004) are similar to 6 contiguous hypothetical proteins from Yersinia pestis (strain
KIM). The order an orientations of the genes is identical to that of Y. pestis. Gene identities
range from 34% to 66%, and similarities range from 49% to 79%.
PA14 Proteomics for Validation of
PA14 Annotations
1.
Bacteria grown to mid-log phase, washed and frozen.
2.
Sonicated in presence of Urea, SDS, etc.
3.
Sample was reduced and alkylated.
4.
Proteins were separated by SDS-PAGE.
5.
In-gel tryptic digests.
6.
Analyzed only peptides from low-molecular weight proteins by MS in the initial pilot
experiment.
•
A number of PA14-specific ORFs (i.e. - not present in PAO1) have been verified in this initial
pilot proteomic analysis.
7.
A large scale analysis of the entire proteome (all molecular weight fractions) has
been initiated.
PA14 Proteomics:
Example of a Validated PA14-Specific ORF
PAO1: gene PA3362 lies in between lecB
and amiR.
• PA3362 is annotated as a hypothetical protein.
PA14: ORF 6327 lies in between lecB and
amiR.
• 6327 is 53% identical (76% similar) to a
conserved hypothetical protein in Pseudomonas
putita.
• 6327 does NOT have a homolog in PAO1.
• 6327 has been validated by proteomics.
Genomic Typing via Spotted
Oligo Microarrays
1.
Microarray for transcriptional profiling AND genomic typing.
2.
Shear genomic DNA from a panel of P. aeruginosa strains:
•
4-base cutters
•
DNaseI
3.
Label genomic DNA:
•
End labeling
•
Random hexamer labeling with aa-dUTP
4.
Hybridize to microarray
5.
Determine which genomic regions are present/absent in each test
strain. Those “PA14-specific” regions that are conserved in other
clinical isolates are of particular interest.
Hybridizations of PA14 and PAO1 Genomic DNA
to Pilot Microarray
• A pilot array was produced consisting of 95 oligonucleotides
(and one blank) spotted in quadruplicate on each slide:
• 14 oligos were designed for P. aeruginosa sequences present in both
PA14 and PAO1
• 36 oligos were designed for P. aeruginosa sequences present in PA14
(but absent in PAO1).
• 32 oligos were designed for P. aeruginosa sequences present in PAO1
(but absent in PA14).
• 13 oligos were designed for P. aeruginosa sequences absent in both
PA14 and PAO1.
• Genomic PA14 and PAO1 DNA was digested with a 4-base
cutter (HaeIII), purified, labeled, and hybridized to the pilot
array.
Hybridizations of PA14 and PAO1 Genomic DNA to Pilot Microarray
PAO1 Genomic DNA Hybridization
PA14 Genomic DNA Hybridization
PAO1
PA14
18000
40000
16000
35000
14000
30000
12000
25000
10000
20000
8000
15000
6000
10000
4000
5000
2000
0
0
0
10
20
30
40
50
60
70
80
90
100
0
10
20
30
40
50
60
70
80
90
100
• Labeled PAO1 DNA (left) or PA14 DNA (right) was hybridized to the microarray and hybridization
intensities (corrected for background) are shown.
• Turquoise bars below each graph indicate probes corresponding to sequences present in the strain.
• An arbitrary cut-off to assign sequences as present or absent was not based on absolute intensities,
but rather normalized values in the following manner: each intensity was divided by the average
intensity for the entire sample to adjust for overall intensity differences between samples, and any
probe with an adjusted intensity greater than an 0.5 was defined as present in the genome.
• Using this criterion, 94% of the probes were correctly assigned for PAO1 (6 probes in regions absent
in PAO1 were incorrectly assigned as present) and 94% of the probes were correctly assigned for
PA14 (3 probes in regions present in PA14 were incorrectly assigned as absent, and 3 probes in
regions absent in PA14 were incorrectly assigned as present). Incorrect assignments based on this
arbitrary cut-off are indicated by white circles.