Human Origins

Download Report

Transcript Human Origins

Chapter 8
Proteomics
Using high-throughput methods to
identify proteins and to understand their
function
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Contents
 Definition of proteomics
 Proteomics technologies




2-D gel electrophoresis
Mass spectrometry
Protein chips
Yeast two-hybrid method
 Biochemical genomics
 Using proteomics to uncover transcriptional
networks
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
What is proteomics?
 An organism’s proteome
 A catalog of all proteins
 Expressed throughout life
 Expressed under all conditions
 The goals of proteomics
 To catalog all proteins
 To understand their functions
 To understand how they interact with each
other
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
The challenges of proteomics
 Splice variants create an enormous diversity of
proteins
 ~25,000 genes in humans give rise to 200,000
to 2,000,000 different proteins
 Splice variants may have very diverse functions
 Proteins expressed in an organism will vary
according to age, health, tissue, and
environmental stimuli
 Proteomics requires a broader range of
technologies than genomics
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Diversity of function in splice variants
 Example: the calcitonin gene
 Gene variant #1
 Protein: calcitonin
 Function: increases calcium uptake in bones
 Gene variant #2
 Protein: calcitonin gene-related polypeptide
 Function: causes blood vessels to dilate
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Posttranslational modifications
 Posttranslational modifications are defined as
any changes to the covalent bonds of a protein
after it has been fully translated.
 Proteolytic cleavage
 Fragmenting protein
 Addition of chemical groups to one or more
amino acids on the protein
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Chemical modifications













Phosphorylation: activation and inactivation of enzymes
Acetylation: protein stability, used in histones
Methylation: regulation of gene expression
Acylation: membrane tethering, targeting
Glycosylation: cell–cell recognition, signaling
GPI anchor: membrane tethering
Hydroxyproline: protein stability, ligand interactions
Sulfation: protein–protein and ligand interactions
Disulfide-bond formation: protein stability
Deamidation: protein–protein and ligand interactions
Pyroglutamic acid: protein stability
Ubiquitination: destruction signal
Nitration of tyrosine: inflammation
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Practical applications
 Comparison of protein expression in diseased
and normal tissues
 Likely to reveal new drug targets
 Today ~500 drug targets
 Estimates of possible drug targets: 10,000–
20,000
 Protein expression signatures associated with
drug toxicity
 To make clinical trials more efficient
 To make drug treatments more effective
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Technologies for proteomics
 2-D gel electrophoresis (2-dimensional)
 Separates proteins in a mixture on the basis of their molecular
weight and charge
 Mass spectrometry
 Reveals identity of proteins based on computer software that can
uniquely identify individual proteins
 Protein chips
 A wide variety of identification methods
 structure, biochemical activity, and interactions with other
proteins
 Yeast two-hybrid method
 Determines how proteins interact with each other
 Biochemical genomics (Enzymatic)
 Screens gene products for biochemical activity
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
2-D gel electrophoresis
 pH gradient along first
axis neutralizes
charged proteins at
different places
 pH constant on a
second axis where
proteins are separated
by weight
Basic
Low MW
 x–y position of proteins
on stained gel uniquely
identifies the proteins
Acidic
High MW
 Polyacrylamide gel
 Voltage across both axes
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Differential in gel electrophoresis
 Label protein samples
from control and
experimental tissues
 Fluorescent dye #1 for
control
 Fluorescent dye #2 for
experimental sample
 Mix protein samples
together
 Identify identical
proteins from different
samples by dye color
with
benzoic
acid
Cy3
without
benzoic
acid
Cy5
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Caveats associated with 2-D gels
 Poor performance of 2-D gels for the
following:




Very large proteins
Very small proteins
Less abundant proteins
Membrane-bound proteins
 Presumably, the most promising drug targets
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Mass spectrometry
 Measures mass-tocharge ratio
 Components of mass
spectrometer




Ion source
Mass analyzer
Ion detector
Data acquisition unit
A mass spectrometer
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Ion sources used for proteomics
 Proteomics requires
specialized ion sources
 Electrospray Ionization
(ESI)
 With capillary
electrophoresis and
liquid chromatography
 Matrix-assisted laser
desorption/ionization
(MALDI)
 Extracts ions from
sample surface
ESI
MALDI
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Mass analyzers used for proteomics
 Ion trap
Ion Trap
 Captures ions on the basis
of mass-to-charge ratio
 Often used with ESI
 Time of flight (TOF)
 Time for accelerated ion to
reach detector indicates
mass-to-charge ratio
 Frequently used with
MALDI
Time of Flight
Detector
 Also other possibilities
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
A mass spectrum
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Identifying proteins with mass
spectrometry
 Preparation of protein sample
 Extraction from a 2-D gel
 Digestion by proteases — e.g., trypsin
 Mass spectrometer measures mass-charge ratio of
peptide fragments
 Identified peptides are compared with database
 Software used to generate theoretical peptide mass
fingerprint (PMF) for all proteins in database
 Match of experimental readout to database PMF allows
researchers to identify the protein
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Stable-isotope protein labeling
 Stable isotopes used to
label proteins under
different conditions
 Variety of labeling
methods
 Enzymatic
 Metabolic
 Via chemical reaction
 Relative abundance of
labeled and nonlabeled
proteins measured in
mass spectrum
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Data from a MALDI experiment
 Distributions of
individual proteins in a
slice of rat brain
 Tissue is coated with
UV-absorbing matrix
 MALDI ion source
with laser sampling
tissue every 180 mm
 Mass-spectrum peaks
reveal individual
proteins
 Image processing for
false-color images
Sections of rat brain imaged
by mass spectrometry
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Limitations of mass spectrometry
 Not very good at identifying minute quantities
of protein
 Trouble dealing with phosphorylated proteins
 Doesn’t provide concentrations of proteins
 Improved software eliminating human
mediated analysis is necessary for highthroughput projects
 Are only able to identify hundreds of proteins
in a single day
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Protein chips
 Thousands of proteins
analyzed simultaneously
 Wide variety of assays






Antibody–antigen
Enzyme–substrate
Protein–small molecule
Protein–nucleic acid
Protein–protein
Protein–lipid
Yeast proteins detected
using antibodies
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Fabricating protein chips: physical array that can hold proteins, isolate them from
each other, and prevent them from becoming denatured
 Protein substrates:
minipads
 Polyacrylamide Polydimethylsiloxane
or
agarose gels
 Glass
 Nanowells
 Proteins deposited on
chip surface by robots
PDMS
UV
Quartz mask
Glass slide
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Protein attachment strategies
 Diffusion (pads)
 Protein suspended in
random orientation, but
presumably active
 Adsorption/Absorption
Diffusion
Adsorption/
Absorption
 Some proteins inactive
 Covalent attachment
Covalent
 Some proteins inactive
 Affinity
 Orientation of protein
precisely controlled
Affinity
antibodies
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Classes of capture molecules
Agent that interacts with molecules applied to the
chip to carry out some kind of assay
 Different capture molecules
must be used to study
different interactions
 Examples
 Antibodies (or antigens) for
detection
 Proteins for protein-protein
interaction
 Enzyme-substrate for
biochemical function
 analytical microarrays and
functional protein chips
Antigen–
antibody
Protein–
protein
Aptamers:
short peptides
Enzyme–
substrate
Receptor–
ligand
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Reading out results
 Fluorescence
 Most common method
 Fluorescent probe or tag
 Can be read out using standard nucleic acid microarray
technology
 Surface-enhanced laser desorption/ionization (SELDI)
 Laser ionizes proteins captured by chip
 Mass spectrometer analyzes peptide fragments
 Atomic-force microscopy
 Detects changes in chip surface due to captured
proteins
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Difficulties in designing protein chips
 Unique process is necessary for constructing each
probe element
 Challenging to produce and purify each protein on
chip
 Proteins can be hydrophobic or hydrophilic
 Difficult to design a chip that can detect both
 Protein’s function may be dependent on
posttranslational modification or an interaction with
another biological molecule
 Challenging and constantly improving with new
technological advancements
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Regulation of transcription
UE
TATA box
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Yeast two-hybrid method
 Goal: Determine how proteins interact with each other
 Method
 Use yeast transcription factors
 Gene expression requires the following:
 A DNA-binding domain
 An activation domain
 A basic transcription apparatus
 Attach protein1 to DNA-binding domain (bait)
 Attach protein2 to activation domain (prey)
 Reporter gene expressed only if protein1 and protein2
interact with each other
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
A schematic of the yeast two-hybrid method
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Results from a yeast two-hybrid experiment
 Goal: To characterize protein–protein
interactions among 6,144 yeast ORFs
 5,345 were successfully cloned into yeast as
both bait and prey
 Identity of ORFs determined by DNA
sequencing in hybrid yeast
 692 protein–protein interaction pairs
 Interactions involved 817 ORFs
 interactome on a genome-wide scale
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Caveats associated with the yeast
two-hybrid method
 There is evidence that other methods may be
more sensitive (protein chip)
 Some inaccuracy reported when compared
against known protein–protein interactions
 False positives
 False negatives
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Subcellular localization of the yeast
proteome
 Complete genome sequences allow each ORF
to be precisely tagged with a reporter molecule
 Tagged ORF proteins indicate subcellular
localization
 Useful for the following:
 Correlating to regulatory modules
 Verifying data on protein–protein interactions
 Annotating genome sequence
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Attaching a GFP tag to an ORF
Marker gene: HIS free medium
GFP
PCR product
HIS3MX6
Homologous
recombination
Chromosome
Fusion protein
ORF1
NH2
protein
ORF2
GFP COOH
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Location of proteins revealed
 75% of yeast proteome
localized
 > 40% of proteins in
cytoplasm
 67% of proteins were
previously un-localized
 Localizations correlate
with transcriptional
modules
cytoplasm
nucleus
A protein localized
to the nucleus
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Pregenomics biochemical assays
 Methods used to find genes responsible for
specific biochemical activity before the
inception of genomics
 Laboriously purify responsible protein
 Often expensive and time consuming
 Expression cloning
 Introduce cDNA pools into cells
 Look for biochemical activity in those cells
 Caveat: Often difficult to detect biochemical
activity in cell’s biochemical “background”
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Biochemical genomics
 Genome of an organism is already known
 Approach
 Construct plasmids for all ORFs
 Attach ORFs to sequence that will facilitate
purification
 Transform cells
 Isolate ORF products
 Test for biochemical activity
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Biochemical genomics in yeast
 6,144 ORF yeast strains made
 ORFs fused to glutathione S-transferase (GST) for
purification purposes
 Biochemical assay revealed three new biochemical
reactions associated with yeast ORFs
tRNA ligase
pre-tRNA
Ligated tRNA
tRNA halves
Abc1
35
64
Used as screening pools
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Microfluidics
 Proteomics requires
greater automation
 Microfluidics: a “lab on
a chip”
 Microvalves and
pumps allow control of
nanoliter amounts
 Can control
biochemical reactions
A microfluidics chip
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Microfluidics in action
loading
mixing
compartmentalization
purging
500 mm
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Finding transcription-factor targets
 All yeast transcription
factors were used to
make yeast strains
 Use chromatin
immunoprecipitation to
select factors attached to
promoter regions on
DNA (ChIP)
 DNA fragments used on
microarray to identify
transcription-factor
targets (ChIP on chip)
antibodies
Genomic DNA
Over 106 TFs of 141 are
active
© 2005 Prentice
Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Network motifs for transcriptional regulation
Autoregulation
Multicomponent loop
Multi-input motif
TF
Motif
Regulator chain
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Studying proteins with
posttranslational modifications
 Example: tyrosine phosphorylation
 Traditional method
 Radioactive labeling with 32P followed by gel
electrophoresis or chromatography
 Problems using mass spectrometry are being
overcome to allow high-throughput analysis
 Better purification techniques
 Mass spectrometers more capable of detecting
phosphorylated peptides
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
HUPO
 Human Proteome Organization (HUPO) was
established in 2002
 Mission
 Consolidate proteomics organizations in
different countries into single worldwide body
 Scientific and educational programs to help
spread proteomics knowledge and technology
 Coordination of public proteomics initiatives
 Examples of current initiatives
 Human Liver Proteome Project
 Human Plasma Proteome Project
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Future prospects
 The next decade may see the complete
deciphering of the proteome of yeast
 More initiatives, like the Human Liver
Proteome Project, are underway
 Better understanding of disease
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Summary I
 Goals of proteomics
 Identify and ascribe function to proteins under
all biologically plausible conditions
 Proteomics methods
 2-D gel electrophoresis for separating proteins on the
basis of charge and molecular weight
 Mass spectrometry for identifying proteins by
measuring the mass-to-charge ratio of their ionized
peptide fragments
 Protein chips to identify proteins, to detect protein–
protein interactions, to perform biochemical assays, and
to study drug–target interactions
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Summary II
 Proteomics methods (continued)
 Yeast two-hybrid method for studying protein–protein
interactions
 Biochemical genomics for high-throughput assays
 Some accomplishments of proteomics
 Example: yeast
 Yeast two-hybrid method reveals interactome
 Transcriptional regulatory networks deduced
 Biochemical genomics uncovers new ORF
functions
 Subcellular localization of proteins
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458
Summary III
 Future prospects
 Better technology for studying posttranslational
modifications
 ~10 years for completion of yeast proteome?
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey 07458