Characterizing Alternative Splicing With Respect To

Download Report

Transcript Characterizing Alternative Splicing With Respect To

Characterizing Alternative
Splicing With Respect To
Protein Domains
BME 220 Project
Charlie Vaske
Overview
Background
 Methods and Data
 Preliminary Results
 Preliminary Conclusions

Big Picture
Genome
DNA
Transcriptome
RNA
Proteome
Protein
Splicing in Higher Eukaryotes
Genome:
5’ Intergenic
Gene
Intergenic
3’
Exon
3’
Transcription of pre-mRNA
5’ Exon
Intron
GT
Donor Site
Exon
AG
Acceptor Site
U1
Intron
GT
AG
U2
Splicing to mRNA
Transcriptome:
5’ Exon
Exon
Exon
3’
Alternative Splicing
5’ Exon
Intron
GT
5’
Exon
GT
GT
Intron
Exon
GT
5’ Exon
5’
pre-mRNA
Exon
AG
AG
Intron
GT
AG
Intron
GT
AG
Intron
GT
AG
Exon
GT
Intron
AG
GT
GT
GT
Exon
3’
Exon
3’
Exon
3’
Exon
3’
AG
AG
Microarrays For Alt. Splicing
Use short oligonucleotides
 Get a guess at the rate of expression of
the oligo

Exon 1
Exon 2
Exon 3
Exon 4
Exon 5
Affymetrix
Microarrays For Alt. Splicing
Exon 1
Exon 2
Exon 4
Exon 5
Exon 3
Isoform 1:
Exon 1
Exon 2
Exon 4
Exon 5
Isoform 2:
Exon 1
Exon 3
Exon 5
Probe types
Constitutive
Junction
Exon
Unique (“Cassette”)
Expression
Ideal Microarray Readings
a
Isoform 1:
b
a
Exon 1
c
Probe
d
c
Exon 2
Exon 4
Exon 5
b
Isoform 2:
a
Exon 1
e
d
Exon 3
Exon 5
e
Probe types
Constitutive
Exon
Junction
Unique (“Cassette”)
Motivation
Why alternatively splice?
 How does it affect the resulting proteins?
 Look at domains:

High level summary of protein
 ~80% of eukaryotic proteins are multi-domain
 Domains are big relative to an exon

Some Previous Work

Signatures of domain shuffling in the
human genome. Kaessmann, 2002.
Intron phase symmetry around domain
boundaries

The Effects of Alternative Splicing On
Transmembrane Proteins in the Mouse
Genome. Cline, 2004.
Half of TM proteins studied affected by altsplicing.
Method
Predict Alternative Splicing
 Predict Protein Domains
 Look for effects of Alt-Splicing on
predicted domains

“Swapping”
 “Knockout”
 “Clipping”

Microarray Design
Genes based on mRNA and EST data in
mouse
 Mapped to Feb. 2002 mouse genome
freeze
 ~500,000 probes (~66,000 sets)
 ~100,000 transcripts
 ~13,000 gene models

Technical work
Genome Space
Provided data
gene models
transcripts
Overlap
Probe to transcript mapping
Generated Data
Overlap
E@NM_021320
cc-chr10-000017.82.0
G6836022@J911445
cc-chr10-000017.91.1
G6807921@J911524_RC cc-chr10-000018.4.0
probes
Predicting Alternative Splicing
Using mouse alt-splicing microarrays
 Data from Manny Ares

8 tissues
 3 replicates of each tissue

Predicting Alternative Splicing

General Approach: Clustering, then AntiClustering
107 Clusters
QuickTime™ and a
TIFF (LZW) decompressor
are needed to see this picture.
Detail View
Predicting Alternative Splicing

Cluster pairs have both anti-correlation and overlap
First Attempt at Predictions
Concerned with prediction quality
 Only took clusters-pairs with anticorrelation less than -0.9

QuickTime™ and a
TIFF (Uncompressed) decompressor
are needed to see this picture.
QuickTime™ and a
TIFF (Uncompressed) decompressor
are needed to see this picture.
First Attempt at Predictions
Greater than -0.9 anti-correlation
 121 genes
 ~60 named genes
 Many of these have documented isoforms

Predicting Protein Domains
Used local install of InterPro
 Only used pfam

Based on sequence motifs

Liberal e-value cut-off: ~1e-10
Technical work
Genome
Space
Transcriptome
Space
Proteome
Space
mRNA
Amino Acid
Sequences
transcripts
Predicted
Domains
Predicted
Domains
Predicted
Domains
“Swapping”
I define to be:

Genome base pair annotated with >1 domain
Exon
Exon
Exon

Exon
2 isoforms share a domain, then each has a
domain of different types on the same side
Exon
Exon
Exon
Exon
“Knockout”

Cassette exon indels a predicted domain
Exon
Exon
Exon
Exon
Exon
or
Exon
Exon
Exon
Exon
Exon
“Clipping”

Lengthening/shortening of a domain
Exon
Exon
Exon
Exon
Exon
Exon
Exon
Exon
Exon
Results: Selected Prediction

3’
C
Rabggtb
5’
N
Preliminary Conclusions
Only a few genes examined
 Analysis pipeline in infancy
 Not thoroughly tested

I do have alternative splicing events
 Example and literature suggest that some
effects will be found
