Computational Biology - University of Missouri

Download Report

Transcript Computational Biology - University of Missouri

Computational Biology
June 24, 2004
MUPGRET Workshop
Overview




Math
Statistics
Computer Models
Bioinformatics
Math and Science


Mathematics are an integral part of
science.
Used everyday by bench scientists to
perform experiments, interpret data,
and make predictions.
Math Examples



Making solutions
Plotting graphs
Calculating area
Area calculations




NIH Image Software
http://rsb.info.nih.gov/nihimage/Default.html
Allows you to measure length, width,
area, density on objects in a picture.
Free
Statistics and Science


Necessity for analyzing datasets.
Experiment must be well designed to be
meaningful.



Ex. replications and controls
Should know how you’ll analyze data
before you start the experiment.
Means, standard deviations, and linear
regression are often used.
Probability


Tests the likelihood that something will
or will not occur.
Used extensively in everyday life.




Las Vegas type gaming
Lotto
Insurance amortization
Decisions regarding medical treatment
Everyday examples

Rolling the dice



1 in 6 chance that you will roll a one with a
single die.
(1/6)2 = 1/36 chance you will roll snake
eyes.
Playing cards


4 in 52 chance (1/13) of drawing an ace at
random from a deck.
What’s the chance of a full house?
Biology examples





Punnett square
Nucleotide frequencies along a gene are
used to examine evolutionary forces.
Mutation rates
Testing limits and sample sizes for
transgenics.
DNA forensics
Computers





Data
Data
Data
Data
Data
quality
storage
analysis
validation
manipulation
Barcode systems
The “ics”




Genomics
Proteomics
Metabolomics
Bioinformatics
Bioinformatics



Revolutionized our ability to do biology
in much the same way as PCR and
robotics changed the bench science.
“the computational branch of molecular
biology” (Bioinformatics for Dummies).
a merger of computer science and
biology (Introduction to Bioinformatics)
Before bioinformatics

In vivo experiments


In the living organism
In vitro experiments

In a test tube



Manhattan Project
Space Program
Human Genome Project
Progress towards the HGP





1953-DNA structure
1975-Maxim and Gilbert DNA
sequencing
1977- First genome sequenced (x174)
1981-Human mitochondrial genome
sequenced
1984-Epstein Barr virus sequenced
Progress towards the HGP






1990- Human genome project launched
1992-TIGR formed
1996-High resolution map of the human
genome
1998-C. elegans genome sequenced
1999-Drosophila genome sequenced
2000-Draft sequence of human genome
completed.
Bioinformatics





Integration of computer science and
biology
Applied field
Inference
Connection
Prediction
The basics



DNA sequence  protein sequence
protein sequence  protein structure
protein structure  protein function
Bioinformatics




Computer simulation
Data management and retrieval
Pattern recognition
Artificial intelligence
Data management/retrieval




Database design and implementation
Data entry tools
Distributed computing
Querying tools
www.mgdb.org
Pattern Recognition

DNA sequence analysis



www.ncbi.nlm.nih
Geneology
Disease diagnosis
Artificial intelligence

Software learns from the data it is given
and modifies its programs to be more
efficient or to be more accurate.


Proteomics software
Disease diagnostic imaging
Computer Science



Algorithm-program that specifies how to
solve a problem
Data structure and information retrieval
Software engineering
The human side



Curation
Annotation
Quality control design
Examples of utility





Determining phylogenetic relationships
Sequence similarities
Protein structure prediction
Disease diagnosis
Pharmacogenomics
Detailed structure information

Requires crystallization of the protein.




Large amount of protein required.
Often time consuming.
Limiting step to high throughput.
Followed by X-ray crystallography or
NMR.

Determines position of each atom in the
molecule.
A Rational Approach


Christendat et al. 2000. Nat. Struct.
Biol. 7:903-908.
Determine structure of all proteins in
Methanobacterium
thermoautotrophicum.

1871 ORFs
The dilemma


Cell membrane is “semipermeable” and
comprised of phospholipids.
Only hydrophobic molecules can pass
through cell membranes.


Conversely, no charged (polar) molecules.
Water can pass through membranes.

Water is a polar molecule.
Aquaporin-1



First water channel protein cloned.
Water travels through aquaporin rather
than phospholipid bilayer.
Water can pass through but protons
can’t.


Membrane potential
Hydrogen gradients
Aquaporin



But protons can move along a column
of water so how does aquaporin
prevent this?
Monomer has 269 aa with 6 membrane
spanning domains.
Heterotetramer is the functional
molecule.
Aquaporin





Protein has a hourglass shape.
The narrowest place is 3.0 A wide (water is
2.8 A).
Passage is lined with hydrophobic aa that
help exclude other small charged molecules.
Predicts one water molecule passes through
at a time.
Hydrogen bond between molecules is
transferred to two asparagine molecules.
Fig. 6.11
Prions




Proteins that can change shape.
And make other proteins change their
shape!
As number of changed proteins
increases a phenotype is observed.
Causal agent of mad cow disease,
scrapie in sheep and Creutzfeldt-Jakob
disease in humans.
Prions II



Previously thought only nucleic acid
encoded changes caused disease.
Stanley Prusiner discovered prion’s
ability to change other protein’s
structure and won the Nobel Prize.
Sup35 is a prion-like protein in yeast.
Sup35



Translation termination factor
Carboxyl end binds to the ribosomal
complex to terminate translation.
If Sup35 is converted to an alternate
conformation (infectious prion
conformation) the shape change
spreads throughout the cell and is
passed to daughter cells.
Sup35



In prion conformation causes ribosomes
to read through stop codons altering
shape and function of proteins.
Fig. 6.13
Not adaptively advantageous so why is
it maintained?
Why?




True et al. 2000. Nature 407: 477-483.
Reduced translation fidelity, extends
proteins.
Some of these are antibiotic resistant.
Could lead to stabilization of new
phenotype under correct environment.
Introduction to Bioinformatics

www.oup.com/uk/lesk/bioinf