CS790 – Bioinformatics
Download
Report
Transcript CS790 – Bioinformatics
CS790 – Bioinformatics
Introduction and overview
CS 790 – Bioinformatics
What is Bioinformatics?
DNA (and RNA)
Course Overview
Proteins
CS 790 – Bioinformatics
2
What is Bioinformatics?
Computational
Biology
Bioinformatics
Genomics
Functional
genomics
Proteomics
Structural
bioinformatics
Course Overview
CS 790 – Bioinformatics
3
Why is Bioinformatics Important?
Applications areas include
•
•
•
•
•
•
•
•
Medicine
Pharmaceutical drug design
Toxicology
Molecular evolution
Biosensors
Biomaterials
Biological computing models
DNA computing
Course Overview
CS 790 – Bioinformatics
4
The Role of Computational Biology
GenBank BASEPAIR GROWTH
Source: GenBank
3,841
Millions
4,000
3,500
3,000
2,009
2,500
2,000
1,160
1,500
1,000
652
1
2 3
5
10
16
24
35
49
72
101 157
217
385
500
0
82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99
3D Structures
Growth:
Source:
http://www.rcsb.org/pdb/
holdings.html
Course Overview
CS 790 – Bioinformatics
5
Fighting Human Disease
Genetic / Inherited
• Diabetes
Viral
• Flu, common cold
Bacterial
• Meningitis, Strep throat
Course Overview
CS 790 – Bioinformatics
6
Drug Development Life Cycle
Discovery
(2 to 10 Years)
Preclinical Testing
(Lab and Animal Testing)
Phase I
(20-30 Healthy Volunteers used to
check for safety and dosage)
Phase II
(100-300 Patient Volunteers used to
check for efficacy and side effects)
Phase III
(1000-5000 Patient Volunteers
used to monitor reactions to
long-term drug use)
$600-700 Million!
FDA Review
& Approval
Post-Marketing
Testing
Years
0
2
4
6
8
10
7 – 15 Years!
12
14
16
Drug lead screening
5,000 to 10,000
compounds screened
5 Drug Candidates
enter Clinical Testing;
80% Pass Phase I
250 Lead Candidates in
Preclinical
Testing
30%Pass Phase II
80% Pass Phase III
One drug approved by the FDA
Course Overview
CS 790 – Bioinformatics
8
What are we going to learn?
DNA, Proteins, life, and disease: an overview
Basic chemistry introduction/review
Basic biochemistry: proteins
Basic biochemistry: DNA, genes, and
molecular evolution (Dr. Dan Krane, Biological
Sciences)
Drug docking and screening, dealing with water
molecules: Dr. Raymer
Student presentations: techniques in
bioinformatics
Course Overview
CS 790 – Bioinformatics
9
Student Presentations
Students will each make 1 or 2 one-hour presentations
on topics in bioinformatics
• Tutorial
• Survey
• Research Paper
Each class, we’ll turn in either:
• A one-page summary of the previous presentation, or
• A mini-project assigned as part of the presentation.
We’ll talk more about this next time
Web page
Course Overview
CS 790 – Bioinformatics
10
DNA is the blueprint for life
Every cell in
your body has
23
chromosomes
in the nucleus
The genes in
these
chromosomes
determine all
of your
physical
attributes.
Image source: Crane digital, http://www.cranedigital.com/
Course Overview
CS 790 – Bioinformatics
11
Mapping the Genome
The human genome project has provided us
with a draft of the entire human genome.
• Four bases:
A, T, C, G
• 3.12 billion basepairs
• 99% of these are
the same
• Polymorphisms =
where they differ
Course Overview
CS 790 – Bioinformatics
12
How does the code work?
Template for construction of proteins
Course Overview
CS 790 – Bioinformatics
13
Proteins: Molecular machinery
Proteins in your muscles allows you to move:
myosin
and
actin
Course Overview
CS 790 – Bioinformatics
14
Proteins: Molecular machinery
Enzymes
(digestion, catalysis)
Structure (collagen)
Course Overview
CS 790 – Bioinformatics
15
Proteins: Molecular machinery
Signaling
(hormones,
kinases)
Transport
(energy,
oxygen)
Image source: Crane digital, http://www.cranedigital.com/
Course Overview
CS 790 – Bioinformatics
16
Example Case: HIV Protease
1. Exposure &
infection
2. HIV enters your cell
3. Your own cell reads
the HIV “code” and
creates the HIV
proteins.
4. New viral proteins
prepare HIV for
infection of other
cells.
© George Eade, Eade Creative Services, Inc.
http://whyfiles.org/035aids/index.html
Course Overview
CS 790 – Bioinformatics
17
HIV Protease & Inhibition
Course Overview
CS 790 – Bioinformatics
18
HIV Protease as a drug target
Many drugs bind to
protein active sites.
This HIV protease
can no longer
prepare HIV
proteins for
infection, because
an inhibitor is
already bound in
its active site.
HIV Protease + Peptidyl inhibitor (1A8G.PDB)
Course Overview
CS 790 – Bioinformatics
19
Drug Discovery
Target Identification
• What protein can we attack to stop the disease from
progressing?
Lead discovery & optimization
• What sort of molecule will bind to this protein?
Toxicology
• Does it kill the patient?
• Does it have side effects?
• Does it get to the problem spots?
Course Overview
CS 790 – Bioinformatics
20
Drug discovery: past & present
Put some of the infectious agent into thousands
of tiny wells
Add a known drug lead compound into each
well.
• Try nearly every drug lead known.
See which ones kill the agent…
• To small to see, so we have to use chemical tests
called assays
Course Overview
CS 790 – Bioinformatics
21
Finding drug leads
Once we have a target, how do we find some
compounds that might bind to it?
The old way: exhaustive screening
The new way: computational screening!
Course Overview
CS 790 – Bioinformatics
22
Drug Lead Screening & Docking
?
Complementarity
• Shape
• Chemical
• Electrostatic
Course Overview
CS 790 – Bioinformatics
23
Problems in Bioinformatcs
Genomics
• Gene finding
• Annotation
Sequence alignment and database search
• Functional genomics
Microarray expression, “gene chips”
Proteomics
• Structure prediction
Comparative modeling
• Function prediction
Structural bioinformatics
• Molecular docking, screening, etc.
Course Overview
CS 790 – Bioinformatics
24