Various Career Options Available

Download Report

Transcript Various Career Options Available

Opportunities in Bioinformatics
Presented By
Dr G. P. S. Raghava
Co-ordinator, Bioinformatic Centre
IMTECH, Chandigarh
Email: [email protected]
Web: http://imtech.res.in/raghava/
What is Bioinformatics (BI) ?
More About Bioinformatics
Historical Background
Media Hype & Confusion
Important Applications of BI
Bioinformatics in India
Demand of BI Professionals
How to Enter in BI (Course & Degrees)
What is Bioinformatics
– Biocomputing: Application of Computer in
Biosciences
– Biocomputing started in 1960’s
– Explosion of Genomic Data
– Access and Management of Data
– Biocomputing+Information Science
– Role of Internet in BI
Core of Bioinformatics
•Relationships between
TDQAAFDTNIVTLTRFVM
EQGRKARGTGEMTQLLNS
LCTAVKAISTAVRKAGIA
HLYGIAGSTNVTGDQVKK
LDVLSNDLVINVLKSSFA
TCVLVTEEDKNAIIVEPE
KRGKYVVCFDPLDGSSNI
DCLVSIGTIFGIYRKNST
DEPSEKDALQPGRNLVAA
GYALYGSATMLV
sequence
3D structure
protein functions
•Properties and evolution of genes, genomes,
proteins, metabolic pathways in cells
•Use of this knowledge for prediction, modelling, and
design
The challenge
(Boguski, 1999)
In 1995, the number of genes in the database started to exceed
the number of papers on molecular biology and genetics in the
literature!
More About Bioinformatics
Multiple
Sequence
Alignment
Database
Homology
Searching
Sequence
Analysis
Genome
Mapping
Protein
Analysis
Proteomics
Bio
Informatics
Sample
Registration &
Tracking
3D
Modeling
Homology
Modeling
Docking
Intellectual
Property
Auditing
Integrated
Data
Repositories
Common
Visual
Interfaces
Computational Biology
in the High-Throughput Era
The Genome and Beyond
Scientific Challenges
 Algorithmic Challenges
 Computational Challenges

Historical Background
Life Science - young compared to
physics and chemistry
 1953 Structure of DNA
 1960s Understanding of “code of life”
 1970s Genetic manipulation technology
 1980s Widespread innovation biotechnology/genetic revolution
 1990s Human Genome Project
 2000s Structural Genomics ?

Media Hype and Confusion
Anybody can do BI
 BI can do anything
 Colleges/Courses/Training
 No Quality Check
 Limited Knowledge of Subject
 More user than developer

Why Bioinformatcs is Required
Data growth is exponential
 Difficult to understand life without BI
 Detection of new diseases
 BI tools allow to save expr. Expend.
 Rational Drug design
 Computer-aided vaccine design

Application of Bioinformatics
Genome Annotation
 Protein Structure Prediction
 Proteomics
 DNA Chip technology
 Disease Diagnostics
 Fingerprinting Technique
 Drug/Vaccine Design

Genome Annotation
The Process of Adding Biology Information and
Predictions to a Sequenced Genome Framework
Protein Structures
Protein Structure Prediction

Experimental Techniques
– X-ray Crystallography
– NMR

Limitations of Current Experimental
Techniques
– Protein DataBank (PDB) -> 17000 protein structures
– SwissProt -> 90,000 proteins
– Non-Redudant (NR) -> 800,000 proteins

Importance of Structure Prediction
– Fill gap between known sequence and structures
– Protein Engg. To alter function of a protein
– Rational Drug Design
Traditional Proteomics
1D gel electrophoresis (SDS-PAGE)
 2D gel electrophoresis
 Protein Chips

– Chips coated with proteins/Antibodies
– large scale version of ELISA

Mass Spectrometry
– MALDI: Mass fingerprinting
– Electrospray and tandem mass
spectrometry


Sequencing of Peptides (N->C)
Matching in Genome/Proteome Databases
Overview of 2D Gel

SDS-PAGE + Isoelectric focusing (IEF)
– Gene Expression Studies
– Medical Applications
– Sample Experiments

Capturing and Analyzing Data
– Image Acquistion
– Image Sizing & Orientation
– Spot Identification
– Matching and Analysis
Comparision/Matcing of Gel Images

Compare 2 gel images
– Set X and y axis
– Overlap matching spots
– Compare intensity of spots

Scan against database
– Compare query gel with all gels
– Calculate similarity score
– Sort based on score
Mass Fingerprinting

Add protease (e.g. trypsin)
– Get fragment size of peptides

Scan against peptides of a protein
obtained theortically by that protease

Scan against all proteomes
DNA Chip Technology
Differential Proteomics:
Fingerprints of Disease
Normal Cells
Disease Cells
Phenotypic
Changes
•Differential protein expression
• Protein nitration patterns
•Altered phosporylation
•Altered glycosylation profiles
Utility
•Target discovery
•Disease pathways
•Disease biomarkers
Fingerprinting Technique

What is fingerprinting
– It is technique to create specific pattern for a given
organism/person
– To compare pattern of query and target object
– To create Phylogenetic tree/classification based on pattern

Type of Fingerprinting
–
–
–
–

DNA Fingerprinting
Mass/peptide fingerprinting
Properties based (Toxicity, classification)
Domain/conserved pattern fingerprinting
Common Applications
–
–
–
–
–
Paternity and Maternity
Criminal Identification and Forensics
Personal Identification
Classification/Identification of organisms
Classification of cells
Drug Design based on Bioinformatics Tools

Detect the Molecular Bases for Disease
– Detection of drug binding site
– Tailor drug to bind at that site
– Protein modeling techniques
– Traditional Method (brute force testing)

Rational drug design techniques
– Screen likely compounds built
– Modeling large number of compounds (automated)
– Application of Artificial intelligence
– Limitation of known structures


Search of Target protein
Search of Lead compound
History of Bioinformatics in India

Biocomputing started in 1950’s
– IISc Banglore (Prof G Ramachandran)
– Mostly analysis of protein structure

Distributed information center (DIC)
– DBT initiate 9 DICs during 1986-7
– National Facilities (IMT,IISc,IARI,JNU,MKU)
– Sub-DICs started (around 50)
– Mirror sites in 1999 (IMT,Pune,JNU,IISc)
Education in Bioinformatics

Role of BIC’s in education
– Workshops, training, course etc started
– Facilities/Infrastructure in BI
– Advanced diploma in BI (Pune,JNU,MKU)
– M.Sc. In bioinformatics

Private Sector
– Number of courses initiated
– Dedicated training centers
Universities
 R&D Institutes

– Ph.D in Bioinformatics (IMT)
Business Comparisons
Company
Chase-Manhattan
AMR Corporation
Nation’s Bank
Sprint
IBM
MCI
Microsoft
United Parcel
Revenues
IT Budget
Pct
16,431,000,000 1,800,000,000 10.95 %
17,753,000,000 1,368,000,000 7.71 %
17,509,000,000 1,130,000,000 6.45 %
14,235,000,000
873,000,000 6.13 %
75,947,000,000 4,400,000,000 5.79 %
18,500,000,000 1,000,000,000 5.41 %
11,360,000,000
510,000,000 4.49 %
22,400,000,000 1,000,000,000 4.46 %
Bristol-Myers Squibb 15,065,000,000
Pfizer 11,306,000,000
Pacific Gas & Electric 10,000,000,000
Wal-Mart104,859,000,000
K-Mart 31,437,000,000
440,000,000
300,000,000
250,000,000
550,000,000
130,000,000
2.92 %
2.65 %
2.50 %
0.52 %
0.41 %
Typical Bioinformatics
Multi-Disciplinary Training
•Scientists
– Biology, Molecular Genetics, Clinical Biochemistry,
Protein Structure Chemistry
•Mathematicians
– Statistics, Algorithms, Image processing
•Computer Scientists
– Database, User Interface/Visualizations, Networking
(Internets/Intranets), Instrument Control
Typical Bioinformatics
Multi-Disciplinary Functions
•Scientists
– Experimental Design & Interpretation
– Laboratory Protocols & Standards/Controls
•Mathematicians
– Analysis & Correlation of Data
– Validation methodologies
•Computer Scientists
– Information Storage / Control Vocabulary
– Data Mining
Bioinformatics Architecture
Users
Workstation
NT
servers
Unix
servers
&
Specialized
Hardware
Web
Browser
MS
Access
Shared
Access
Databases
Proprietary
Internal
Databases
External
External
Proprietary
Public
Databases Databases
Active Livewire
Server
Java & Desktop
Programs
CGI
Web
Server
Business Opportunities in BI
Software development
 Web servers development
 Train manpower in Field of BI
 Database management
 Rational Drug design
 Develop Diagnostic kits
 Assist user in Vaccine development
 Consultant to Biotech Companies

Bioinformatics at IMT, Chandigarh
http://imtech.res.in/bic/
http://imtech.res.in/

Mirror Sites (http://www.imtech.res.in/mirror_sites/)

Public Domain Resources in Biology (www.imtech.res.in/)






IMTECH Library on Internet (/lib/)
Concept of vaccine design
Protein Structure Prediction (Olympic-2000)
Gene Prediction
Software for general use
– GNU software
– SUN Freeware
– PostgreSQL
Site: http//:imtech.res.in/raghava/www.html
THANK YOU!