Macromolecular Structure Database group

Download Report

Transcript Macromolecular Structure Database group

EMBL-EBI
Adel Golovin
MSDsite
The project is funded by the European Commission as the TEMBLOR, contract-no. QLRI-CT2001-00015 under the RTD programme "Quality of Life and Management of Living Resources"
EMBL-EBI
MSDsite
EMBL-EBI
Introduction
 Many protein (and some nucleic acids) structures have
evolved as a framework for an active site – where
chemistry takes place.
 The active site is critical for the understanding the action of
proteins and their role within the cell mechanism.
 The function, of these macromolecules can best be
studied by detailed 3-dimensional knowledge of these
active sites.
 Because the active site is a local 3D environment.
EMBL-EBI
Ligand environment
 The 3D / 1D position that defines the protein
function
 disparate amino acids on the 1D sequence
 A localized volume in 3D
 Where chemistry/binding happens.
EMBL-EBI
MSD site database
 The MSD site database contains local structural
information around all the bound ligands within the PDB
 All proteins deposited to the PDB are checked for
ligand(s). These are HETATM & non-protein/non-nucleicacid/non-water residues.
 Interactions are stored within the DB
 The bound ligand coordinates & associated information
 The ligand environment (i.e. any residue that interacts)
 The interaction types between the ligand and environment.
 All sequences are checked for the registered ProSite
patterns and matches are stored in the DB
 Fast algorithm for a pattern search is embedded into the
DB
EMBL-EBI
Site definitions
 Sites are defined as a ligand environment when a
ligand is presented.
 Sequence motifs matching is applied for site definition
independently from ligands.
 ProSite sites
http://www.expasy.org/prosite
 Catalytic sites atlas
http://www.ebi.ac.uk/thornton-srv/databases/CSA/
 Merops active sites
http://merops.sanger.ac.uk/
EMBL-EBI
Environment
 Covalent Bonds
 Coordinate bonds
 Hydrogen bonds
Structure
 Planes
DB
 Non-bonding
Site DB
 Electrostatics
 Di-Sulphide bonds
 Geometry (distance,
angles..)
Ligand
environment
analysis
PHE
N
A
S
P
O
S
PHE
V
A
L
EMBL-EBI
Possible site searches
 Define search by ligand
 Define search by sequence motif (pattern)
 Define search by metal site geometry
 Define search by environment
 has same environment
 has similar environment
EMBL-EBI
Search options
Structure
Protein
Ligand
Define
target:
equal/like
Filter by
PDB header:
equal/like
Sequence
motif
Search
criteria
Metal
site
geometry
Environment
exact/sub set
EMBL-EBI
Statistics search
Ligand
binding
Atomic bonds
Interactive
Charts
Environment
binding
Sequence
pattern
binding
EMBL-EBI
Search method
Search front
form
Statistics
search
Result list
Detail views
EMBL-EBI
MSDsite interface
Query
editor
Filters
EMBL-EBI
Query generation
HIS|SER:S/H>C2.0 HIS.NE2:S/S>C2.0 HIS.[N]/T>C2.0
Drawing
Complex queries
secondary structure
Distance cut-offs
Bond type
Side/main chain
EMBL-EBI
Search service aims
 Form a site search criteria
 Provide a hit list
Visualize the found sites
XML technology (xpath,..) to search an XML
corresponding to a PDB entry from the hit
list.
EMBL-EBI
Example
Pattern is defined as : HIS-any-HIS-GLU
EMBL-EBI
Hit List Help
EMBL-EBI
Hit List Visualization tools
AstexViewer@MSD-EBI :
Default PDB viewer: chemical/x-pdb.
RasMol script viewer: application/x-rasmol.
EMBL-EBI
Searched site visualization
AstexViewer[tm]
Groups menu
EMBL-EBI
Hit List Link to Details
Bound molecules form
Goto Site details
Sequence from
Atomic bonds
EMBL-EBI
Detail forms header
Links to the other
views of the same
PDB entry
Ligand binding
statistics
Switch on/off
undefined bonds
EMBL-EBI
Sequence form
Show site
search for PDB entries with the
same hetero ID and similar
environment
EMBL-EBI
Bound molecules form
Ligands that
match a search
creteria
Environment
statistics
EMBL-EBI
Environment binding statistics
Filters
HIS HIS HIS
Trnasformed to
HIS>C HIS>C HIS>I
EMBL-EBI
Bound molecules form
Bonds details
on atomic level
EMBL-EBI
Atomic bonds view
Atomic bonds
statistics
EMBL-EBI
Atomic bonds statistics
EMBL-EBI
Detail forms header
Ligand binding
statistics
EMBL-EBI
Ligand binding statistics
Filters
Distribution
type
EMBL-EBI
Ligand binding statistics –
distribution by whole
environment
EMBL-EBI
Pattern binding statistics
Pattern: H-x-H-E
EMBL-EBI
3D aligned views
 Superimposed by ligand
 Superimposed by environment
 Superimposed by pattern
 Superimposed by active site
EMBL-EBI
3D aligned views
EMBL-EBI
3D aligned views – EBIAstexViewer[tm]
EMBL-EBI
EMBL-EBI
Summary
 DB of active sites has been generated
 Application server has been developed
 The DB can be searched
 Based on properties of the ligand environment
 Based on properties of the ligand
 Based on properties of structure and sequence of the parent
protein
EMBL-EBI
Future sites
Data generated by data mining of common
packing of chemical groups is being added.
(Proteins S.F.G, 49:510-528, T.J.Oldfield). This
information contains about 2000 sites, 1/3 with
ligands.