Transcript PPT

EMBL-EBI
MSD Search and
Visualization tools
Jawahar Swaminathan
http://www.ebi.ac.uk/msd
EMBL-EBI
Issues
 The raw database is large and complex:
27,190+ PDB entries
120+ tables in the warehouse, many very large
Cross-referenced against UniProt, PubMed...
 Need to expose as much of the data as possible, without
making the interface too complex
 We want to cater for three categories of user:
"Novice" user
Experienced user
Expert user
http://www.ebi.ac.uk/msd
EMBL-EBI
biobar
A toolbar search application for
Mozilla/Netscape or firefox browsers
http://www.ebi.ac.uk/msd
EMBL-EBI
Biobar (http://biobar.mozdev.org)
http://www.ebi.ac.uk/msd
EMBL-EBI
biobar
 All major bioinformatics databases covered.
 Search genomic, proteomic, structural, literature
and functional databases.
 Links to deposition and analysis tools for
sequence and structural data.
http://www.ebi.ac.uk/msd
EMBL-EBI
MSDlite
A simple form-based query system to
search the MSD Databases
http://www.ebi.ac.uk/msd
EMBL-EBI
MSDlite
http://www.ebi.ac.uk/msd
EMBL-EBI
MSDlite
http://www.ebi.ac.uk/msd
EMBL-EBI
The Atlas Pages
http://www.ebi.ac.uk/msd
EMBL-EBI
The Atlas: Ligands
http://www.ebi.ac.uk/msd
EMBL-EBI
The Atlas: Sequence
http://www.ebi.ac.uk/msd
EMBL-EBI
AstexViewer™@MSD-EBI
 View structures as
wireframe, backbone or
ribbons
 Built-in sequence
viewer
 Calculate and display
surfaces
 Various display
options:
 Ramachandran plots
 Distance matrix
Based on the AstexViewer™ from Astex Technology Limited
 B-factors
and modified under licence by the MSD group
http://www.ebi.ac.uk/msd
EMBL-EBI
Simple search interface
 Strengths:
 simple, easy to use form
 allows multiple search fields to be combined
 relatively fast, despite performing quite complex SQL queries
 Weaknesses:
 not exposing the power of a relational database
 user can't specify the relationship between search fields:
  "name" AND "title" AND "keyword"
  "name" OR "title" OR "keyword"
  ( "name" OR "title" ) AND NOT "keyword"
 the search form is defined by the authors of the search system,
not the author of a query
http://www.ebi.ac.uk/msd
EMBL-EBI
Describing complex searches
 We want to allow the user to entirely control their query
 Since HTML forms are inherently static, we'll use an
applet to provide a dynamic "form" that will let the user:
 choose the fields to be searched
 specify the relationships between search fields
 choose the result fields and how results are
presented
 perform "complex" sub-queries e.g. SSM, FASTA
http://www.ebi.ac.uk/msd
EMBL-EBI
A graphical database search
system
 MSDpro uses an applet for constructing queries
and a server to execute them
 Avoids the need for the user to understand a
complex database schema or know SQL
 The user describes their query entirely
graphically, including logical operations such as
AND, OR and NOT
 Applet generates an XML description of the
user’s query, which is sent to the MSD query
server and converted to SQL automatically
http://www.ebi.ac.uk/msd
EMBL-EBI
MSDpro
A flexible graphical search interface
for advanced searching
http://www.ebi.ac.uk/msd
EMBL-EBI
http://www.ebi.ac.uk/msd
EMBL-EBI
http://www.ebi.ac.uk/msd
EMBL-EBI
Automatic SQL query generation
 The query server is a Java servlet:
 accepts a query description as XML
 converts the user’s query description into a
true SQL query, which is then submitted to the
search database
 Searches can include components that are
executed outside of the database, e.g.
sequence similarity, determined using
FASTA or structural similarity, determined
using SSM
http://www.ebi.ac.uk/msd
EMBL-EBI
Search system is generic
 The search system is designed to be entirely
database-independent
 All information about the architecture of the
search database is stored in XML dictionaries
 Similarly, the search and result fields which the
applet presents to the user are controlled by a
dictionary
 The entire system could move to a completely
different database simply by modifying the
dictionaries
http://www.ebi.ac.uk/msd
EMBL-EBI
Java server
http://www.ebi.ac.uk/msd
EMBL-EBI
Java server architecture
Methods
DB and
external object
ontology
User
interface
DB
Methods
Interface
Ontology
http://www.ebi.ac.uk/msd
EMBL-EBI
Web-services
Some of the new services from MSD are
designed as web-services:
 web-services are network-based services with
published method signatures
 can be accessed via the SOAP protocol from
any language with a SOAP library, via http
 The same services used within MSDpro will be
accessible to any SOAP client
 The MSD query engine will also be available as
a web-service, allowing users to submit
queries programmatically
http://www.ebi.ac.uk/msd
EMBL-EBI
http://www.ebi.ac.uk/msd/
http://www.ebi.ac.uk/msd