PPT - Bioinformatics Research Group at SRI International

Download Report

Transcript PPT - Bioinformatics Research Group at SRI International

Navigation to Related Objects
Mario Latendresse
October 2010, Pathway Tools Workshop
Bioinformatics Research Group
SRI International
[email protected]
1
SRI International Bioinformatics
Available on the Web




2
Applies for compounds, genes, proteins,
reactions, pathways
Once such an object is found, a new menu
appears on the right on the top menu bar
The menu offers to search the object found:
1. In all databases
or
2. Another database
The “In all databases” is demanding (BioCyc
has over 1000 databases); more so for genes
and proteins
SRI International Bioinformatics
Does this Object Exist in Other PGDBs?
3
SRI International Bioinformatics
How is Similarity Computed?
 For
pathways, reactions, and compounds: based
on frame-ids (unique identifiers)
 For
proteins and genes: based on names,
synonyms, and orthologs
 The result page tells you, for each database, if found based
on name/synonyms or orthologs, or both
4
SRI International Bioinformatics
Demo
on
the
“another database”
1.
2.
3.
4.
5.
5
Web
At BioCyc.org, select E. coli K12 as the
database to search
Search for the glycolysis pathway using the
Search->Pathways command
Once the glycolysis pathway is displayed,
Select the command Pathway->Show this
pathway in another database
Select the database to search (e.g., MetaCyc)
If it exists in the other database, the glycolysis
pathway is displayed for this database
SRI International Bioinformatics
of
Demo on the Web of “in all databases”
1.
2.
3.
4.
6
At BioCyc.org, select E. coli K12 as the
database to search
Search for compound N-methyltryptophan.
Once the compound is displayed, select the
command Compound->Show this compound in
all databases
About 28 databases (the number depends on
the current version of BioCyc.org) are found
containing that compound
SRI International Bioinformatics
Result for “In all databases” for a
Compound
7
SRI International Bioinformatics
For Genes and Proteins
 The
search for similar proteins and genes is
based on names (name and synonyms) and
orthologs
 The result page gives the reasons of the match
8
SRI International Bioinformatics
Result “In all databases” for a Gene
9
SRI International Bioinformatics
Implementation
The two commands are implemented using
BioVelo. A general purpose query language for
Pathway Tools
The query itself can be saved since it appears as a
URL in the browser
10
SRI International Bioinformatics
Implementation (2)
 Searching
all databases for a pathway
[ ( Z1,r ): Z1<-dbs, r := Z1~GLYCOLYSIS ]
 Searching
all databases for a compound
[ (Z1,r,r^Chemical-formula): Z1 <- dbs, r:=Z1~PEPTIDE-TRYPTOPHAN ]
 Searching
all databases for a reaction
[ (Z1,r,r^EC-number): Z1 <- dbs, r := Z1~L-IDITOL-2-DEHYDROGENASE-RXN ]
 Searching
all databases for a gene
[ r : l1 := find-objects(META,ECOLI~EG11024),
l2 := find-orthologs(META,ECOLI~EG11024),
both := l1**l2 , ort := l2 -- both, sim := l1 -- both,
r <- … both … ort … sim …]
11
SRI International Bioinformatics
Navigation to Related Objects
Questions?
12
SRI International Bioinformatics