Transcript Slide 1

Searching BioCyc
Ron Caspi
SRI International Bioinformatics
Help is One Click Away!
The Web Account System
The Web Account System
Creating a web account enables you to:
 Save Object Groups
 Define page formatting preferences
 Define Overview layout preferences
 Save organism groups for comparative analysis
Save Organism Groups with Web Accounts

Note the My Lists tab on the multi-organism selector for comparative analyses.

When you perform comparative analyses, you can easily save groups of organisms for reuse at a later time.
Define a Favorite Database with Web Accounts
If you create a web account, you can define a favorite database that will be opened by default when you log in.
Searching
Why the Need for Dedicated Search Tools
Searching BioCyc for “L-arginine” returns 2080 results.
We need specific tools for finding exactly what we are searching for.
BioCyc Searches

Multiple searches are available for finding information in different ways

The easiest searches to use are fairly coarse

Start by selecting the database to search

Simplest search: Quick Search
 At upper right of most pages
Selecting the Database
You can only search one database at a time*!
* With the exception of Google searches

Click the word “change” under the Search menu or under the Quick Search button

In the resulting selector, choose a PGDB:
 Start typing a word in the organism name
 Click a letter to navigate to organisms starting with that letter
 Click a frequently used PGDB
 Select by taxonomy

All subsequent searches will apply to that database
The Quick Search Box

What you can type here:
 Gene name (dnaA)
 Compound name (L-lysine)
 Pathway name (peptidoglycan biosynthesis)
 Reaction name (lysine decarboxylase)
 Protein name (peptidase)
 EC number (1.3.1.26)
 Organism name (Escherichia coli)
 Frame ID (CPLX-8024)
 GO term (0006086)
 Links to other databases (O33998)
 An exact term, using the format “Peptidase D search:exact”
 A term limited to one object type, using the format “hydrogen type:compound”

What doesn’t work:
 Exact text using the Google format (“peptidase D”)
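The two Quick Search modifiers listed on this slide are plain text appended to the search term. A minimal sketch of composing such query strings follows. The helper `build_quick_search_query` and its parameters are hypothetical and not part of any BioCyc API. Only the `search:exact` and `type:compound` forms come from the slide, and whether the two modifiers can be combined in one query is not stated there.

```python
from typing import Optional


def build_quick_search_query(term: str,
                             exact: bool = False,
                             object_type: Optional[str] = None) -> str:
    """Compose the text to type into the Quick Search box (hypothetical helper)."""
    parts = [term.strip()]
    if exact:
        # Form shown on the slide: "Peptidase D search:exact"
        parts.append("search:exact")
    if object_type:
        # Form shown on the slide: "hydrogen type:compound"
        parts.append(f"type:{object_type}")
    return " ".join(parts)


print(build_quick_search_query("Peptidase D", exact=True))
print(build_quick_search_query("hydrogen", object_type="compound"))
```

The output strings are meant to be pasted into the Quick Search box by hand; the sketch does not contact the website.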
Quick Search Results
 Results are divided into multiple categories
Examples of searches performed by users of the BioCyc website:

Successful:
 Ascorbate
 EC 3.4.17.5
 Sigma factor
 Polysulfide reductase
 Entner-Doudoroff pathway
 Cyanobacteria
 DnaA

Unsuccessful:
 pheV
 Transmembrane helix 6
 3.4.24.B11
 ABC cobalt transporter
 affinity of DnaA

A simple auto-correction mechanism tries to correct typos. For
example, searching for “sacrosine” will find “sarcosine”.
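
The slides do not say how the auto-correction works. Purely as an illustration, one common approach to typo correction is approximate string matching against a list of known names. The sketch below uses Python's standard `difflib` module with a small hypothetical vocabulary; it is not BioCyc's actual mechanism.

```python
import difflib

# Hypothetical vocabulary; a real site would draw on its own list of names.
KNOWN_TERMS = ["sarcosine", "ascorbate", "L-arginine", "L-lysine"]


def suggest_correction(query, vocabulary=KNOWN_TERMS, cutoff=0.8):
    """Return the closest known term, or None if nothing is similar enough."""
    matches = difflib.get_close_matches(query, vocabulary, n=1, cutoff=cutoff)
    return matches[0] if matches else None


print(suggest_correction("sacrosine"))  # sarcosine
```

The `cutoff` value is an arbitrary choice for this sketch: a higher value rejects more distant guesses, a lower one accepts them.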
Quick Gene Search
 Useful when only interested in genes.
 For example, compare the results when searching for “dnaA” by using the Quick Search and Gene Search buttons.
The Search Menu

Search Menu
 Object-specific searches
 Advanced search
 Ontologies search
 Google search
 BLAST search
 Search of full-text articles (EcoCyc only)
Google This Site
The BioCyc site is indexed by Google
You can launch a Google text search from:
1. Search → Google This Site
2. The alternative searches box that appears on Quick
Search results pages
Object-Specific Searches


The first four items in the Search menu provide a medium-level search interface against single types of objects

Use of filtering:
 Click the triangles at the left to expand or hide filters
 Note that if a filter is hidden, it will not be used in a search
Compound Search
 List All buttons: a quick way to get complete lists
 Examples for compound searching:
Search Genes/Proteins/RNAs
 List All buttons: a quick way to get complete lists
 Extensive filtering options
Search Pathways
Advanced Search
 The BioVelo query language
 SAQP: Structured Advanced Query Page
  Permits the definition of complex searches without mastering BioVelo.
To learn more about the advanced query interface, see the online documentation.
Sequence Search by BLAST
 Nothing unusual here: a regular BLAST interface that permits BLASTing sequences against BioCyc PGDBs.
 The results are linked to the PGDB gene/protein pages
Growth Media and Phenotype

The desktop version of Pathway Tools allows definition of growth media, gene knockout growth information, and growth data for phenotype microarray plates.
EcoCyc-Specific Searches: Growth Media
Search for growth media based on:
 name
 compounds present
 compounds not present
 observed growth
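
The four search criteria above combine naturally as independent filters. The following is a minimal sketch of that logic only, not of how EcoCyc implements the search: the `GrowthMedium` class, the `search_media` function, and the two example media are hypothetical and invented for illustration.

```python
from dataclasses import dataclass
from typing import FrozenSet, Iterable, List, Optional


@dataclass(frozen=True)
class GrowthMedium:
    name: str
    compounds: FrozenSet[str]
    growth_observed: bool


def search_media(media: Iterable[GrowthMedium],
                 name_contains: Optional[str] = None,
                 present: Iterable[str] = (),
                 absent: Iterable[str] = (),
                 growth: Optional[bool] = None) -> List[GrowthMedium]:
    """Keep only the media that satisfy every criterion that was supplied."""
    required, excluded = set(present), set(absent)
    results = []
    for medium in media:
        if name_contains is not None and name_contains.lower() not in medium.name.lower():
            continue
        if not required <= medium.compounds:   # a required compound is missing
            continue
        if excluded & medium.compounds:        # an excluded compound is present
            continue
        if growth is not None and medium.growth_observed != growth:
            continue
        results.append(medium)
    return results


# Made-up example data, not taken from EcoCyc.
MEDIA = [
    GrowthMedium("medium A", frozenset({"glucose", "ammonium"}), True),
    GrowthMedium("medium B", frozenset({"glycerol", "ammonium"}), False),
]

hits = search_media(MEDIA, present=["ammonium"], absent=["glucose"])
print([m.name for m in hits])  # ['medium B']
```

Criteria left at their defaults are simply skipped, so a search by name alone or by observed growth alone works the same way.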
EcoCyc-Specific Searches: Textpresso

Mining E. coli literature poses special challenges, because almost every molecular biology paper references E. coli

The solution: EcoCyc Textpresso! An E. coli-only collection of literature

30,000 full-text articles and 6,500 abstracts.

Full-text literature searches

Results presented at bottom of page