BioWG-update_Habibah&Wilfred


PRAGMA 10
Biosciences Working
Group Update
Habibah Wahab, Ph.D.
Wilfred W. Li, Ph.D.
On behalf of
Karpjoo Jeong, Ph.D.
Key Activities
• Bioinformatics
  - mpiBLAST-G2
  - iGAP/Gfarm/CSF4
• Avian Flu Project
• Metagenomics Annotation
• Computational Chemistry
• Biosciences Portal
  - M*Grid
  - NCHC Portal
  - My WorkSphere
  - Telescience
  - AMEXg
  - APAC portals

Education and Training
• PRIME
  - CNIC – Kai Nan, Zhong-hua Lu
  - University of Zurich
• PRIUS
  - Osaka University: hosting 2 students from UCSD on Avian flu projects; Kohei Ichikawa, Susumu Date
• Summer Internship Program
  - Jilin University: Zhaohui Ding, Xiaohui Wei
2
Publications
[1] X. Wei, J. Jiang, W. W. Li, O. Tatebe, G. Xu, L. Hu, and J. Ju, "Implementing Data Aware Scheduling and Data Management in Gfarm using LSF™ Scheduler Plugin Mechanism," Future Generation Computer Systems, submitted, 2006.
[2] X. Wei, Z. Ding, W. W. Li, O. Tatebe, J. Jiang, L. Hu, and P. W. Arzberger, "Grid Infrastructure for Bioinformatics Applications Based on CSF4," Future Generation Computer Systems, submitted, 2006.
[3] W. W. Li, S. Krishnan, K. Mueller, K. Ichikawa, S. Date, S. Dallakyan, M. Sanner, C. Misleh, Z. Ding, X. Wei, O. Tatebe, and P. W. Arzberger, "Building cyberinfrastructure for bioinformatics using service oriented architecture," CCGrid 2006, Singapore, 2006.
[4] D. Abramson, A. Lynch, H. Takemiya, Y. Tanimura, S. Date, H. Nakamura, K. Jeong, S. Hwang, J. Zhu, Z.-h. Lu, C. Amoreira, K. K. Baldridge, H.-C. Lee, C.-W. Wang, H.-L. Shih, T. Molina, W. W. Li, and P. W. Arzberger, "Deploying Scientific Applications to the PRAGMA Grid testbed: Strategies and Lessons," CCGrid 2006, Singapore, 2006.
3
mpiBLAST-G2
4
Protein sequences
• structure info: SCOP, PDB
• sequence info: NR, PFAM
Building FOLDLIB:
• PDB chains
• SCOP domains
• PDP domains
• CE matches PDB vs. SCOP
• 90% sequence non-identical
• minimum size 25 aa
• coverage (90%, gaps <30, ends <30)
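The FOLDLIB size and redundancy criteria can be illustrated with a minimal sketch. It assumes that "90% sequence non-identical" means no two library entries share 90% or more identity, and it uses Python's difflib ratio as a crude stand-in for a real alignment-based identity score. The function names and thresholds as constants are hypothetical, and the coverage criteria are not modelled.

```python
from difflib import SequenceMatcher
from typing import List

MIN_LENGTH = 25      # "minimum size 25 aa" from the slide
MAX_IDENTITY = 0.90  # assumed reading of "90% sequence non-identical"


def identity(a: str, b: str) -> float:
    """Crude similarity score in [0, 1]; NOT a real sequence alignment."""
    return SequenceMatcher(None, a, b, autojunk=False).ratio()


def build_library(domains: List[str]) -> List[str]:
    """Greedily keep domains that are long enough and not redundant."""
    kept: List[str] = []
    for seq in domains:
        if len(seq) < MIN_LENGTH:
            continue  # too short to enter the library
        if any(identity(seq, k) >= MAX_IDENTITY for k in kept):
            continue  # redundant with an entry that was already kept
        kept.append(seq)
    return kept
```

A greedy pass keeps the first representative of each group of near-identical sequences, so the result depends on input order; a production build would more likely rely on a dedicated clustering tool.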
Integrative Genome Annotation Pipeline (iGAP)
Step 1: Prediction of signal peptides (SignalP, PSORT), transmembrane regions (TMHMM, PSORT), coiled coils (COILS), and low complexity regions (SEG)
Step 2: Structural assignment of domains by WU-BLAST
Step 3: Structural assignment of domains by PSI-BLAST profiles on FOLDLIB
Step 4: Structural assignment of domains by 123D on FOLDLIB
Step 5: Functional assignment by PFAM, NR assignments
Step 6: Domain location prediction by sequence
Data Warehouse
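The ordered stages listed on this slide can be sketched as a simple sequential runner that collects each stage's output into one record, standing in for the data warehouse. This is only an illustration: the stage callables are hypothetical placeholders that do not invoke any of the named tools, and the stage names are shortened labels, not iGAP identifiers.

```python
from typing import Callable, Dict, List, Tuple

# A stage takes a protein sequence and returns a free-form result string.
Stage = Callable[[str], str]


def placeholder(tool: str) -> Stage:
    """Return a stand-in stage; a real one would invoke the named tool."""
    def run(sequence: str) -> str:
        return f"{tool}: pending for {len(sequence)} aa"
    return run


# Stage order follows the slide; the callables are hypothetical placeholders.
STEPS: List[Tuple[str, Stage]] = [
    ("feature prediction", placeholder("SignalP/PSORT/TMHMM/COILS/SEG")),
    ("WU-BLAST structural assignment", placeholder("WU-BLAST")),
    ("PSI-BLAST structural assignment", placeholder("PSI-BLAST on FOLDLIB")),
    ("123D structural assignment", placeholder("123D on FOLDLIB")),
    ("functional assignment", placeholder("PFAM/NR")),
    ("domain location prediction", placeholder("sequence-based")),
]


def annotate(sequence: str) -> Dict[str, str]:
    """Run every stage in order and collect results keyed by stage name."""
    warehouse: Dict[str, str] = {}
    for name, stage in STEPS:
        warehouse[name] = stage(sequence)
    return warehouse
```

Keeping the stages in a plain list makes the order explicit and lets a scheduler dispatch the same list per sequence on any node.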
5
Distributed analysis in a virtual filesystem
From Cluster-wide to Grid-wide environment
Gfarm File System: transparent distributed data access and file affinity-based application scheduling
Virtual Directory Tree (diagram labels): /gfarm/eol/apps; apps; igap; dbs; psiblast; Foldlib; NR
• The Gfarm virtual filesystem allows existing applications to utilize distributed compute and data resources transparently and efficiently.
• Applications such as iGAP and their required input data may be automatically replicated to each node on demand.
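File affinity-based scheduling with on-demand replication can be sketched as follows. This is a toy model under stated assumptions, not the Gfarm or CSF4 API: the replica catalogue, the load table, the node names and the file names are all hypothetical. A job goes to the least-loaded node that already holds its input file; if no node holds the file, the least-loaded node is chosen and a replica is recorded there.

```python
from typing import Dict, Set


def pick_node(input_file: str,
              replicas: Dict[str, Set[str]],
              load: Dict[str, int]) -> str:
    """Choose a node for a job that reads input_file.

    replicas maps a file name to the nodes holding a copy (hypothetical
    catalogue); load maps a node name to its number of running jobs.
    """
    holders = replicas.get(input_file, set()) & set(load)
    # Prefer nodes that already hold the file (file affinity).
    candidates = holders if holders else set(load)
    # Sorting first makes ties break deterministically by node name.
    node = min(sorted(candidates), key=lambda n: load[n])
    if node not in holders:
        # Model on-demand replication by recording a new copy.
        replicas.setdefault(input_file, set()).add(node)
    load[node] += 1
    return node
```

For example, with `replicas = {"nr.db": {"nodeB"}}` and `load = {"nodeA": 0, "nodeB": 2}`, a job reading `nr.db` is placed on `nodeB` despite its higher load, while a job reading an unreplicated file is placed on `nodeA` and the file is recorded there.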
6
PRAGMA Gfarm Testbed
• Taiwan: NCHC, Academia Sinica
• USA: NCSA, SDSC, NBCR
• Japan: AIST, Titech
• Korea: KISTI
• China: CNIC, JLU
7
CSF4 integration with Gfarm
• Gfarm security: shared secure key, GSI authentication
• User credentials: user certificate, delegate, proxy certificate
• CSF4: Frontend and Scheduler A, Frontend and Scheduler B
• GFS, mutual authentication
8
Opal: Web Service Wrapper
9
Opal WSRF Operation Provider
10
M*Grid and e-Glycoconjugates portal
• Reusable components to support a large community
• Comprehensive environment for molecular simulation studies
11
Computational Chemistry
• Use of Nimrod/G
• Workflow built with web services
• Gemstone
• Led by Baldridge
12
13
14
ASCC, IOIT-HCM
User interface: upload files/submit jobs; download & view results
Hawk, Rocks-52, Aurora
15
Real Science Applications
• Rational Drug Discovery of Novel Dengue Therapeutics.
• Characterisation of drug binding site(s) on the DNA.
• Elucidating isoniazid resistance using Molecular Modelling Techniques.
• Structure and function of PHA synthase; drug receptor database.
• Binding mode of andrographolide to Renin, HIV-1 Protease and Tyrosine kinase enzymes.
• Binding of erythromycin and its relatives to ribosome. Molecular Docking and Molecular Dynamics Simulation Study.
• Investigation of the Binding Properties of Some Flavonoids to Calcium using Molecular Modelling Techniques.
• Molecular Modelling of Cytochrome P450 2D6. Effects of Allelic variation on the enzyme activity.
• Structure based drug design of compounds derived from marine natural products.
• Chemical Reactivity as a Tool to Study Carcinogenicity: Reaction between Estradiol and Estrone 3,4-Quinones Ultimate Carcinogens and Guanine.
16
New Collaboration in the Fight Against Avian Flu
• AIST (Japan), CNIC (China), Konkuk/KISTI (Korea), UCSD/SDSC (USA), JLU (China), CGPBRI (Univ. Hawaii), USM (Malaysia)
• Solving real problems using bioinformatics, molecular simulation and grid tools
• IBM World Community Grid
• Avian Flu Proteome Annotation and Analysis
  - iGAP
  - Rosetta
  - MEME
• AutoDock, Amber, Gromacs, GAMESS, CHARMM, NAMD
• Involve students and postdocs
17
Pictures
18
Participating Institutions
• SDSC/UCSD: Wilfred Li, Tomas Molina, Cindy Zheng, Peter Arzberger
• Jilin University: Zhaohui Ding, Xiaohui Wei
• AIST: Osamu Tatebe, Hiroshi Takemiya, Yusuke Tanimura, Satoshi Sekiguchi
• Osaka University: Susumu Date, Kohei Ichikawa, Shinji Shimojo
• Konkuk: Karpjoo Jeong, Taehoon Kim
• Kookmin: Suntae Hwang, Daeyong Heo
• KISTI: Jae-Hyuck Kwak, Young-Chul Hwang
• APAC: Rajesh Chhabra
19
Participating Institutions
• USM: Habibah Wahab, Amin Malik Shah, Chan Huah Yong
• Academia Sinica: Hurng-Chun Lee, Chi-Wei Wang, Horng-Liang Shih
• University of Zurich/SDSC: Kim Baldridge
• CNIC: Zhong-Hua Lu, Kai Nan, Bao Ping Yan
• University of Wisconsin: Katherine (Trina) McMahon
• NCHC: Fang-Pang Lin, Whey-Fone Tsai, Weicheng Huang
• BII: Santosh Mishra, Arun Krishnan
• Other Working Groups: Mason Katz, Yoshio Tanaka, Shinji Shimojo
20
Breakout Session Participants
• USM: Habibah Wahab, Ahmad Yussof Hassan, Amin Malik Shah Abdul Majid
  - Drug and DNA interactions
  - Drug design
• UCSD/SDSC
  - Wilfred Li: Gfarm, additional applications
  - Kim Baldridge: GAMESS, Gemstone, GAMESS/APBS hybrid pipeline
• CNIC: Xiaoming Zhang
• Mimos: Mashkuri Yaacob, Irdawah Ab. Rahman
• Osaka University
  - Kohei Ichikawa: Web services
  - Susumu Date
• ASCC: Hsin-Yen Chen
  - Bioportal, MPICH-G2, LCG, EGEE
  - Docking
• TDW
• APAC: Rajesh Chhabra
  - Grid portals
21
Portals
• Biosciences portal
• Wiki already set up
  - PRAGMA wiki – http://auriga.qut.edu.au/pragma
  - set up a PRAGMA portal and wiki
• AMEXg
• Link to all sites with available applications
• Much detail in VMD (KB)
• One way to install
• Tiled Display
• For users to try
• Could not see before
• Gfarm testbed
• Other technologies
22
Communications
• Biosciences mailing list
  - [email protected]
• msn, skype
• Contact info listed.
Application stack
• APBS
• AutoDock
• Amber
• GAMESS
• Pipelines
• Applications compiled for different architectures
• With examples
• Central site
• Complaints about heterogeneity of resources
• Shared installations
23
Avian flu Analysis
• Two projects planned for PRIME students at CNIC
  - Epitope identification
  - Host selectivity
  - Need synopsis to refine collaborations and subprojects
• Scientific discussion during breakout session – PRAGMA 11
  - Discuss results
  - Project coordination
24
Metagenomics Annotation
• Sequencing of genomes from native environmental samples
  - Shared software stack
  - Routine analysis
  - Use Gfarm/CSF4 for scheduling and data replication
  - Data services
  - Portal (shared infrastructure)
25
Supercomputing Demonstrations
• Potential Topics
  - Tiled display using VMD – Kim Baldridge
  - BioPortal – Grid application portal
  - CNIC demonstration – CNGrid
  - GridSphere portal to Gfarm/CSF4
  - Biosciences Portal
• Booth Location
  - SC04 – KISTI
26
Other Activities
• Summer interns
  - Australia: visa required
  - US: J-1 visa
• Grant applications
  - Applications from own funding agencies
  - International collaborations
• Intellectual properties
  - Standard nondisclosure agreements
• World Community Grid
  - Philanthropic activities
27
ISGC 2006, 1–4 May, Taipei
• EGEE Workshop
  - Its purpose is to introduce the EGEE project, including its goals, infrastructure, middleware and operations.
• Symposium
  - It focuses on Grid core technology, Grid architecture, and applications in various domains such as High Energy Physics, Bio/Medical, Digital Archive, and Atmospherics. World-wide Grid application development, infrastructure interoperation, and collaboration will also be discussed.
• http://www.twgrid.org
28