slides - GEONGRID.org

Scientific Collaboration Cyberspace
TieJian Luo Ph.D.
[email protected]
Agenda

Motivation

Modeling Collaboration

The Challenges for e-scientists

Five Pilot Projects

Lessons learned
IWCG2006
New Science Paradigms
• Thousand years ago:
  Experimental Science
  - description of natural phenomena
• Last few hundred years:
  Theoretical Science
  - Newton's Laws, Maxwell's Equations …
• Last few decades:
  Computational Science
  - simulation of complex phenomena
• Today:
  e-Science or Data-centric Science
  - unify theory, experiment, and simulation
  - using data exploration and data mining
    • Data captured by instruments
    • Data generated by simulations
    • Processed by software
    • Scientist analyzes databases/files
(With thanks to Jim Gray)
$$\left(\frac{\dot{a}}{a}\right)^2 = \frac{4\pi G\rho}{3} - K\frac{c^2}{a^2}$$
Scientific Computing Research Paradigm
[Diagram: two research workflows linking data, information, model, idea and result; arrows labeled extract, deduce, verify model and deduce natural law]
Scientific computing:
(1) problem model
(2) experiment, simulation, data collection
(3) get computing resource
(4) computing, verify model
Data mining:
(1) identify problem domain
(2) experiment, data collection
(3) get computing resource
(4) analyze data, deduce natural law
The Problem for the e-Scientist
[Diagram: Experiments & Instruments, Other Archives, Literature and Simulations supply facts; the scientist poses questions and gets answers]
• Data ingest
• Managing a petabyte
• Common schema
• How to organize it?
• How to reorganize it?
• How to coexist & cooperate with others?
• Data Query and Visualization tools
• Support/training
• Performance
  - Execute queries in a minute
  - Batch (big) query scheduling
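The two performance goals above (answer small queries within a minute, schedule big ones as batch jobs) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the class name `QueryScheduler`, the idea that a runtime estimate is available at submission time, and the shortest-estimate-first ordering of the batch queue. None of it is taken from the slides.

```python
import heapq
from itertools import count

INTERACTIVE_LIMIT_S = 60  # "a minute" from the slide; the constant name is hypothetical


class QueryScheduler:
    """Route queries by estimated runtime: short ones run now, long ones wait in a batch queue."""

    def __init__(self, limit_s=INTERACTIVE_LIMIT_S):
        self.limit_s = limit_s
        self._batch = []       # min-heap of (estimated seconds, sequence number, query)
        self._seq = count()    # tie-breaker so equal estimates keep submission order

    def submit(self, query, estimated_s):
        """Return 'interactive' if the query fits the time limit, else queue it and return 'batch'."""
        if estimated_s <= self.limit_s:
            return "interactive"
        heapq.heappush(self._batch, (estimated_s, next(self._seq), query))
        return "batch"

    def next_batch(self):
        """Pop the queued query with the smallest estimate, or None if the queue is empty."""
        if not self._batch:
            return None
        return heapq.heappop(self._batch)[2]
```

Shortest-estimate-first is only one possible policy; a real archive would likely also weigh fairness between users and data locality.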
Scientific Data Life Cycle
• Data Acquisition
• Data Ingest
• Metadata
• Annotation
• Provenance
• Data Storage
• Data Cleansing
• Data Mining
• Curation
• Preservation
What is a Scientific Collaboration?
• Definition: two or more people work together to create or achieve the same thing.
  - Interdisciplinary participants
  - Teamwork
  - Agreement
  - Common interests
  - Divide the task into several pieces of work
  - Discuss problems, use instruments and share information
  - Goal: create new knowledge
• The problem: how to make these things happen?
• Solution: human cooperation + resource sharing
Basic Collaboration Model (G. Olson)
• People to People
  - Communication
  - Groupware Service
• Access to Facilities
  - Interaction with the Physical World
  - Access to Instrument online
• Access to Information
  - Digital Libraries, E-Pub
  - Search Service
The concept dates back to 1989.
National Collaboratories: Applying Information Technology for Scientific Research, NAP, 1993
Current collaboration technology
• Electronic communication tools send messages, files, data, or documents between people and hence facilitate the sharing of information.
  - e-mail
  - faxing
  - voice mail
  - Web publishing
• Electronic conferencing tools also facilitate the sharing of information, but in a more interactive way.
  - data conferencing: networked PCs share a common "whiteboard" that each user can modify
  - voice conferencing: telephones allow users to interact
  - video conferencing (and audio conferencing): networked PCs share video or audio signals
  - Internet forums (also known as message boards or discussion boards): a virtual discussion platform to facilitate and manage online text messages
  - chat rooms: a virtual discussion platform to facilitate and manage real-time text messages
  - electronic meeting systems (EMS): a conferencing system built into a room. The special-purpose room will usually contain a large-screen projector interlinked with numerous PCs.
• Collaborative management tools facilitate and manage group activities.
  - electronic calendars (also called time management software): schedule events and automatically notify and remind group members
  - project management systems: schedule, track, and chart the steps in a project as it is being completed
  - workflow systems: collaborative management of tasks and documents within a knowledge-based business process
  - knowledge management systems: collect, organize, manage, and share various forms of information
  - extranet systems (sometimes also known as 'project extranets'): collect, organize, manage and share information associated with the delivery of a project (e.g. the construction of a building)
  - social software systems: organize social relations of groups
The Services to Enable Scientific Collaboration
3 Cyberspaces, 10 Service functions
• Virtual Team
  - Team management
  - Task coordination (workflow)
  - Project management
  - Trust metrics for collaboration
• Resource and Interaction
  - Online communication
  - Access to instruments
  - Semantics for scientific data
• Information and Knowledge
  - Publication and search
  - Knowledge space
  - Expert community
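The "3 cyberspaces, 10 service functions" structure can be written down as a small lookup table. This is a sketch only: the grouping of functions under each cyberspace follows the order of the labels on the slide and is an assumption, and the names `SERVICES`, `count_services` and `cyberspace_of` are hypothetical.

```python
# Assumed grouping of the ten service functions under the three cyberspaces,
# read off the slide in order; the slide itself does not show explicit nesting.
SERVICES = {
    "Virtual Team": [
        "Team management",
        "Task coordination (workflow)",
        "Project management",
        "Trust metrics for collaboration",
    ],
    "Resource and Interaction": [
        "Online communication",
        "Access to instruments",
        "Semantics for scientific data",
    ],
    "Information and Knowledge": [
        "Publication and search",
        "Knowledge space",
        "Expert community",
    ],
}


def count_services(services):
    """Return the total number of service functions across all cyberspaces."""
    return sum(len(functions) for functions in services.values())


def cyberspace_of(function_name, services=SERVICES):
    """Return the cyberspace that offers the given service function, or None."""
    for space, functions in services.items():
        if function_name in functions:
            return space
    return None
```

With this grouping the counts agree with the slide title: three groups and ten functions in total.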
Case Study: China State Key Labs
Area of Study       Labs
Chemistry           22
Math and Physics    15
Geosciences         18
Biology             38
Information         26
Material            18
Engineering         25
Users: more than 10k
Large instruments: more than 6k
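As a quick check on the scale of the case study, the per-area lab counts can be totaled. The pairing of each area with its count is assumed from the order in which the labels and numbers appear on the slide, and the variable names are hypothetical.

```python
# Lab counts per area of study; the area/count pairing is assumed from slide order.
LABS_BY_AREA = {
    "Chemistry": 22,
    "Math and Physics": 15,
    "Geosciences": 18,
    "Biology": 38,
    "Information": 26,
    "Material": 18,
    "Engineering": 25,
}

total_labs = sum(LABS_BY_AREA.values())               # 162 labs in total
largest_area = max(LABS_BY_AREA, key=LABS_BY_AREA.get)  # area with the most labs
largest_share = LABS_BY_AREA[largest_area] / total_labs  # fraction of all labs

print(f"{total_labs} labs; largest area: {largest_area} ({largest_share:.0%})")
```

Under that pairing the platform spans 162 State Key Labs, with Biology accounting for roughly a quarter of them.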
Physical Resource and Virtual Team
[Diagram: four State Key Labs (SKL A, SKL B, SKL C, SKL D), each holding people, devices, data, info and know-how. Virtual teams A, B and C are assembled from people, data and info drawn from different labs.]
Scientific Collaboration Cyberspace
[Layered architecture diagram; labels recovered from the figure]
Domains: Bio, Che, Phy, Mat, Geo, Math, Eng, Joint, ……
Services: Team, Task, Project, Expert, Metrics, Know How, Auth, Instrument Acc, Pub/Search, Semantic Data, Network com
Collaboration 'game rules'
Management: LDAP, ACL, SSL, ……
Network Infrastructure
Resources and sites: Data Resource (Archive, Data, Multimedia, Raw data), Instruments, Physical, Computing
Modeling Elements
• Entity: Participant, Management Institute, Instrument, Data Storage, Affiliate org, E-community
• Relationship: Info flow, Capital flow, Data flow, Communicate, Management
• Behavior: {action description}
• Constraint: {name}
• Container object: Collaboration cyberspace
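The modeling elements above (entities, relationships, behaviors, constraints, and the cyberspace as a container) map naturally onto a small data structure. The sketch below is one possible reading, not the authors' model: the class and field names are hypothetical, and treating "Management Institute" as a single entity kind is an assumption.

```python
from dataclasses import dataclass, field
from enum import Enum


class EntityKind(Enum):
    PARTICIPANT = "participant"
    MANAGEMENT_INSTITUTE = "management institute"
    INSTRUMENT = "instrument"
    DATA_STORAGE = "data storage"
    AFFILIATE_ORG = "affiliate org"
    E_COMMUNITY = "e-community"


class RelationKind(Enum):
    INFO_FLOW = "info flow"
    CAPITAL_FLOW = "capital flow"
    DATA_FLOW = "data flow"
    COMMUNICATE = "communicate"
    MANAGEMENT = "management"


@dataclass(frozen=True)
class Entity:
    name: str
    kind: EntityKind


@dataclass(frozen=True)
class Relationship:
    source: Entity
    target: Entity
    kind: RelationKind
    behavior: str = ""    # the "{Action Des}" label on the slide
    constraint: str = ""  # the "{name}" of the governing rule


@dataclass
class Cyberspace:
    """Container holding the entities and the relationships between them."""
    name: str
    entities: list = field(default_factory=list)
    relationships: list = field(default_factory=list)

    def add_entity(self, entity):
        self.entities.append(entity)
        return entity

    def relate(self, source, target, kind, behavior="", constraint=""):
        """Link two entities already in this cyberspace; reject unknown ones."""
        if source not in self.entities or target not in self.entities:
            raise ValueError("both entities must belong to the cyberspace")
        rel = Relationship(source, target, kind, behavior, constraint)
        self.relationships.append(rel)
        return rel

    def outgoing(self, entity, kind=None):
        """Relationships starting at `entity`, optionally filtered by kind."""
        return [r for r in self.relationships
                if r.source == entity and (kind is None or r.kind == kind)]
```

For example, a participant lab and an instrument can be added to a `Cyberspace` and linked with a `DATA_FLOW` relationship whose `behavior` describes the action taken.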
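Knowledge Creation and Distribution Model
Ensure a fair game!
[Diagram: Members of Labs and Affiliate orgs, with cardinalities (1,m) and (1,n), and Extra members of Outside labs, with cardinalities (1,p) and (1,q), join the E-community under an Agreement. Inside the Collaboration Cyberspace they use the Platform, Instrument and Data storage, subject to a Buy rule; the Management Agency oversees.]
Numbered flows: 1. contribution  2. get knowledge  3. oversee
Legend
• Entities: Participant, Management Agency, Affiliate org, E-community, Instrument, Data storage
• Relationships: Info flow, Capital flow, Data flow, Communicate, Management
• Behavior: {activity description}
• Constraints: {rule name}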
SCC Web Architecture
Benefits
• Adaptable: templates for more than 10 different science subjects
• Scalability: a dynamically growing VO does not affect platform performance
• Extensibility: a new service is easy to plug into the platform
SCC web site: http://co-lab.chinalab.gov.cn/
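The extensibility benefit (plugging a new service into the platform) is commonly achieved with a registry that maps service names to handlers. The sketch below shows that general pattern only; it is not the SCC implementation, and `ServiceRegistry` and its methods are hypothetical names.

```python
class ServiceRegistry:
    """Minimal plug-in registry: services are added by name and called through one entry point."""

    def __init__(self):
        self._services = {}

    def register(self, name, handler):
        """Plug in a new service; refuse to overwrite an existing one silently."""
        if name in self._services:
            raise ValueError(f"service already registered: {name}")
        self._services[name] = handler

    def call(self, name, *args, **kwargs):
        """Dispatch a request to the named service; raises KeyError if it is unknown."""
        return self._services[name](*args, **kwargs)

    def names(self):
        """Sorted list of the currently registered service names."""
        return sorted(self._services)
```

Because callers only go through `call`, adding a service does not require changes to existing code, which is the property the slide describes.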
Portal model for accessing autonomous resources
[Diagram: the user reaches Org.A through Org.F via SCS]
Benefits:
• Trusted access path
• Single sign-on
• Delegated permissions and proxy
• Interoperation
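To make the single sign-on idea concrete, here is a minimal sketch in which the portal signs a short-lived token once and each organization verifies it instead of asking the user to log in again. This is an illustration under strong assumptions, not the mechanism used by the platform: the token format, the function names, and the single secret shared between the portal and every organization are all hypothetical.

```python
import hashlib
import hmac
import time


def issue_token(user, secret, now=None, ttl=3600):
    """Portal side: sign 'user|expiry' with a shared secret (bytes) using HMAC-SHA256."""
    expires = int((time.time() if now is None else now) + ttl)
    payload = f"{user}|{expires}"
    sig = hmac.new(secret, payload.encode(), hashlib.sha256).hexdigest()
    return f"{payload}|{sig}"


def verify_token(token, secret, now=None):
    """Organization side: return the user name if the token is authentic and unexpired, else None."""
    try:
        user, expires, sig = token.rsplit("|", 2)
    except ValueError:
        return None  # not in 'user|expiry|signature' form
    payload = f"{user}|{expires}"
    expected = hmac.new(secret, payload.encode(), hashlib.sha256).hexdigest()
    # Constant-time comparison on bytes, so forged signatures leak no timing information.
    if not hmac.compare_digest(sig.encode(), expected.encode()):
        return None
    if (time.time() if now is None else now) >= int(expires):
        return None  # token has expired
    return user
```

A production deployment would rely on an established protocol such as Kerberos or SAML rather than a hand-rolled token; the sketch only shows why one trusted sign-on can serve several organizations.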
Video and audio interactive component
Benefits
1. Multi-node access
2. Plugs into SCC
3. Security and efficiency
Project 1: Soil Carbon Recycle Mechanism Database
• Many participating units
• Data distributed across mainland China
• More than 1K scientists
Soil Carbon Recycle Data Schema
[Diagram: Data Structure]
• Attribute DB: Soil Property DB, Soil Carbon DB, Soil Environment DB
• Space DB: Country Graph, Province Graph, County Graph, Typical Area Graph
  - Layers: Soil Type Graph; Soil Carbon Type Distribution Graph; Terrain, Vegetation, Precipitation
  - Map scales shown: 1:400, 1:100, 1:50, 1:10, 1:5, 1:1
Soil Carbon Recycle collaboration cyberspace
Project 2: FACE (Free Air CO2 Enrichment)
• Investigate how rice growth responds to changes in atmospheric CO2
• 15 collaborating organizations, 9 of them domestic (3 SKLs); 7 overseas countries; 100 research staff
• Investment of 100 million RMB; the only one in China
• 30 science topics
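FACE scientists
• Federal Agricultural Research Centre (Germany)
• National Institute of Agro-Environmental Sciences (Japan)
• North Carolina State University
• Tohoku National Agricultural Experiment Station (Japan)
• U.S. Water Conservation Laboratory
• University of Oklahoma
• State Key Laboratory of Atmospheric Boundary Layer Physics and Atmospheric Chemistry (Institute of Atmospheric Physics)
• State Key Laboratory of Crop Genetics and Germplasm Enhancement (Jilin Agricultural University, Nanjing Agricultural University)
• State Key Laboratory of Soil and Sustainable Agriculture (Nanjing Institute of Soil Science)
• Shanghai Institute of Plant Physiology and Ecology
• Shenyang Agricultural University
• Shenyang Institute of Applied Ecology
• Yangzhou University
• Beijing Education Publishing House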
Contribution to the FACE community
• Monitor the farm site by video
• Automatically upload the raw data
• Display the control panel in real time in the browser
FACE Project deployment
FACE website
• Old
  - Only one function: information publishing
• New
  - 10 services for collaboration
  - 2 years in operation
Project 3: Neutron diffraction online experiment and data sharing
Remote monitoring in the browser
1. Protect the staff from radiation
2. Monitor the experiment process
3. Involve interdisciplinary scientists
Neutron emission facility
Neutron reactor remote access deployment
[Deployment diagram]
• Lab: nuclear reactor, beam of neutrons, sample placement (set sample), sample rotation device, spectrometer, camera, data acquisition
• Computer for monitoring experiment status; computer for monitoring the experiment process
• Monitor proxy and monitor server, reached over the Internet
• Remote users: monitor experiment status, monitor experiment process
Benefits
1. Staff are isolated from the experiment site
2. Remote real-time monitoring of the process
3. Scientists discuss online
Neutron diffraction remote control interface in Browser
Data analysis software
Project 4: Collaboration for BSL3 Labs
(http://clb.gucas.ac.cn)
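BSL3 setting
[Deployment diagram]
• Laboratory: cameras (marked C), pan-tilt camera, video camera, pinhole camera, microphone, biosafety cabinet, inverted microscope
• Control and broadcast studio: control server, video server, surveillance video server, control input
• Network: switch, firewall, 2M bandwidth link to the Internet
• Office area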
Real-time monitoring of the BSL3 labs at Fudan University
Project 5: Collaboration for Experiment Centers (http://cec.gucas.ac.cn)
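[Diagram: instruments and their workstations]
• The PHI 700 Field Emission Scanning Auger Nanoprobe, with the PHI 700 workstation
• Trio MRI brain imaging system [Beijing MRI Center for Brain Research], with the Trio MRI workstation
• 800 MHz NMR spectrometer
Flow: get data, experiment status, transfer data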
Implementation
Large scientific instruments Centers
Instruments running statistics
Lessons from SCC development
• How can the user's application be made simpler?
• System development is less and less about coding than about using things and gluing them together.
• Create an architecture flexible enough to allow for changes.
• Customers will not be able to elucidate at the start what they want; only by using the system will they be able to tell you what you should have done.
• SCC 1.0 relied on MS SharePoint; SCC 2.0 is open source
• Future concerns should be e-community trust metrics
Acknowledgements
• Dec. 2005 to Dec. 2007, China Bio-Safe Level 3 Labs Collaborative Cyberspace, grant from MOST of China
• Dec. 2005 to Dec. 2007, China Large Scientific Instruments Collaborative Cyberspace, grant from MOST of China
• Jul. 2005 to Jul. 2008, EU-Asia Link Programme HPC-Grid Computing Course Model, grant from EU
Thanks