Presentation

Download Report

Transcript Presentation

TAT
CrossGrid in Its Halfway:
Achievements and Challenges
Marian Bubak, Michal Turala and the CrossGrid Collaboration
www.eu-crossgrid.org
Cracow Grid Workshop, October 27 – 29, 2003
TAT
Outline
 Applications
 Overview of X# architecture
 Tool environment
 Grid middleware
 X# and OGSA
 Software development, standards, bodies, …
 Summary
Cracow Grid Workshop, October 27 – 29, 2003
TAT
Medical Application
LB flow
simulation
 Adding small modifications to the
proposed structure results in
immediate changes in the blood flow
Visualization
 Online presentation of simulation
results via a 3D environment
 The progress of the simulation and
the estimated time of convergence
should be available for inspection
Interaction
Cracow Grid Workshop, October 27 – 29, 2003
VE
WD
PC
PDA
Basic Features of Flood Simulation
 Meteorological
• Intensive simulation
(HPC), large input/output
data sets, high availability
of resources
TAT
Data sources
Meteorological
simulations
 Hydrological
• Parametric simulations
(HTC) may require
different models
(heterogeneous
simulations)
Hydrological
simulations
Users
Hydraulic
simulations
 Hydraulic
• Many 1-D simulations
HTC, 2-D hydraulic
simulations require HPC
Output visualization
Cracow Grid Workshop, October 27 – 29, 2003
TAT
Distributed Data Analysis in HEP
 Objectives
• Access to distributed
databases
• Data mining with neural
networks
 Issues
• Typical interactive requests will
run on o(TB) of distributed data
• Transfer/replication may take
hours
• Data are transferred before
interactive session
• Allocation, installation and
setup of the database servers
before the interactive session
Interactive
Session
Resource
Broker
Replica
Manager
Portal
XML in/out
On-line
output
Interactive
Session
Manager
Interactive
DISTRIBUTED
Interactive
Session
Interactive
Session
Interactive PROCESSING
Worker
Session
Interactive
Worker
Session
Worker
Session
Worker
Worker
Cracow Grid Workshop, October 27 – 29, 2003
DB Installation
Interactive Session
Database server
TAT
Weather Forecast and Air Pollution Modeling
 Distributed/parallel code on Grid
• Coupled Ocean/Atmosphere Mesoscale Prediction
System
• STEM-II Air Pollution Code
• Integration of distributed databases
 Data mining applied to downscaling weather forecasts
Cracow Grid Workshop, October 27 – 29, 2003
TAT
Interactions in the CrossGrid Applications
 Interactive steering
 Control of a cascade of simulations
 Distributed data access
 Parameter study with online results
 Interactive estimation of simulation time
Cracow Grid Workshop, October 27 – 29, 2003
TAT
Features of Interactive Grid Computing
 Run time application control
• Performance data on-line
 Possible effects of decisions
• Access to benchmark information
 Interpreting application’s behavior in heterogeneous
open system
• Access to infrastructure performance
 Information meaningful in the context of application field
• more application specific performance data
 Need in on-line standard and user-defined metrics
Cracow Grid Workshop, October 27 – 29, 2003
TAT
CrossGrid Architecture
Application
Applications
MPICH-G
Collective Layer
Libraries
Supporting
Tools
Portal and
HLA
Benchmarks
Tools
Migrating Desktop
Application
Specific Services
Infrastructure
Roaming Access
User Interaction
Grid Visualization
Services
Kernel
OCM-G
Monitoring
Scheduling
DataGrid Data
Agent
Management
Generic
Services
DataGrid Job
Globus
Management
Toolkit
Cracow Grid Workshop, October 27 – 29, 2003
Data Access
TAT
Migrating
Desktop
Application
Tool
Benchmark
OCM-G
Grid Visualization
Kernel
Portal and Roaming
Access
Simulation
Output
(Parallel)
Application
Running
User Interaction
Services
Infrastructure
Monitoring
Scheduling
Agent
Data
Access
DataGrid Data
Management
DataGrid Job
Management
Globus Toolkit
Multiple
Sites
Cracow Grid Workshop, October 27 – 29, 2003
TAT
8 rank 1 performs MPI_Recv
8 Rank 0 is pending!
8 Rank 1 is pending!
WARNING: deadlock detected
Tool
Application
Benchmark
MARMOT MPI
Verification Tool
OCM-G
Grid Visualization
Kernel
Portal and Roaming
Access
User Interaction
Services
(Parallel)
Application
Running
Infrastructure
Monitoring
Scheduling
Agent
Data
Access
DataGrid Job
Management
DataGrid Data
Management
Globus Toolkit
Multiple
Sites
Cracow Grid Workshop, October 27 – 29, 2003
TAT
Tool
Application
Benchmark
OCM-G
Portal and Roaming
Access
Performance
Prediction Tool
(Parallel)
Application
Running
Grid Visualization
Kernel
User Interaction
Services
Infrastructure
Monitoring
Scheduling
Agent
Data
Access
DataGrid Job
Management
DataGrid Data
Management
Globus Toolkit
Multiple
Sites
Cracow Grid Workshop, October 27 – 29, 2003
TAT
Tool
Application
Benchmark
OCM-G
Grid Visualization
Kernel
Portal and Roaming
Access
G-PM Performance
Analysis Tool
User Interaction
Services
(Parallel)
Application
Running
Infrastructure
Monitoring
Scheduling
Agent
Data
Access
DataGrid Job
Management
DataGrid Data
Management
Globus Toolkit
Multiple
Sites
Cracow Grid Workshop, October 27 – 29, 2003
Site Administrator
Tool
TAT
Application
Benchmark
OCM-G
Jiro-based
Infrastructure
Monitoring
(Parallel)
Application
Running
Grid Visualization
Kernel
Portal and Roaming
Access
User Interaction
Services
Infrastructure
Monitoring
Scheduling
Agent
Data
Access
DataGrid Job
Management
DataGrid Data
Management
Globus Toolkit
Multiple
Sites
Cracow Grid Workshop, October 27 – 29, 2003
Site Administrator
Tool
TAT
Application
Benchmark
OCM-G
SANTA-G Network
Packets Monitoring
Grid Visualization
Kernel
Portal and Roaming
Access
User Interaction
Services
(Parallel)
Application
Running
Infrastructure
Monitoring
Scheduling
Agent
Data
Access
DataGrid Job
Management
DataGrid Data
Management
Globus Toolkit
Multiple
Sites
Cracow Grid Workshop, October 27 – 29, 2003
TAT
Migrating
Desktop
Application
Tool
Benchmark
OCM-G
Grid Visualization
Kernel
Portal and Roaming
Access
Benchmark
Results
Benchmark
Running
User Interaction
Services
Infrastructure
Monitoring
Scheduling
Agent
Data
Access
DataGrid Data
Management
DataGrid Job
Management
Globus Toolkit
Multiple
Sites
Cracow Grid Workshop, October 27 – 29, 2003
TAT
Tool
Application
Benchmark
OCM-G
Estimation of Data
Access Time for
Replica
Management
Grid Visualization
Kernel
Portal and Roaming
Access
User Interaction
Services
Infrastructure
Monitoring
Scheduling
Agent
Data
Access
DataGrid Job
Management
Handling
Multiple
Mass
Storage
Systems
DataGrid Data
Management
Globus Toolkit
Cracow Grid Workshop, October 27 – 29, 2003
TAT
Sites of the CrossGrid Testbed
Géant
TCD Dublin
UvA Amsterdam
PSNC Poznan
ICM & IPJ Warsaw
FZK Karlsruhe
USC Santiago
CYFRONET Cracow
CSIC-UC IFCA
Santander
LIP Lisbon
CSIC RedIris Madrid
II SAS Bratislava
Auth Thessaloniki
UAB Barcelona
CSIC IFIC Valencia
DEMO Athens
Cracow Grid Workshop, October 27 – 29, 2003
UCY Nicosia
Resources in CG Testbed
Configuration
Machine
(LCFG)
TAT
 At each site
User
Interface
•
•
•
•
•
3Com
LCFG configuration server
User Interface
Gatekeeper (Computing Element)
Worker Nodes
Storage Element
 16 sites:
IDC
Storage
Element
Worker Nodes Gatekeeper
• 115 CPUs (Worker Nodes)
• 4 TB (Storage Elements)
 National Certification Authority machines
 Grid services (LIP)
Monitoring
CA machine
(offline)
•
Data General
Information Index
•
MyProxy
•
Resource Broker
•
•
3Com
•
Resource
Broker
Replica
Catalogue
IDC
Virtual
Organization
Short lived certificates for long lived processes, used by RB
Virtual Organization Server
•
•
Database for physical replica file location
Certificate Proxy Server
•
•
Matchmaking and load balancing scheduler
Replica Catalogue
•
IDC
Top MDS Information Server, points to site Information Servers
Database for user authentication (CROSSGRID VO)
Monitoring
•
Mapcenter: network monitoring system
Cracow Grid Workshop, October 27 – 29, 2003
And with Related Grids ...
Cracow Grid Workshop, October 27 – 29, 2003
TAT
TAT
X# and OGSA
 OGSA – Open Grid Services Architecture
• General vision of Grid made of services
 OGSI – Open Grid Services Infrastructure
• Defined conventions and basic interfaces of Grid
Services
• OGSI-compliant Grid Service
 Globus Toolkit 3.0
• Contains OGSI-implementation
• OGSI-based higher level Globus Toolkit services
Cracow Grid Workshop, October 27 – 29, 2003
TAT
Migration to GT3
 Replacing GT2 with GT3 on our testbed
 More technical than architectural issue
 CrossGrid uses Globus indirectly (through EDG)
 No stable relese of GT3 yet
 No direct requirements for using GT3
• However experimental installation for Task 1.1
Cracow Grid Workshop, October 27 – 29, 2003
TAT
Migration to OGSA
 Functionalities of CrossGrid from the beginning
designed as services
 Web Service protocols (SOAP) widely used
 Easy to be made OGSI-compliant
• when required
• when stable implementation available
Cracow Grid Workshop, October 27 – 29, 2003
TAT
Tools and OGSA
 MARMOT – MPI verification
• Independent of OGSA – can use any MPI
implementation
 Benchmarks
• Can publish benchmark results in Grid
information systems
 Performance evaluation and prediction tools
• Basically independent of OGSA
Cracow Grid Workshop, October 27 – 29, 2003
TAT
Services
 Portal and Roaming Access
• RAS uses SOAP protocol to communicate
• Little effort to migrate to OGSA
 Scheduling Agents
• Dependent on EDG and Condor
• OGSI-compliant interface can be added
 Monitoring
• Application monitoring is not going to migrate to OGSA – due to
performance issues, however feasibility studies are performed
• Infrastructure monitoring is compatible with EDG R-GMA that is
going to migrate to OGSA
 Optimization of Data Access
• Ready to migrate, SOAP protocol already used
Cracow Grid Workshop, October 27 – 29, 2003
TAT
Software Development and Testing
• Proceeds through all the development phases
• Requires gathering of quality indicators and
producing monthly reports
• Crucial for ensuring quality of code
• Currently in progress
Cracow Grid Workshop, October 27 – 29, 2003
TAT
Static Code Metrics
Measured separately for each task:
• amount of new lines of code since last report
• compliance with coding conventions
• count of packages, functions and inner classes
• complexity number (McCabe metric)
• count of formal Javadoc comments per class
and method
Cracow Grid Workshop, October 27 – 29, 2003
TAT
X# Management in Short
Main Bodies
• Project Coordinator
• CG Office
• Steering Group
• Technical Board
• Architecture Team
• Quality Assurance
• Central Dissemination
Better focus on technical
issues and quality control
Cracow Grid Workshop, October 27 – 29, 2003
TAT
Collaboration with DataGrid
 Architecture
• interoperability, reuse of MW components
• joint proposals to GGF
 Interactive applications
• methodology, generic structure, grid services
• security issues for medical applications
 Tools: benchmarks and application monitoring
 Interoperability of EDG testbeds
• collaboration of testbed support teams
• mutual recognition of Certification Authorities
• common access/usage policy and procedures
• common installation/configuration procedures
• joint infrastructure for HEP applications
 Software Testing
Cracow Grid Workshop, October 27 – 29, 2003
TAT
Collaboration with Other Grid Projects
 GRIDSTART
• Active participation in Technical Board meetings
• Conferences: Cracow Grid Workshop, Across Grids (Santiago) – to
stimulate exchange of information and personal contacts
 GLOBUS
• Regular contacts
• CrossGrid use cases for OGSA
 GridLab
• Development of grid applications
• Application monitoring
• Portals, data access, security
• Participation in joint testbeds (SC’2002)
 EuroGrid and GRIP
• Regular contacts and exchange of information
Cracow Grid Workshop, October 27 – 29, 2003
TAT
Research Areas in CG
(about 90 papers)
 New paradigms for grid application programming
 Grid PSE for complex problems
 Collaborative environments
 Grid monitoring and benchmarking
 Monitoring of grid applications
 Performance of grid applications (analysis and prediction)
 Grid services for interactive applications
 Building of Virtual Organizations (tools, rules)
 Methods and tools for grid middleware deployment
 In F. Berman, G. Fox, T. Hey (eds): Grid Computing, Wiley 2003
 Participation in 6FP projects (EGEE, Oct 15 – 5?)
 GGF – 2 x BOF, and plans for GGF10 in Frankfurt; ICCS’2004
Cracow Grid Workshop, October 27 – 29, 2003
TAT
www.eu-crossgrid.org
•21 partners
•5 Work Packages:
•WP1 – CrossGrid Applications
•WP2 – Grid Tool Environment
•WP3 – New Grid Services
•WP4 - International Testbed
•WP5 – Project Management
Cracow Grid Workshop, October 27 – 29, 2003