Grid Interoperability Update on GridFTP tests
Gregor von Laszewski
[email protected]
http://www.cogkit.org
Outline
Proposal: Documentation consolidation
Proposal: GIN GridFTP Report
– Proposal for Survey
– Proposal for User Identification
– Proposal for identification of what tests to execute
Update GridFTP
– Java CoG Kit Workflow (Testing) Framework
– Testing GridFTP on the TeraGrid
– Testing GridFTP on the GIN Resources
Summary
Proposal Documentation: Making Interoperability possible
One aspect of interoperability is based on an effective mechanism to
document and distribute the information.
Proposal: Consolidate Documentation
– Today: information is spread across several web sites and Wikis
– Build a single entry point for the working group documentation
Proposal: Improve the contents
– Work on extended report, publishable article
– Most important from my point of view:
• Improve the documentation and description on how to get
accounts on the other “Grids”
Proposal: Test documentation effort
– Have interoperability tests executed by third parties to test
effectiveness of documentation and approaches
– Engage in an iterative improvement process of the documentation
GIN Today: Documentation
Documentation links (We need your help to add to this page):
– This should be done during the workshop or as response to the mail
– http://wiki.nesc.ac.uk/read/gin-jobs?HomePage
– https://forge.gridforum.org/sf/wiki/do/viewPage/projects.gin/wiki/GINData
Problem with OGF/GGF infrastructure
– Gridforge is not used (is it too complicated? Is it too much overhead?)
• Only charter is published
– Wikis are the tool of choice for the working group, as much of the information
may change quickly. Proposal: use a single Wiki
• If a Wiki is used, a feature is needed to archive its contents in a single
document.
– OGF schedules do not have pointers to the working group
– Google search in OGF Web page is ineffective
• searching for GIN must give as first link the working group page
– Search in Gridforge is not operational
• Example: search for GIN in wikis on Gridforge returns no results
– Links in the mailing list archives are no longer operational
• Example: links to documentation that were sent by Erwin in May 06
– Transition to OGF has not been completed.
Proposal: GIN GridFTP Report
Goal: provide an extended document that formally reports on the activities of
this particular subtask of the GIN working group in a single report.
Ensure that the goals overlap with the working group goals
Proposed Features that the report may address (in the area of GridFTP)
– Survey of different implementations of GridFTP servers and clients
– Identification of the users for interoperability tests and their goals
• Administrators
• Developers
• Users
– Survey of different tools that monitor GridFTP behavior in production Grids
– Identification of which tests need to be performed to satisfy
• Administrators
• Developers
• Users
– Links to programs and documentation that can perform the tests.
Administrators - Developers - Users
There are different dimensions to be considered when addressing interoperability
issues
– While administrators may care about setting up tests between virtual
organizations, individual user needs are not completely addressed by these tests,
as they are not conducted by the users themselves.
– Lessons learned over the past 5 years have shown that users MUST have the
ability to perform a set of interoperability tests themselves. In most cases this
includes answering the questions:
• Can I copy a file from here to there?
• What is the performance I get when I use the defaults?
• …
– Administrators may be more interested in obtaining the performance and
throughput of the service.
• Tests for each individual user are typically not conducted.
– Developers may be more concerned about functional or protocol
interoperability.
– GIN must address not only administrators and developers, but more
importantly the users.
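The two user questions above ("can I copy a file from here to there?" and "what performance do I get by default?") can be sketched as a single timed copy. This is only a minimal illustration: `timed_copy` is a hypothetical helper, and the default `copy` function is a local file copy standing in for whatever Grid transfer client a user actually has; it is not part of the Java CoG Kit.

```python
import shutil
import tempfile
import time
from pathlib import Path


def timed_copy(src, dst, copy=shutil.copyfile):
    """Copy src to dst with `copy`; report success, elapsed time, and MB/s.

    `copy` is a stand-in (assumption): a real test would pass a function
    that calls the user's Grid transfer client instead of a local copy.
    """
    size = Path(src).stat().st_size
    start = time.perf_counter()
    try:
        copy(src, dst)
    except OSError as exc:
        # Question 1 answered with "no": keep the reason for the report.
        return {"ok": False, "error": str(exc), "seconds": None, "mb_per_s": None}
    seconds = time.perf_counter() - start
    # Question 2: throughput with default settings, in megabytes per second.
    rate = (size / 1e6) / seconds if seconds > 0 else float("inf")
    return {"ok": True, "error": None, "seconds": seconds, "mb_per_s": rate}


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        src = Path(tmp) / "here.dat"
        dst = Path(tmp) / "there.dat"
        src.write_bytes(b"x" * 1_000_000)
        print(timed_copy(src, dst))
```

The point of the design is that the user runs the check with their own credentials and defaults, so the answer reflects what that user actually experiences rather than what an administrator measured.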
Update GridFTP: Testing with the Java CoG Kit Workflow
Java CoG Kit Workflow
– Java CoG Kit workflow can be
run through a remote service, an
API, or a command line tool.
– Java CoG Kit contains easy to
use workflow language that in the
past has been able to bridge
between different Globus
middleware (GT2, GT3, GT4,
SSH, …)
– Language features include
• Functional programming
language expressible in XML
like syntax
• Variable Futures
• Explicit parallelism
• HTML form generation (XUL
like)
Two concepts:
– Abstractions: copy a file from
here to there regardless of the
implementation / protocol
– Providers: Mapping into Grid
middleware or commodity
technologies
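The split between abstractions and providers can be sketched as a small registry. This is a Python illustration of the idea only, not the Java CoG Kit API: `PROVIDERS`, `provider`, and `transfer` are hypothetical names, and the single registered provider is a local file copy standing in for a commodity technology.

```python
import shutil
import tempfile
from pathlib import Path

# Hypothetical registry: provider name -> function that moves src to dst.
PROVIDERS = {}


def provider(name):
    """Decorator that registers a transfer function under a provider name."""
    def register(func):
        PROVIDERS[name] = func
        return func
    return register


@provider("local")
def local_copy(src, dst):
    # Commodity technology: a plain file copy on the local machine.
    shutil.copyfile(src, dst)


def transfer(src, dst, provider_name):
    """The abstraction: callers name a provider, never its implementation."""
    try:
        impl = PROVIDERS[provider_name]
    except KeyError:
        raise ValueError(f"no provider registered for {provider_name!r}") from None
    impl(src, dst)


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        src = Path(tmp) / "here.txt"
        src.write_text("payload")
        transfer(src, Path(tmp) / "there.txt", "local")
        print((Path(tmp) / "there.txt").read_text())  # prints "payload"
```

Supporting another middleware in this sketch would mean registering one more function; code that calls `transfer` would stay unchanged.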
… cont. Java CoG Kit workflow
Transfers are really simple:
specify source, destination,
and provider
<project>
  <include file="cogkit.xml"/>
  <execute executable="/bin/date"
           stdout="thedate"
           host="hot.mcs.anl.gov" provider="GT2"/>
  <echo message="Job completed. Transferring the output"/>
  <transfer srchost="hot.mcs.anl.gov" srcfile="thedate"
            desthost="localhost" provider="gridftp"/>
  <echo message="Transfer complete"/>
  <set name="date">
    <readFile file="thedate"/>
  </set>
  <echo message="The date is {date}"/>
</project>
Technically we could add other providers such as nordu-GridFTP, RFT, SRM, …
It is obvious that this can be used to test not only the file transfers but also
job submissions and other things.
See Also:
– http://wiki.cogkit.org/index.php/Java_CoG_Kit_Documentation
What could be tested
File transfer
– Directory creation
– Permissions
– 2nd party transfers
– 3rd party transfers
– …
– All the features that are available
through the Java CoG Kit
abstractions
– We can integrate more features if
necessary
Job execution
– Similar …
Other things
– …
The CoG workflow and its embedded
HTML capabilities make it possible to
create custom tests.
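To give a sense of how the 2nd-party and 3rd-party transfer items turn into concrete test cases, the sketch below enumerates them for a list of hosts. It assumes that "2nd party" means the client machine is one end of the transfer and "3rd party" means the client steers a transfer between two remote servers; `transfer_matrix` and the site names are hypothetical.

```python
from itertools import permutations


def transfer_matrix(local, hosts):
    """Return (kind, source, destination) tuples for one round of copy tests."""
    tests = []
    for host in hosts:
        # 2nd-party (assumed meaning): the client is one end of the transfer.
        tests.append(("2nd-party", local, host))
        tests.append(("2nd-party", host, local))
    # 3rd-party: every ordered pair of distinct remote hosts.
    for src, dst in permutations(hosts, 2):
        tests.append(("3rd-party", src, dst))
    return tests


if __name__ == "__main__":
    # Placeholder site names, not actual GIN or TeraGrid hosts.
    for kind, src, dst in transfer_matrix("localhost", ["siteA", "siteB", "siteC"]):
        print(f"{kind:10s} {src} -> {dst}")
```

For N hosts this yields 2N + N(N-1) cases, so the number of tests grows quadratically with the number of sites included.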
What did we test?
First we wanted a simple test
– Transfer a file from here to there
– Perform the test by someone
outside of the CoG Kit group to
identify if the tool is
• Easy to install
• Functional
• Robust
– Compare whether the results are similar
to the ones obtained from Globus
native GridFTP clients
Result:
– In all aspects successful
– Through a single shell script the
testing framework can be
installed and run
– The results have been confirmed
with Raj, who is a member of the
GridFTP development team.
To Do:
– We moved to an SVN archive
this month and need to update
that information so others can
replicate it if they wish
GridFTPTest.sh
Installing and Running
– 1. Requirements: Apache Ant and a Java SDK
– 2. mkdir gin
– 3. Get cog-ftptest-install.sh script from Java CoG Kit SVN and place in gin
– 4. Run cog-ftptest-install.sh
– 5. Edit the hosts file and customize the set of sites to include all hosts you would like to
test
• host("<name>"
• service(provider="gsiftp", type="file", url="gsiftp://<host>[:<port>]")
• )
– 6. Run the tests:
• cog-workflow runtests.k
– 7. The output files should appear in the "output" directory
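When many sites are involved, the entries for step 5 could be generated rather than typed by hand. The sketch below only reproduces the entry shape shown in step 5; whether the real hosts file needs further separators or fields is not confirmed here. `host_entry` and the example site names are hypothetical, and port 2811 is used as the usual GridFTP control port.

```python
def host_entry(name, host, port=None):
    """Format one hosts-file entry in the shape shown in step 5."""
    # The port part is optional, matching gsiftp://<host>[:<port>].
    url = f"gsiftp://{host}" + (f":{port}" if port is not None else "")
    return (
        f'host("{name}"\n'
        f'  service(provider="gsiftp", type="file", url="{url}")\n'
        f')'
    )


if __name__ == "__main__":
    # Placeholder sites (assumption), not actual GIN resources.
    sites = [
        ("siteA", "gridftp.sitea.example.org", None),
        ("siteB", "gridftp.siteb.example.org", 2811),
    ]
    print("\n".join(host_entry(*site) for site in sites))
```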
Note: We just moved to an SVN directory, and an issue with the SourceForge SVN
causes a problem upon download. SourceForge is working with us to resolve it.
In the future we intend to host this on dev.globus.org
GridFTP tests on TeraGrid (Conducted by Raj K.)
Verification
– Download
– Install
– Configure
– Run
– No problems
– Tool works as promised
– Raj is a GridFTP developer and confirmed he gets the same results
with the GridFTP tools
GridFTP test on GIN resources (Conducted by Raj K.)
Just changing the hosts to the GIN resources results in
Most issues are related to the user not being able to
authenticate/authorize
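One way to make a statement like "most issues are related to authentication/authorization" checkable is to tally per-host outcomes by category. The sketch below assumes each test outcome has already been labelled by hand as "ok", "authentication", "authorization", or "other"; `summarize` is a hypothetical helper, and the sample list is invented placeholder data, not results from the GIN run.

```python
from collections import Counter


def summarize(results):
    """Count outcomes per category and return the share of failures
    that are access problems (authentication or authorization)."""
    counts = Counter(status for _host, status in results)
    failures = sum(n for status, n in counts.items() if status != "ok")
    access = counts["authentication"] + counts["authorization"]
    share = access / failures if failures else 0.0
    return counts, share


if __name__ == "__main__":
    # Placeholder outcomes (assumption), purely to show the input shape.
    sample = [
        ("site1.example.org", "ok"),
        ("site2.example.org", "authentication"),
        ("site3.example.org", "authorization"),
        ("site4.example.org", "other"),
    ]
    counts, share = summarize(sample)
    print(dict(counts), f"access-related share of failures: {share:.0%}")
```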
Next Steps
Complete the documentation of the tool
Complete the SVN transition
Have a user apply for accounts on all GIN Grid resources and make sure we do not
have authentication issues.
– I thought this would be trivial, but in fact it is not.
– Even on TG this is still a challenge today, as it is not as simple as just
applying for an account.
Summary: GIN GridFTP from a user's perspective
– It is possible to use existing tools such as the CoG Kit to write interoperability
tests that can be executed by the user.
– Tools such as the Java CoG Kit even help bridge interoperability issues
– At this time the issue is not interoperability of GridFTP servers; we can simply
use the appropriate client. The Java CoG Kit has demonstrated that multi-client
abstractions are a feasible approach. Hence, through a higher level of
abstraction we can provide a considerable degree of interoperability.
– Surprisingly, we found that today's real issue is that a user has to spend
days or weeks to get onto the Grid, because the user's environment must be
bootstrapped after the account is granted before the Grid can be used.
– Proposal: As a community, we should make it possible for users to get on the
Grid “fast”
• If it takes a user more than one day to get access to the about 30 Grid
resources we tested in this effort, we need to do something about it.