Network Emulation for Researching Long-Fat
Download
Report
Transcript Network Emulation for Researching Long-Fat
Network Emulation for Researching Long-Fat
Networks, Delay Tolerant
Networks and Grid Environments
Presented by:
Eric Coe
Computer Systems Research Department, M1–102
The Aerospace Corporation
www.aero.org
[email protected]
(310) 336-1911
February 9, 2005
Outline
• Aerolab: Aerospace Testbed based on EMULAB
– What it is, how it works, why we need it
• Define the following and motivate why network
emulation is appropriate
– Long Fat Network (aka High-Speed Bandits)
– Delay or Disruption Tolerant Networks
– Grid Environments
• Demo: So how easy is it?
• Future Directions
2
What is Aerolab?
Time- and Space-Shared Network Emulation System
• Construction of Complex Topologies
• Emulation of Link Conditions—Delay, Loss and Bandwidth
(dynamic)
• Boot Multiple Operating Systems—FreeBSD, Linux, and U of
Utah’s OS-Kit (real-time Linux, Windows on the Way)
• Runs on Commodity Hardware—PCs and Cisco Switch
• Scales to Hundreds of Physical Nodes and supports Multiple
Logical Nodes per Physical Nodes
Easy-To-Use Front-End
• Automates Configuration of OS, Topology and Links
• Supports both ns2 style scripting and Web-based GUI
Originally developed by University of Utah under a
National Science Foundation Grant
3
Aerolab Architecture
4
Utah’s Emulab
physical
nodes
programmable
switch
control
nodes
5
Why Network Emulation (Utah)
“We evaluated our system on five nodes.”
-job talk from university with 300-node cluster
“We evaluated our Web proxy design with 10 clients on
100Mbit ethernet.”
“Simulation results indicate ...”
“Memory and CPU demands on the individual nodes
were not measured, but we believe will be modest.”
“The authors ignore interrupt handling overhead in their
evaluation, which likely dominates all other costs.”
“Resource control remains an open problem.”
6
Why Network Emulation (Aerospace)
• “Different projects need to evaluate/operate different
networks”
• “Network set-up and running cables is costly and open
to configuration errors (HW/SW)”
• “Aerolab should hide the networking “magic” so that
researchers can experiment at the application or
protocol level”
• “In theory simulation and the real-world are the same
but in practice they are different”
• “Repeatability, Repeatability, Repeatability and
archiving. Old demos can be run w/ little overhead.”
7
Why Network Emulation (more..)
• “You have to know the right people to get access to the
cluster or real network.”
• “The cluster or real network is hard to use.”
• “<Experimental network X> runs FreeBSD 2.2.x.”
• “October’s schedule for <experimental network Y> is…”
• “<Experimental network Z> is tunneled through the
Internet”
8
Aerolab’s Value to Aerospace
Enhances Networking Capabilities
• Large Scale Experiments
• Complex Topologies
• Hardware and Software “In The Loop”
Automates Laboratory Operations
• Reduces Labor and Complexity
• Improves Repeatability and Allows “Archiving” Experiments
• Facilitates Resource Sharing
Emulab is not just the latest cool thing, it’s the
right way to run a networking lab.
9
Outline
• Aerolab: Aerospace Testbed based on EMULAB
– What it is, how it works, why we need it
• Define the following and motivate why network
emulation is appropriate
– Long Fat Network (aka High-Speed Bandits)
– Delay or Disruption Tolerant Networks
– Grid Environments
• Demo: So how easy is it?
• Future Directions
• Potential Collaboration
10
Long Fat Networks (LFN)
What they are:
• Characterized as networks with large bandwidth-delay products
• Typically gigabit speeds and have delays larger than 70 msec
• Examples: networks of compute-clusters; satellite networks;
transcontinental or transoceanic high-speed links.
• Network where standard TCP has poor performance
11
Long Fat Networks (LFN)
What an Aerolab like system provides:
• Install custom kernel images that contain various transport
protocols (HS-TCP, BIC, CUBIC, WEB100, XCP)
• Total control of the networking environment
• Run what-if scenario’s
• Guarantee no other traffic on the network or only “known” traffic
• Packet loss can be attribute to “WHO”.
• Dynamic Link Characteristics (satellite)
• Varying bandwidth
• Varying error rates, probability of packet loss
• Repeatability: When results from this week do not match last
weeks. .
12
Delay Tolerant Networks (DTN)
What they are:
• Characterized as networks with no contemporaneous end to end
path
• Occur due to intermittent connectivity which is caused by
• large propagation delays
• nodes going up and down frequently
• with highly mobile nodes in sparse network
• Networks where standard TCP has no performance due to the lack
of an end to end path
13
Delay Tolerant Networks (DTN)
14
Delay Tolerant Networks (DTN)
How we classify:
• Application basis
• Interplanetary Network (networking the solar system)
• Sensor networks
– Static nodes (due to nodes choosing their on and off periods
– Mobile nodes (mobility process dictates links presence or
absence)
– Hybrid case
• Link types
• Deterministic: Contacts can be scheduled
• Probabilistic: Contacts happen based on some distribution
• Dynamic: Contacts happen in a purely opportunistic way
15
Delay Tolerant Networks (DTN)
What an Aerolab like system provides:
• Dynamic link conditions for delay, bandwidth, and even link
availability
• Ability to run Bundling Protocol on top of a topology and over
different topologies
• Ability to evaluate routing protocols across the variety of networks
discussed earlier
• A common facility for other researchers to conduct experiments
Avoids everyone having their own “version/vision” of what constitutes a
DTN
16
Grid Environments
What they are:
• Networks of loosely coupled computational resources
• Heterogeneous by nature in both capacity and availability
• Basic grid services and applications must inhabit a dynamic,
heterogeneous, distributed environment
• Issue: How to quantitatively evaluate grid services and
applications in such an environment?
17
Quantitative Evaluation of Grids
What an Aerolab-like system could support for grid research:
• Playback mechanism for network behavior, cross traffic
• Injecting errors for testing fault tolerance, robustness
• Testing autonomic computing methods
(AC control cycle: monitor, analyze, plan, execute)
• Remove a link or halt processes on a host
• Evaluate response of autonomic control system
•
•
•
•
Testing information or resource discovery on larger networks
Quantitatively evaluate scalability in larger topologies
Quantitative evaluation wrt topology structure and connectedness
Tools needed to provide these capabilities in a general way
18
Example: Evaluation of Grid Application Performance
DARPA Active Networks Project
Exploit application topology to
compute lower-bound timestamp in
distributed simulations
Globus used to manage Xbone
and start HLA-compliant simulation
Very hard to run large test cases
USC
ISI
Aerospace
discover
APSimScript
Globus
overlay
request
start
overlay
config
MDS-2
register
XBone
Manager
APSim
APSim
APSim
HLA/RTI
HLA/RTI
HLA/RTI
ATMD
ATMD
overlay
HLA Time Mgmt done
“in the network” by
overlay of active time
mgmt daemons
19
Example: Evaluation of Grid Application Performance
Solution: Emulate larger topologies on Utah EmuLab
Set of quasi-random, tree topologies generated
4, 8, 16, 32, 64 end-hosts
9, 15, 22, 29, 34 interior service hosts
Tree topologies constrained to have average 2.5 degree of connectedness
550 Lower Bound Timestamp (DRN) calculations done for each topology
Example: topology w/ 32 end-hosts
20
Optimistic Mesh Generation on the Grid
Optimistic Delauney Mesh Generation
Optimism controlled by number of cavity
expansion msgs – affects BW demand, latency
Already run on clusters and TeraGrid
Work in Progress: run on AeroLab to do repeatable
parametric tests controlling available bandwidth
Nave, Chrisochoides, Lee, Deerfield, International Meshing Roundtable, Sept. 2004
21
Aerolab
Demo: So how easy is it?
#1. Create a topology
#2. Do something useful
22
Future Directions
• Different OSes, namely Solaris, real-time Linux and
Windows
• Incorporate Hardware emulators (IXP,SPIRNET)
• Incorporate passive network sniffers (layer 1)
• Wireless nodes, possible mobile campus nodes
• Merge with the Aerospace cluster (more nodes)
• Integrate 10 Gigabit capability
• Federate with other testbeds
• More nodes...more nodes...more nodes
23
Aerolab
Thank You!
Questions?
24