Thoughts on TCP benchmarking

Download Report

Transcript Thoughts on TCP benchmarking

A WAN-in-LAB for
Protocol Development
Netlab, Caltech
Lachlan Andrew, George Lee,
Steven Low(PI), John Doyle, Harvey Newman
Outline





What and why is WAN-in-Lab?
What can I do with WiL?
Why would I use WiL?
How do I use WiL?
Future plans
What is WAN-in-Lab?

“Wide Area” Network in a laboratory
• Real fibre delays
• Carrier-class routers, switches, …
Why -- Spectrum of tools
cost
?
UltraLight
PlanetLab
Abilene
NLR
LHCNet
CENIC
etc
DummyNet
EmuLab
ModelNet
NS2
WAIL
SSFNet
QualNet
JavaSim
Mathis formula
Optimization
Control theory
Nonlinear model
Stocahstic model
abstraction
live netwk WANinLab emulation
simulation
maths
All scales are important– WAN-in-Lab fills a gap
What can I do with WAN-in-Lab?
Other groups’ interests

Protocol development
• FAST, delay-based
• MaxNet, explicit signalling
• ADPM, single-bit explicit signalling



Impact of small buffers (U. Pittsburgh)
Test automatic configuration of routers
(MonALISA, Ultralight)
Test distributed file-system (MojaveFS)
TCP Benchmarking

Our current main direction
Evaluating others’ protocols, not ours

Web interface

• Submit kernel patch
• Standard tests automatically performed
• Results mailed back

Explicit or implicit signalling protocols
Physical topology
Svr2
Eth2
10.2.12.2
Svr10
Eth2
10.2.21.2
Svr1
Eth2
10.2.11.2
POS1/1
10.0.12.2
Eth2
10.1.12.2
Svr9
Svr14
Eth2
10.2.24.2
POS9/1
10.0.24.2
POS1/1
10.0.24.4
RB POS2/1
Gi2/2 10.0.23.2
10.2.22.1
Eth2
10.1.12.2
POS1/1
10.0.12.1 POS9/1
10.0.13.1
POS1/1 RD
10.0.24.4
POS1/1
10.0.23.3
Eth2
10.3.11.2
POS9/1
10.0.13.3
Disk4
RA
eth2
10.1.13.2
Svr13
RC
Eth??
10.1.51.2
SA:gi1/0/12
Eth1
10.3.12.2
Eth??
10.1.13.2
Disk1
Svr6
Eth2
10.3.12.18
Svr7
Eth2
10.3.12.17
Svr11
Disk2
Disk3
Capabilities: Delay

24 spools of 100km fibre, many loopbacks
• Set delay by MEMS switching loops in/out

130ms physical delay
• more with IP loopback

2 Dummynets: long delay for cross-traffic
125 ms, 1.8ms steps
External connections

Linked to Ultralight, 10Gbps Physics WAN
Smooth migration testing -> deployment

Delay

• longer
• jitter


Cross traffic
Monitor
data routed
through WiL
Why use WAN-in-Lab?
Why use WiL?

Complement other levels of abstraction,
not replace them

Different ways to use it: reasons for each

Standard platform for TCP benchmarking
• Easier to compare with others’ results
• No need to write your own test suite
Artifacts of software delays


Packets sent on 1ms “ticks”
1Gbps = 83,333 pk/s
83 packets
1ms
How can I use WAN-in-Lab?
Management structure
Wil-ns.cs.caltech.edu
Start script, configure, compile
Servers
Network boot
Read-only FS
virtual /etc
scratch disk
Data plane
Time sharing

Coarse switching between projects
• Servers rebooted, routers reconfigured


Switchover takes ~5 minutes
Book in advance
• For longer bookings, book further in advance
• Also “ad hoc” bookings for individual hosts

Can log in while others have booked
Future plans
Future plans

Benchmarking infrastructure
• Standardise tests
• Use it ourselves
• Develop “indices” of TCP performance


Better control over capacities and buffers
Better cross-traffic generation
• Currently Harpoon


Investigate differences from DummyNet
Integrate DAG cards
Conclusion


WAN-in-Lab fills the gap between
emulation and live network experiments
Seeks to be as realistic as possible
• Long links, simple topology

Focus will be on TCP benchmarking

We welcome people to use it
<http://wil.cs.caltech.edu>
Spare Slides
Case Study: MaxNet
Aim: Wind Tunnel of Networking

WAN in Lab
• Capacity: 2.5 – 10 Gbps
• Delay: 0 – 120 ms round trip

Breakable
• Won’t take down live network

Flexible, active debugging
• Passive monitoring, AQM

Configurable & evolvable
• Topology, rate, delays, route
• Modular design stays up to date

Integral part of R&A networks
• Transition from theory, implementation, demonstration,
deployment
• Transition from lab to marketplace

Global resource
• Part of global infrastructure UltraLight led by Harvey Newman
Equipment







4 Cisco 7609 routers with OC48 line cards
6 Cisco ONS 15454 switches
A few dozen high speed servers
1G switch to routers/servers
Calient switch for OC48
2,400 kilometres of fibre, optical
amplifiers, dispersion compensation
modules
63ms aggregate RTT delay, in two hops
• 120ms using IP loopbacks
Accounts


Mail wil at cs.caltech.edu
Sudo access to “network” commands
• Ifconfig/…/
• Custom commands to set topologies

Login to routers if required

Separate accounts for “benchmark only”
Configuration -- Delays

Want maximum delay from limited fibre
• Signals traverse fibre 16 times

4 WDM wavelengths
4 OC48 (2.5G) MUXed onto OC192 (10G)

Lots of transponders

• WDM amplifier joins 100km spools  200km
Configuration – delays
OC48 slot
16x200km
-------WDM Wavelength--------
Bidirectional 100km
Bidirectional 100km
Amp
Configuration – delays



Delay varied by adjusting the number of
OC48 hops traversed
Calient optical switch selects required
hops
Hop lengths 200km up to 1600km
• Maximise granularity given limited switch ports
Switch
Projects


TCP benchmarking
FAST
• Delay-based congestion control

MaxNet
• Explicit signalling congestion control

MojaveFS
• New distributed file system

University of Pittsburgh
• TCP with small buffers

University of Melbourne
• Single-bit congestion marking
WAN-in-Lab testbed



Dummynet and simulation introduce artifacts
Also need to test on real equipment
WAN with real delays, located in a single room
• Connected to an external WAN (Ultralight)

Open for the community to use for benchmarking
OC-48
OC-48
WAN-in-Lab capabilities
Current
Planned
Two 2.5G bottlenecks
Multiple 1G bottlenecks
Six 2.5G bottlenecks
Two “real” delays
Up to six “real” delays
(Emulate cross traffic delay)
End-to-end RTT, drop
Per-router delay, drop
(movable DAG cards)
Configuration -- delays
OC48 slot
-------WDM Wavelength--------
Bidirectional 100km
Bidirectional 100km
Amp
Using WAN-in-Lab

Contact me – lachlan at caltech . Edu

Coarse timesharing
• Some users set up experiments while others
run experiments

Software setup still being developed
• Your chance to influence our directions to tailor
it to your needs
Sample MaxNet results

Achieves realistic delay at 1Gbit/s