Introduction to the PPNCG - University of Manchester
Introduction to the PPNCG
Networking for the PPARC Community
Introduction to the PPNCG
UK Network Topologies
External Connectivity – Europe & US
Astronomy & Astrophysics Sites
Grid Network Monitoring
PingER – World wide Monitoring
QoS – a micro Introduction
Astronomy Sysman Meeting 29/30 April 02
R. Hughes-Jones Manchester
1
Introduction to the PPNCG
Membership includes HEP and Astronomy users
Dave Terrett, Bob Bentley, Ralph Spencer
Remit
Ensure the community has the required networking facilities
Monitor end-to-end performance
Investigate new network applications / technologies
Provide advice on kit / facilities
Active Network Monitoring
PPNCG ping, ftp and traceping
ICFA monitoring
Report problems to UKERNA
Regular meetings with UKERNA invited
Recognised as a subject group in JNUG and JISC
Links to several Grid Projects
SuperJANET4: Backbone and Access links
Worldcom supplied the transmission
UKERNA layers the IP service on top
Core PoP IP router at Worldcom
Backbone Access Router at MANs
Access Links:
Large MAN 2.5 Gbit -> 10-20 Gbit
Medium MAN 622 Mbit -> 2.5 Gbit
4 node DWDM development net
Deployment Status:
Backbone Oct 00 Routers Nov 00
All sites Mar 01
Proved to be Stable
Constant growth of traffic
Upgrade Backbone to 10Gbit Jun 02
SuperJANET4: ping rtt Core routers
Jun 01
SuperJANET4: ping rtt Site nodes
Plots: Lancaster, Glasgow, Bristol, Cambridge
MAN / LAN Issues
London MAN Upgrade
UDPmon Tests Manchester – London
MAN was 155 Mbit ATM
Plots: UDP Packet loss (%) and UDP Throughput (Mbit/s) against time interval in weeks; annotation: 1st Oct
Richard HJ
Previous External Connectivity
Europe:
TEN-155
155Mbit Access link
US:
6 * 155 Mbit links
Peer in Hudson St.
622 Mbit to Esnet
622 Mbit to Abilene.
Europe – Access links (1)
ICFAMON Plot from RAL to CERN for 19th Oct to 1st Nov 2001
UK Access link 155 Mbit ATM
Sustained rate 130 Mbit
Contract to end of Nov 01
Bad news for users !
Europe – Access links (2)
Traceping Oxford to CERN for 31st October 2001
loss around ten155-gw.ja.net router
J. Macallister
New External Connectivity
6 * 155 Mbit links
2.5Gbit line installed
IP commodity peer in London
Research traffic over 2.5 Gbit
Peer in Hudson St.
622 Mbit to Esnet
622 Mbit to Abilene.
Connectivity to Europe : Geant
Start mid November 2001
UKERNA switched off TEN-155 3 Dec 2001
Connectivity to Europe
ICFAMON Plot from DL to CERN for 18th Feb to 3rd Mar 2002
UK Dante Access link 2.5 Gbit POS
Remember 19th Oct to 1st Nov 2001
Access link overloaded
Monitoring: US Traffic
UKERNA Traffic data Kbit/s. Blue Traffic from US; Maroon Traffic to US
7 day periods 1 hour averages
14 Jan 2002 (800Mbit/s)
peak 86% of total 930 Mbit
17 Jan 2002
Peering altered 22 Jan
Weekend-Before
Weekday-After
Weekday-Before
22 Jan 2002
Weekday peak 175 Mbit/s
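The percentages quoted for the US link follow from simple arithmetic on the link capacity. A minimal Python sketch, assuming the 930 Mbit total is the six 155 Mbit links listed under external connectivity; the function name is illustrative only:

```python
def utilisation_percent(rate_mbit: float, capacity_mbit: float) -> float:
    """Return the traffic rate as a percentage of the link capacity."""
    if capacity_mbit <= 0:
        raise ValueError("capacity must be positive")
    return 100.0 * rate_mbit / capacity_mbit


# Assumption: the total US capacity is the 6 * 155 Mbit links from the slides.
US_CAPACITY_MBIT = 6 * 155  # 930 Mbit

peak_before = utilisation_percent(800, US_CAPACITY_MBIT)  # 14 Jan 2002 peak
peak_after = utilisation_percent(175, US_CAPACITY_MBIT)   # weekday peak after 22 Jan

print(f"before peering change: {peak_before:.0f}% of {US_CAPACITY_MBIT} Mbit")
print(f"after peering change:  {peak_after:.0f}% of {US_CAPACITY_MBIT} Mbit")
```

The 800 Mbit/s peak works out at about 86% of 930 Mbit, matching the figure on the slide; the 175 Mbit/s weekday peak after the peering change is under a fifth of the capacity.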
Monitoring: US Traffic
UKERNA Traffic data Kbit/s. Blue Traffic from US; Maroon Traffic to US
7 Dec 2001 (900 Mbit/s)
29 Jan 2002 (175 Mbit/s)
peak is 88% of total BW 930 Mbit
10 minute averages
10 minute averages
Last 7 days 1 hour averages
Astronomy & Astrophysics Sites
Connectivity to Australia
ICFAMON Plot from DL to Anglo-Australian Observatory for 11th Apr to 24th Apr 2002
Packet loss reasonable
rtt improves ~420 ms to ~300 ms
Variations ~100ms
Connectivity to US
ICFAMON Plots for 11th Apr to 24th Apr 2002
DL to NOAO, Arizona
DL to Goddard GSFC NASA
Connectivity to Hawaii
ICFAMON Plot from DL to The Joint Astronomy Centre for 11th Apr to 24th Apr 2002
Packet loss good
rtt ~210 ms
Variations – queuing
traceroute:
Cross SuperJANET4 to NY OK
Cross Abilene to Seattle OK
Enters uhnet
Stops after 2-3 routers
No connectivity to La Palma
traceroute ends in iac.es network
Tenerife ?
Grid Network Monitoring
Several tools in test – plugged into a coherent structure:
PingER, RIPE one way times, iperf, UDPmon, rTPL, GridFTP, and NWS prediction engine
Continuous tests for last few months to selected sites:
DL Man RL UCL CERN Lyon Bologna SARA NBI SLAC …
The aims of monitoring for the Grid:
to inform Grid applications, via the middleware, of the current status of the network – input for resource broker and scheduling
to identify fault conditions in the operation of the Grid
to understand the instantaneous, day-to-day, and month-by-month behaviour of the network – provide advice on configuration etc.
Network information published in LDAP schema
Will be used by UK GridPP and e-science centres
AstroGrid ?
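One way a resource broker might use published network metrics is to rank candidate sites. The sketch below is purely illustrative: the `LinkMetrics` fields, the loss threshold and the sample numbers are assumptions, not names or values from the PPNCG or DataGrid tools.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass(frozen=True)
class LinkMetrics:
    """Hypothetical snapshot of monitoring data for one remote site."""
    site: str
    rtt_ms: float
    loss_percent: float
    forecast_mbit: float  # predicted throughput, e.g. from a forecaster


def choose_site(candidates: Iterable[LinkMetrics],
                max_loss_percent: float = 1.0) -> Optional[LinkMetrics]:
    """Pick the usable site with the highest forecast throughput.

    Sites above the loss threshold are treated as faulty and skipped;
    ties on throughput are broken by the lower round-trip time.
    """
    usable = [m for m in candidates if m.loss_percent <= max_loss_percent]
    if not usable:
        return None
    return max(usable, key=lambda m: (m.forecast_mbit, -m.rtt_ms))


# Made-up numbers for illustration only.
metrics = [
    LinkMetrics("site-a", rtt_ms=8.0, loss_percent=0.0, forecast_mbit=90.0),
    LinkMetrics("site-b", rtt_ms=25.0, loss_percent=3.0, forecast_mbit=120.0),
    LinkMetrics("site-c", rtt_ms=30.0, loss_percent=0.2, forecast_mbit=60.0),
]
best = choose_site(metrics)
print(best.site if best else "no usable site")
```

With these sample values the lossy site is skipped even though its forecast throughput is highest, which is the kind of fault-aware scheduling decision the monitoring data is meant to support.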
Network Monitoring Architecture
Diagram components:
Grid Apps, GridFTP
Monitoring tools: PingER (RIPE TTB), IperfER, UDPmon, rTPL, NWS etc
Local Network Monitoring: Store & Analysis of Data (Access)
local LDAP Server
Monitor process to push metrics
Backend LDAP script to fetch metrics
Grid Application access via LDAP Schema to:
- monitoring metrics;
- location of monitoring data.
Access to current and historic data and metrics via the Web, i.e. WP7 NM Pages, access to metric forecasts
Robin Tasker
Network Monitoring Components
Diagram layers and labels:
Clients: WEB Display, Predictions, Grid Broker
Web I/f: LDAP, Table, plot
Analysis: LDAP, Table, raw, plot
Scheduler: Cron script, control
Tool: Ping, Netmon, UDPmon, iPerf, Ripe
Ping & UDP throughput MAN-RAL
From 20 Oct 01
PingER rtt (ms)
dl – RAL
1000 byte packet
Forecast
UDPmon Zero packet loss!
UDPmon throughput Mbit/s
man – RAL
300 * 1400 byte frames
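The throughput and loss figures for a burst of UDP frames come down to two small calculations. A minimal sketch, not UDPmon's actual code: the function names and the 40 ms receive time are assumptions, while the 300 frames of 1400 bytes are taken from the slide.

```python
def udp_throughput_mbit(frames_received: int, frame_bytes: int,
                        elapsed_s: float) -> float:
    """User-data throughput in Mbit/s for a burst of UDP frames."""
    if elapsed_s <= 0:
        raise ValueError("elapsed time must be positive")
    return frames_received * frame_bytes * 8 / elapsed_s / 1e6


def udp_loss_percent(frames_sent: int, frames_received: int) -> float:
    """Percentage of frames in the burst that never arrived."""
    if frames_sent <= 0:
        raise ValueError("frames_sent must be positive")
    return 100.0 * (frames_sent - frames_received) / frames_sent


# 300 frames of 1400 bytes as on the slide; the 40 ms receive time is assumed.
rate = udp_throughput_mbit(300, 1400, 0.040)
loss = udp_loss_percent(300, 300)
print(f"{rate:.1f} Mbit/s, {loss:.1f}% loss")
```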
Ping & UDP throughput MAN-CERN
From 20 Oct 01
PingER rtt (ms)
dl – cern
1000 byte packet
Forecast
UDPmon throughput Mbit/s
man – cern
300 * 1400 byte frames
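The forecast traces on these plots come from a prediction engine. NWS is generally described as running several simple forecasters and using whichever has had the lowest error so far; the sketch below is a much-simplified stand-in for that idea, not NWS code. The three predictors and the sample RTT values are assumptions chosen for illustration.

```python
from statistics import mean, median
from typing import Callable, Dict, Sequence, Tuple

Predictor = Callable[[Sequence[float]], float]

# Hypothetical set of simple predictors over the measurement history.
PREDICTORS: Dict[str, Predictor] = {
    "last_value": lambda h: h[-1],
    "mean_of_last_5": lambda h: mean(h[-5:]),
    "median_of_last_5": lambda h: median(h[-5:]),
}


def forecast(history: Sequence[float]) -> Tuple[str, float]:
    """Return (predictor name, next-value forecast).

    Each predictor is replayed over the history, and the one with the
    lowest cumulative absolute error makes the forecast.
    """
    if not history:
        raise ValueError("need at least one measurement")
    errors = {name: 0.0 for name in PREDICTORS}
    for i in range(1, len(history)):
        past = history[:i]
        for name, predict in PREDICTORS.items():
            errors[name] += abs(predict(past) - history[i])
    best = min(errors, key=errors.get)
    return best, PREDICTORS[best](history)


# Made-up RTT samples in ms with one queuing spike.
rtt_history = [20.0, 21.0, 20.0, 80.0, 20.0, 21.0, 20.0]
name, predicted = forecast(rtt_history)
print(name, predicted)
```

A median-based predictor tends to win on traces like this one because a single spike does not drag the forecast away from the typical value.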
iperf TCP & UDP throughput MAN-SARA
From 20 Oct 01
Iperf TCP throughput Mbit/s
ucl – sara
262144 byte buffer
Forecast
UDPmon throughput Mbit/s
man – sara
300 * 1400 byte frames
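The 262144 byte buffer matters because a single TCP stream cannot send more than one window of data per round trip. A rough sketch of that limit, assuming the buffer is the effective window and ignoring loss and slow start; the 20 ms round-trip time is an assumed value, not a measurement from the slides.

```python
def max_tcp_throughput_mbit(window_bytes: int, rtt_ms: float) -> float:
    """Upper bound on single-stream TCP throughput: one window per RTT."""
    if rtt_ms <= 0:
        raise ValueError("rtt must be positive")
    return window_bytes * 8 / (rtt_ms / 1000.0) / 1e6


def window_needed_bytes(target_mbit: float, rtt_ms: float) -> float:
    """Bandwidth-delay product: window needed to sustain a target rate."""
    return target_mbit * 1e6 * (rtt_ms / 1000.0) / 8


# 262144 byte buffer from the slide; the 20 ms RTT is an assumption.
limit = max_tcp_throughput_mbit(262144, 20.0)
needed = window_needed_bytes(100.0, 20.0)
print(f"window-limited rate: {limit:.1f} Mbit/s")
print(f"window for 100 Mbit/s: {needed:.0f} bytes")
```

The bound scales inversely with round-trip time, so the same buffer that is ample within the UK becomes the bottleneck on longer paths.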
iperf & Pinger UK-Bologna
From 20 Oct 01
Iperf throughput
ucl – Bologna
262144 byte buffer
Forecast in green
PingER rtt (ms)
dl – Bologna
1000 byte packet
Forecast
UDPmon Loss
iperf throughput UCL-SARA
From 1 Nov 01 – Geant Operational
Throughput Mbit/s, MAN – SARA
Iperf Throughput Mbit/s, UCL – SARA, 262144 byte buffer
Plot annotations: Geant Enabled, Routing Stable
PingER deployment
Les Cottrell
Measurements from
34 monitors in 14 countries
Over 600 remote hosts
Over 72 countries
Over 3300 monitor-remote site pairs
Measurements go back to Jan-95
Reports on RTT, loss, reachability, jitter, reorders, duplicates …
Countries monitored
Contain 78% of world population
99% of online users of Internet
Lightweight (100bps/host pair)
Very useful for inter-regional and poor links; high performance & Grid sites need more intensive monitoring
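Several of the reported quantities can be derived from one batch of ping samples. A minimal sketch, assuming lost packets are recorded as `None`; the jitter measure here (mean absolute difference between consecutive RTTs) is one simple definition and not necessarily the one PingER uses, and the sample values are made up.

```python
from statistics import mean
from typing import Dict, List, Optional


def summarise_pings(rtts_ms: List[Optional[float]]) -> Dict[str, object]:
    """Summarise one batch of pings; None marks a lost packet."""
    if not rtts_ms:
        raise ValueError("need at least one sample")
    replies = [r for r in rtts_ms if r is not None]
    summary: Dict[str, object] = {
        "loss_percent": 100.0 * (len(rtts_ms) - len(replies)) / len(rtts_ms),
        "reachable": bool(replies),
    }
    if replies:
        summary.update(min_ms=min(replies), avg_ms=mean(replies),
                       max_ms=max(replies))
    if len(replies) >= 2:
        # Simple jitter estimate over consecutive replies.
        diffs = [abs(b - a) for a, b in zip(replies, replies[1:])]
        summary["jitter_ms"] = mean(diffs)
    return summary


# Made-up batch of ten pings with one loss.
samples = [210.0, None, 214.0, 212.0, 210.0, 211.0, 209.0, 213.0, 210.0, 211.0]
stats = summarise_pings(samples)
print(stats)
```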
Losses: World by region, Jan ‘02
Packet loss <1% = good, <2.5% = acceptable, <5% = poor, >5% = bad
Russia, S America bad
Balkans, M East, Africa, S Asia, Caucasus poor

Packet loss (%) by monitored region (rows) and monitor country (columns, number of monitors in brackets):

Monitored       BR    CA    DK    DE    HU    IT    JP    RU    CH    UK    US   Avg
region         (1)   (2)   (1)   (1)   (1)   (3)   (2)   (2)   (1)   (3)  (16)
COM            0.2     -     -     -     -   0.3     -     -     -     -   0.3   0.2
Canada         1.8   1.6   0.3   0.5   9.0   0.3   1.4  21.7   0.7   0.7   0.5   3.5
US             0.4   2.6   0.2   0.3   8.0   0.1   1.4  13.8   0.3   1.3   0.9   2.7
C America        -     -     -     -     -     -     -     -     -     -   0.9   0.9
Australasia      -     -     -     -     -     -     -     -     -   0.8   1.8   1.3
E Asia         1.2   3.5   1.0   1.1   9.0   0.9   2.0   5.2   1.5   1.4   1.5   2.6
Europe         0.4   5.6   0.3   0.5   5.4   0.4   1.3  15.5   1.1   1.0   1.0   2.9
NET            1.7   6.2   1.0   1.3   8.0   1.6   3.6  21.9   0.7   0.8   0.9   4.3
FSU              -   4.5     -   0.5   9.8   0.5   1.6  11.2   4.3   1.2   2.0   4.0
Balkans          -     -     -     -     -     -     -     -     -     -   3.8   3.8
Mid East         -   4.6   1.4   3.0   8.5   2.8   3.2  11.8   2.0   2.5   2.1   4.2
Africa           -   5.8     -   1.5  12.0   1.2   4.2  11.9   2.0   1.9   2.5   4.8
Baltics          -   5.3   0.8   2.3   7.7   2.2   3.5  10.8   4.8   2.1   3.9   4.3
S Asia         1.6   7.3   0.1   3.1   9.2   3.0   3.9  17.9   1.5   3.1   3.0   4.9
Caucasus         -     -     -     -     -     -     -     -     -     -   3.2   3.2
S America     24.1  11.3   0.6   0.9   6.7  12.9   7.7  23.0   9.3   1.1   6.6   9.5
Russia        35.9  24.1  22.2  13.4  23.8  21.7  13.6   0.7   8.7  24.1  12.7  18.3
Avg            7.5   6.9   2.8   2.4   9.8   3.7   3.9  13.8   3.1   3.2   2.8   4.4
Pairs           64   144    54    67    70   203   190   114   209   192  1990     -

Avg (WEU+NA+JP) packet loss (%) and number of pairs by monitored region:

Region         Avg   Pairs
COM           0.27      23
Canada        0.74     126
US            0.88    2149
C America     0.89      19
Australasia   1.30      18
E Asia        1.61     215
Europe        1.38     852
NET           2.00      85
FSU           2.09      48
Balkans       3.83     109
Mid East      2.70      57
Africa        2.72      45
Baltics       3.12      67
S Asia        3.12      97
Caucasus      3.22      19
S America     6.30     203
Russia       17.57      91
Avg           3.16       -
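The loss quality bands quoted with this table translate directly into a small lookup. A minimal sketch: the function name is illustrative, and treating a loss of exactly 5% as bad is an assumption, since the slide leaves that boundary open.

```python
def loss_quality(loss_percent: float) -> str:
    """Map a packet loss percentage to the quality band from the slide."""
    if loss_percent < 0:
        raise ValueError("loss cannot be negative")
    if loss_percent < 1.0:
        return "good"
    if loss_percent < 2.5:
        return "acceptable"
    if loss_percent < 5.0:
        return "poor"
    return "bad"  # assumption: exactly 5% counts as bad


# Regional averages taken from the table.
regional_avg = {
    "Canada": 0.74,
    "E Asia": 1.61,
    "Balkans": 3.83,
    "S America": 6.30,
    "Russia": 17.57,
}
for region, loss in regional_avg.items():
    print(f"{region:10s} {loss:6.2f}%  {loss_quality(loss)}")
```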
Quality improvement seen from SLAC & NASA
NASA results courtesy of Andy Germain, NASA GSFC
Iperf mem-mem vs file copy disk to disk
Plot against Iperf TCP Mbits/s, with regions labelled Fast Ethernet, OC3 and Disk limited
Over 60 Mbits/s iperf >> file copy
QoS: Terms and Concepts
Diagram stages: Identify & Classify, Police (Test, Fail, Discard), Sort, Configurable Queues, Dequeue
Identifying frames – marking / setting IP precedence bits
Sorting frames into queues
Selecting which frame to send
Action taken when a queue is full
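The four concepts above can be tied together in a toy model of one output port. This is a sketch under stated assumptions, not any router's implementation: the class names, the mapping from precedence bits to classes, the byte budget used for policing, strict-priority dequeuing and tail drop are all choices made for illustration.

```python
from collections import deque
from dataclasses import dataclass
from typing import Deque, Dict, List, Optional


@dataclass
class Frame:
    size_bytes: int
    precedence: int  # IP precedence bits, 0-7


class QosPort:
    """Toy output port: classify, police, sort into queues, dequeue."""

    CLASSES = ["premium", "best_effort", "scavenger"]  # highest priority first

    def __init__(self, queue_limit: int = 4, premium_budget_bytes: int = 3000):
        self.queues: Dict[str, Deque[Frame]] = {c: deque() for c in self.CLASSES}
        self.queue_limit = queue_limit
        self.premium_budget = premium_budget_bytes  # policing allowance
        self.discarded: List[Frame] = []

    def classify(self, frame: Frame) -> str:
        # Assumed mapping from precedence bits to service class.
        if frame.precedence >= 5:
            return "premium"
        if frame.precedence == 1:
            return "scavenger"
        return "best_effort"

    def enqueue(self, frame: Frame) -> bool:
        cls = self.classify(frame)
        # Police: premium traffic beyond its allowance fails the test.
        over_budget = cls == "premium" and frame.size_bytes > self.premium_budget
        # Action when a queue is full: tail drop.
        queue_full = len(self.queues[cls]) >= self.queue_limit
        if over_budget or queue_full:
            self.discarded.append(frame)
            return False
        if cls == "premium":
            self.premium_budget -= frame.size_bytes
        self.queues[cls].append(frame)
        return True

    def dequeue(self) -> Optional[Frame]:
        # Strict priority: always serve the highest non-empty class.
        for cls in self.CLASSES:
            if self.queues[cls]:
                return self.queues[cls].popleft()
        return None


port = QosPort()
for prec in (0, 5, 1, 5, 5):
    port.enqueue(Frame(size_bytes=1500, precedence=prec))

sent_order = []
frame = port.dequeue()
while frame is not None:
    sent_order.append(port.classify(frame))
    frame = port.dequeue()
print(sent_order, "discarded:", len(port.discarded))
```

In this run the third premium frame exceeds the policing budget and is discarded, and the remaining frames leave in priority order regardless of arrival order.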
QoS: What Next?
Dante propose the following services:
IP Premium (EF)
(AF) difficult to define in a way that suits most NRNs
Best Efforts
Scavenger “Less than best efforts”
UKERNA ran a Think Tank to Study QoS requirements in the UK
MB-NG Network development project to test MPLS and QoS
SuperJANET is expected to offer similar services to Dante
Applications need end to end QoS – so we need to cross:
LAN
SuperJANET4
Dante
Remote NRN
Remote LAN
More Information Some URLs
PPNCG Home page with Stop Press:
http://ppncg.rl.ac.uk/
PPNCG Page for monitoring Astronomy & Astrophysics Sites
http://icfamon.dl.ac.uk/ppncg/astronomy.html
and e-mail:
[email protected]
DataGrid WP7 Networking:
http://www.gridpp.ac.uk/wp7/index.html
IEPM PingER home site:
http://www-iepm.slac.stanford.edu/
IEPM-BW site:
http://www-iepm.slac.stanford.edu/bw