slides - Columbia Network Research Center

Download Report

Transcript slides - Columbia Network Research Center

Deriving Traffic Demands for Operational
IP Networks: Methodology and Experience
Anja Feldmann*, Albert Greenberg, Carsten Lund,
Nick Reingold, Jennifer Rexford, and Fred True
Internet and Networking Systems Research Lab
AT&T Labs-Research; Florham Park, NJ
*University of Saarbruecken
1
Traffic Engineering For Operational IP Networks

Improve user performance and network efficiency by tuning router
configuration to the prevailing traffic demands.
– Why?
– Time Scale?
AS 7018
(AT&T)*
customers or
peers
backbone
*synthetic loads
customers or
peers
2
Traffic Engineering Stack
 Topology
of the ISP backbone
– Connectivity and capacity of routers and links
 Traffic
demands
– Expected/offered load between points in the network
 Routing
configuration
– Tunable rules for selecting a path for each flow
 Performance
objective
– Balanced load, low latency, service level agreements …
 Optimization
procedure
– Given the topology and the traffic demands in an IP network,
tune routes to optimize a particular performance objective
3
Traffic Demands
 How
to model the traffic demands?
– Know where the traffic is coming from and going to
– Support what-if questions about topology and routing changes
– Handle the large fraction of traffic crossing multiple domains
 How
to populate the demand model?
– Typical measurements show only the impact of traffic demands
» Active probing of delay, loss, and throughput between hosts
» Passive monitoring of link utilization and packet loss
– Need network-wide direct measurements of traffic demands
 How
to characterize the traffic dynamics?
– User behavior, time-of-day effects, and new applications
– Topology and routing changes within or outside your network
4
Outline
 Sound
traffic model for traffic engineering of
operational IP networks
 Methodology for populating the model
 Results
 Conclusions
5
Outline
 Sound
traffic model for traffic engineering of
operational IP networks
– Point to Multipoint Model
 Methodology
for populating the model
 Results
 Conclusions
6
Traffic Demands
Big Internet
Web Site
User Site
7
Traffic Demands
Interdomain Traffic
AS 2
AS 3, U
AS 3
Web Site
User Site
AS 3, U
U
AS 1
AS 4, AS 3, U
AS 3, U
AS 4
•What path will be taken between AS’s to get to the User site?
•Next: What path will be taken within an AS to get to the User site?
8
Traffic Demands
Zoom in on one AS
OUT1
25
110
110
User Site
Web Site
300
200
75
300
110
10
OUT2
50 110
IN
OUT3
Change in internal routing configuration changes flow exit point!
9
Point-to-Multipoint Demand Model
 Definition: V(in,
{out}, t)
– Entry link (in)
– Set of possible exit links ({out})
– Time period (t)
– Volume of traffic (V(in,{out},t))
 Avoids
the “coupling” problem with traditional point-topoint (input-link to output-link) models:
Pt to Pt Demand Model
Pt to Pt Demand Model
Traffic Engineering
Traffic Engineering
Improved Routing
Improved Routing
10
Outline
 Sound
traffic model for traffic engineering of
operational IP networks
 Methodology for populating the model
– Ideal
– Adapted to focus on interdomain traffic and to meet
practical constraints in an operational, commercial IP
network
 Results
 Conclusions
11
Ideal Measurement Methodology
 Measure
traffic where it enters the network
– Input link, destination address, # bytes, and time
– Flow-level measurement (Cisco NetFlow)
 Determine
where traffic can leave the network
– Set of egress links associated with each network address
(forwarding tables)
 Compute
traffic demands
– Associate each measurement with a set of egress links
12
Adapted Measurement Methodology
Interdomain Focus
 A large
fraction of the traffic is interdomain
 Interdomain traffic is easiest to capture
– Large number of diverse access links to customers
– Small number of high speed links to peers
 Practical
solution
– Flow level measurements at peering links (both
directions!)
– Reachability information from all routers
13
Inbound and Outbound Flows on Peering Links
Outbound
Peers
Customers
Inbound
Note: Ideal methodology applies for inbound flows.
14
Most Challenging Part:
Inferring Ingress Links for Outbound Flows
Example
Outbound traffic flow
measured at peering link
output
destination
? input
Customers
? input
Use Routing simulation to trace back to the ingress links!
15
Computing the Demands
Forwarding
Tables
 Data
Configuration
Files
NetFlow
SNMP
NETWORK
– Large, diverse, lossy
– Collected at slightly different, overlapping time intervals, across
the network.
– Subject to network and operational dynamics. Anomalies
explained and fixed via understanding of these dynamics
 Algorithms,
details and anecdotes in paper!
16
Outline
 Sound
traffic model for traffic engineering of
operational IP networks
 Methodology for populating the model
 Results
– Effectiveness of measurement methodology
– Traffic characteristics
 Conclusions
17
Experience with Populating the Model

Largely successful
– 98% of all traffic (bytes) associated with a set of egress links
– 95-99% of traffic consistent with an OSPF simulator

Disambiguating outbound traffic
– 67% of traffic associated with a single ingress link
– 33% of traffic split across multiple ingress (typically, same city!)

Inbound and transit traffic (uses input measurement)
– Results are good

Outbound traffic (uses input disambiguation)
– Results may be good enough for traffic engineering, but there are limitations
– To improve results, may want to measure at selected or sampled customer links;
e.g., links to email, hosting or data centers.
18
Proportion of Traffic in Top Demands (Log Scale)
Zipf-like distribution. Relatively small number of heavy demands dominate.
19
Time-of-Day Effects (San Francisco)
midnight EST
midnight EST
Heavy demands at same site may show different time of day behavior
20
Discussion
 Distribution
of traffic volume across demands
– Small number of heavy demands (Zipf’s Law!)
– Optimize routing based on the heavy demands
– Measure a small fraction of the traffic (sample)
– Watch out for changes in load and egress links
 Time-of-day
fluctuations in traffic volumes
– U.S. business, U.S. residential, & International traffic
– Depends on the time-of-day for human end-point(s)
– Reoptimize the routes a few times a day (three?)
 Stability?
– Yes and No
21
Outline
 Sound
traffic model for traffic engineering of
operational IP networks
 Methodology for populating the model
 Results
 Conclusions
– Related work
– Future work
22
Related Work
 Bigger
picture
– Topology/configuration (technical report)
» “IP network configuration for traffic engineering”
– Routing model (IEEE Network, March/April 2000)
» “Traffic engineering for IP networks”
– Route optimization (INFOCOM’00)
» “Internet traffic engineering by optimizing OSPF weights”
 Populating
point-to-point demand models
– Direct observation of MPLS MIBs (GlobalCenter)
– Inference from per-link statistics (Berkeley/Bell-Labs)
– Direct observation via trajectory sampling (next talk!)
23
BRAVO
(Backbone Routing, Analysis, Visualization, and Optimization)

Data model
– Physical level, IP level, router-complex level
– Traffic demands, router attributes, link attributes

Routing model
– Shortest-path routing, OSPF tie-breaking
– Multi-homed customers, inter-domain routing
– Book-keeping to accumulate load on each link

Visualization environment
– Coloring/sizing to illustrate link and node statistics
– Querying to subselect links and nodes
– Histograms of statistics
– What-if experiments with new routing configurations
24
Traffic Flow Through Backbone
Source node: public peering link in New York (Sprint NAP)
Destination nodes: WorldNet access routers
Color/size of node: proportional to traffic to this router (high to low)
Color/size of link: proportional to traffic carried (high to low)
25
Future Work
 Analysis
of stability of the measured demands
 Online collection of topology, reachability, & traffic
data
 Modeling the selection of the ingress link (e.g., use
of multi-exit descriptors in BGP)
 Tuning BGP policies to the prevailing traffic
demands
 Interactions of Traffic Engineering with other
resource allocation schemes (TCP, overlay networks
for content delivery)
26
Backup
27
AS 7018
28
Identifying Where the Traffic Can Leave
 Traffic
flows
– Each flow has a dest IP address (e.g., 12.34.156.5)
– Each address belongs to a prefix (e.g., 12.34.156.0/24)
 Forwarding
tables
– Each router has a table to forward a packet to “next hop”
– Forwarding table maps a prefix to a “next hop” link
 Process
– Dump the forwarding table from each edge router
– Identify entries where the “next hop” is an egress link
– Identify set all egress links associated with a prefix
29
Measuring Only at Peering Links
 Why
measure only at peering links?
– Measurement support directly in the interface cards
– Small number of routers (lower management overhead)
– Less frequent changes/additions to the network
– Smaller amount of measurement data
 Why
is this enough?
– Large majority of traffic is interdomain
– Measurement enabled in both directions (in and out)
– Inference of ingress links for traffic from customers
30
Full Classification of Traffic Types at Peering Links
Outbound
Peers
Customers
Internal
Transit
Inbound
31
Flows Leaving at Peer Links
 Single-hop
transit
– Flow enters and leaves the network at the same router
– Keep the single flow record measured at ingress point
 Multi-hop
transit
– Flow measured twice as it enters and leaves the network
– Avoid double counting by omitting second flow record
– Discard flow record if source does not match a customer
 Outbound
– Flow measured only as it leaves the network
– Keep flow record if source address matches a customer
– Identify ingress link(s) that could have sent the traffic
32
Results: Populating the Model
Ingress
Inbound
Egress
Effectiveness
Netflow Reachability
Good
Netflow Netflow &
Reachability
Outbound Packet Netflow &
filters Reachability
Internal
X
Reachability
Good
Transit
Data Used
Pretty
Good
X
33