Transcript Π f(Xij)
A Survey on Physical Network
Topology Estimation
October 21, 2005
Chikayama-Taura Lab.
Tatsuya Shirai
2005/10/21
1
Background
Progress of parallel processing
technologies
Costs of parallel processing
Cost of computation
Cost of communication
Clusters, Grid Environments
Cost of communication becomes bigger
with larger scale
2005/10/21
2
Allocation Policy
Needs to closely allocate hosts frequently
communicating with each other
With multiple clusters, allocate within clusters
In single cluster, allocate to use the same
switches
2005/10/21
3
Difficulty of estimate the cost
of communication
Shared link
Each hosts can solely communicate at
100Mbps
But all hosts can communicate at less than
50Mbps at a time
All hosts need to work together to know this
relation
1
2005/10/21
2
3
100Mbps
4
4
Desired Functions
Ideally,
Present network information to users
Configure allocation automatically
Needs to analyze network topology
2005/10/21
5
Other applications
Network trouble shooting
Discovery of bottlenecks
Research on routing protocol
Simplification of local network
etc…
2005/10/21
6
Agenda
Background
Network Topology
End-to-End Measurement
Researches
Conclusion
2005/10/21
7
Agenda
Background
Network Topology
End-to-End Measurement
Researches
Conclusion
2005/10/21
8
Network Topology
A structure of a network
node
host
router
switch, hub
link
2005/10/21
9
IP Layer Topology
Structure of network
node
host
router
switch, hub
Link
Difficulty in collecting
information of LAN
structure
2005/10/21
10
Protocol-Based Algorithms
Protocol
[Richard et al, ‘04]
, etc.
Hardware-dependent
SNMP [Yuri et al, ’01] ,
Customized Protocol
Some hubs or switches doesn’t support
required protocols.
Deterministic estimation
2005/10/21
11
End-to-End Measurement
Metric
Hardware independent
Packet loss rates [Bestavros et al, ‘02]
Delays [Coates et al, ‘01]
Always possible to measure topologies of
hosts who can communicate with root
Probabilistic
2005/10/21
12
Classification
IP layer
Protocol
based
End-to-End
Measurement
Nodes
Hosts,
routers
Hosts,
routers,
switches
Hosts,
routers,
switches,
hubs, …
Hardware
dependency
dependent
dependent
independent
Estimation
Deterministic Deterministic Probabilistic
2005/10/21
13
Agenda
Background
Network Topology
End-to-End Measurement
Researches
Conclusion
2005/10/21
14
End-to-End Measurement
Assume topologies are Tree-structured
Only one route exists between two hosts.
Does not be changed while measuring
Estimate branches of routes connecting
hosts
2005/10/21
15
Estimated topology using
End-to-End Measurement
unused
non-branching
branching
actual topology
2005/10/21
estimated topology
16
End-to-End Measurement
Assume topologies are Tree-structured
Only one route exists between two hosts.
Does not be changed while measuring
Estimate branches of routes connecting
hosts
Variance in the measurements
2005/10/21
17
Variance of measurements
With a small variance, estimation is
deterministic
With a large variance, estimation is
probabilistic
Use statistics
Search the topology that fits the most with
measurement
2005/10/21
18
End-to-End Measurement
Assume topologies are Tree-structured
Only one route exists between two hosts.
Does not be changed while measuring
Estimate branches of routes connecting hosts
Variance in the measurements
Procedures consist of 2 steps
1. Measurement
2. Estimation
2005/10/21
19
Agenda
Background
Network Topology
End-to-End Measurement
Researches
Conclusion
2005/10/21
20
Researches
Maximum Likelihood Network Topology
Identification from edge-based unicast
measurements [Coates et al. ’01 SIGMETRICS]
Metric : Delay
Estimation: Maximum Likelihood Estimation
2005/10/21
21
Measurement –Sandwich Probe –
1
Measure delay of a link
shared 2 hosts (e.g. 2 and 4) d
1. Send a small packet to 4
2. After constant time, send a
large packet to 2
3. Without break, send a small
packet to 4 again
2
3
4
d+⊿d
2005/10/21
22
Measurement
The arrival of the second
packet is delayed because
the large packet is slower
Assume that all branched
nodes are not store &
forward
Can measure delay (or
bandwidth) of shared link
X42 = μ1+d
X32 = μ1+μ2+d
2005/10/21
1
d
μ1
μ2
2
3
4
X32
X42
23
Estimation
Assume delay of each shared link obeys
Gaussian f(x)
Search the topology best fitting the
measurements
⇒ Maximum Likelihood Estimation
(MLE)
2005/10/21
24
Likelihood
The value of “fitting”
1
μ1
Set particular topology and
delay as a parameter
Likelihood = Π f(Xij)
3
2
X32= μ1+μ2
2005/10/21
μ1
4
X42= μ1
μ1
25
Search Space of MLE
Give many possible topologies to search
for MLE
Too wide to compute all topologies
Premise
Similar topologies have similar likelihoods
⇒ Markov Chain Monte Carlo (MCMC)
(e.g. Hill Climbing)
2005/10/21
26
Similar Topologies –Step–
Birth step
Insert a node
1
2
3
2005/10/21
Death step
1
4
2
3
Delete a node
1
4
2
3
1
4
2
3
4
27
Procedure of MLE
1. Give a topology at random
2. Make a small modification
3. If the new topology has greater likelihood,
adopt new topology
4. If a likelihood is at local maximum,
return to procedure 1
5. Otherwise goto2
Can get a great likelihood topology in feasible
time
2005/10/21
28
Experiment
Experimental Setup
Measurement
The root host and ten other hosts
2
Sent 8600 probes (O(n ))
For 8 minutes
MLE
For 30-120 seconds
2005/10/21
29
The estimated
topology using
traceroute
The estimated
topology using
Coates’ method
2005/10/21
30
Agenda
Background
Network Topology
End-to-End Measurement
Researches
Conclusion
2005/10/21
31
Conclusion
Conclusion
I Indicated importance of topology
estimation and introduced one methods
with End-to-End measurement
Future Works
Topology Estimation within LAN of many
nodes
2005/10/21
32