Matrix Analytic Methods
Download
Report
Transcript Matrix Analytic Methods
Matrix Analytic Methods Some Real Life Applications
David Lucantoni
DLT Consulting, L.L.C
www.DLTconsulting.com
Queueing Theory and Network Applications, Korea University, 2011
Objectives of this Talk
Review Matrix Analytic Methods pioneered by Marcel Neuts
and others
Present real life examples where Matrix Analytic Methods
have been used successfully in industry
Congestion due to Internet Dial Up
Application to Common Channel Signaling (SS7)
Modeling WiFi Hotspot Traffic
Multiplexing bursty traffic in an ATM or IP network
Call Acceptance for a VoIP System
Transient Results and applications
Queueing Theory and Network Applications, Korea University, 2011
Objectives of this Talk (Cont’d)
No new mathematical analyses
A detailed description of any of these models would take the
whole talk
Will only give general description of model parameters
Detailed definitions and analyses are widely available in the literature
Will give some matrix results along with the corresponding scalar
(exponential or Poisson) results
Queueing Theory and Network Applications, Korea University, 2011
Analysis of Matrix Analytic Models
Models are analyzed by purely probabilistic arguments (e.g.,
Markov Renewal Theory)
Explicit expressions lead to very stable numerical algorithms for
computation of performance metrics
Matrix analytic methods avoid
the complex analysis used in previous approaches
the necessity of numerically computing the roots of an analytic
function
Numerical instability when roots are close to each other or multiple
Queueing Theory and Network Applications, Korea University, 2011
Phase-Type (PH) Distributions
Will only consider continuous-time distributions here
Phase-type distributions contain many well-known and
popular distributions
Exponential distribution, (m=1)
Erlang distribution
Hyper-exponential distribution
Queueing Theory and Network Applications, Korea University, 2011
Phase-Type Distributions (Cont’d)
Defined as the time till absorption in an (m+1)-state Markov
process with 1 absorbing state
Let T be the m×m matrix of infinitesimal rates between
transient states, α, the vector of probabilities of starting in a
particular transient state and T0 be the rates into the
absorbing state
Let X be a phase-type random variable and F(x)=P(X≤x)
Then
F(x) = 1- e- l x (exponential) F(x) = 1- a eTxT 0 (Phase - type)
Queueing Theory and Network Applications, Korea University, 2011
Assuring Emergency Services Access (e.g.,
911) in the presence of long holding times
due to Internet dial-up calls
The Problem
No dial-tone condition - “Customers hear thin air.”
VERY SERIOUS – e.g., 90/day life threatening emergency calls .
Priorities non-implementable
Preliminary suspicion on maintenance activities.
Could it be congestion ? Red flag – internet usage !
Model solved using a finite quasi-birth-and-death process (QBD)
[Ramaswami, et al., 2005]
Queueing Theory and Network Applications, Korea University, 2011
Cable Telephony
TDM/T-1
ADM
HDT
GR-303
Backbone
Transport OC-48+
SS7
DACS
GR-303
ADM
DWDM
Proprietary
Protocol
CLASS 5
SWITCH
CMTS
DWDM
Master Headend or
Backbone
PSTN
Primary Hub
(100K - 200K HHP) Each
Network
Interface
Unit
EMS
Systems
DOCSIS
Protocol
DHCP/DNS
TFTP
Servers
Cable
Modem
Fiber Node
Transport
Central
Office
Zone 4
Zone 5
16 Per Secondary
Hub
Zone 1
Zone 2
Zone 3
Queueing Theory and Network Applications, Korea University, 2011
Effect of Long Holding Times (Cont’d)
Preliminary Analysis
problem in evening hours 7 PM – 11 PM
no discernible spatial pattern
Push-backs
Engineering followed “standard” procedures
Only 5-8% of calls are internet dial-ups
Field measurements “do not support” congestion hypothesis
Queueing Theory and Network Applications, Korea University, 2011
Phase Type Model
(using the EM-algorithm) ; fit uses ~4M data points
Phase Type Fit
Probability Observed
0.1
5.4
0.2
12
0.3
21
0.4
32.4
0.5
48
0.6
72.6
0.7
120.6
0.8
232.8
0.9
601.8
0.95
1237.8
0.99
3952.2
0.995
6074.4
Fitted
5.8
12.7
21.2
32.2
47.7
72.2
120.2
235.9
597.6
1268.0
4035.2
7168.4
Queueing Theory and Network Applications, Korea University, 2011
The Length Biasing Effect
Remaining holding time distribution: [1-F(x)] / μ
RESIDUAL DISTRIBUTION
h(x)=[1-F(x)]/μ
Probability Observed Residual
0.1
5.4
40.8
0.2
12
116.1
0.3
21
238.0
0.4
32.4
423.7
0.5
48
703.0
0.6
72.6
1124.6
0.7
120.6
1809.7
0.8
232.8
3285.3
0.9
601.8
7028.2
0.95
1237.8 11073.8
0.99
3952.2 20489.2
Median
Mean
48
297
703.0
2367.0
Queueing Theory and Network Applications, Korea University, 2011
Empirical Validation
Empirical data on residuals (~11 K observations) vs. Model
Queueing Theory and Network Applications, Korea University, 2011
Impact of Long Holding Times
Insensitivity relates only to blocking over an infinite horizon.
Heavy tails
PERSISTENCE of congestion.
So, short term performance is quite different.
“Standard” procedures do not work !
Bad periods compensated by long very good periods. Good
situation for control !!! (See the paper and patents for
possible controls.)
Queueing Theory and Network Applications, Korea University, 2011
Several Other Real Life
Applications
Excessive Link Oscillations in the Common Channel
Signaling (SS7) Network [Ramaswami and Wang,
1993]
Solved using Phase-type distributions
Modeling and characterization of large-scale Wi-Fi
traffic in public hot-spots [Ghosh, et al., 2011]
Solved using an M/G/∞ where G is ln PH distribution
Queueing Theory and Network Applications, Korea University, 2011
The Batch Markovian Arrival Process
(BMAP)
Natural generalization of the Poisson process
Contains many well-known arrival processes (in all cases,
correlated batch sizes are easily handled)
Poisson process (m=1)
PH renewal process
Markov modulated Poisson process (MMPP)
Superposition of BMAPs is again a BMAP
Useful for modeling arrivals to an internal node in a network
Queueing Theory and Network Applications, Korea University, 2011
The Batch Markovian Arrival Process (Con’t)
Consider an m-state Markov process and assume that at each
transition of this process, there is a probability of a batch arrival
that depends both on the state before and after the transition
Let Dn, n≥1, denote the infinitesimal rate of a batch arrival of size
n, keeping track of the underlying states before and after the
transition
The generating function D(z) is given by
D(z) = å n=0 Dn z n
¥
For Poisson : D(z) = - l + l z
Queueing Theory and Network Applications, Korea University, 2011
BMAP/G/1 Queue
The BMAP/G/1 queue is extremely useful for modeling the performance of
broadband packet networks
Traffic is bursty
Service times are NOT phase-type
Asynchronous Transfer Mode (ATM)
Fixed length cells result in deterministic service times
IP networks
Finite packet sizes finite mixture of deterministic service times
BMAP models very bursty traffic and is closed under superposition, i.e.,
modeling the output of nodes entering another node
Queueing Theory and Network Applications, Korea University, 2011
Busy Period of the BMAP/G/1 Queue
Let h(s) be the Laplace-Stieltjes transform of the
service time distribution
Let G(z) be the probability generating function of the
number of customers served during a busy period
(keeping track of the arrival phase). Then we have
G(z) = zh(l - lG(z))
(M / G /1)
G(z) = zh(-D[G(z)]) (BMAP / G /1)
Queueing Theory and Network Applications, Korea University, 2011
Busy Period of the BMAP/G/1 Queue
(Cont’d)
Define G to be G(1) and g to be the stationary
probability vector of G
It is easy to show that g is also the stationary
probability vector of the phase of the arrival process
during idle periods
For M/G/1, the probability that the system is empty is 1ρ; for BMAP/G/1, the probability that the system is
empty and that the arrival process is in phase j is the jth
element of (1-ρ)g
Queueing Theory and Network Applications, Korea University, 2011
Virtual Waiting Time of the
BMAP/G/1 Queue
Let W(s) be the Laplace-Stieltjes transform of the
virtual waiting time distribution (keeping track of the
arrival phase)
Let g be the stationary probability vector of the
stochastic matrix G
Then we have
s(1- r )
W (s) =
s - l + l h(s)
M / G /1
W (s) = s(1- r )g[sI + D(h(s))]-1
BMAP / G /1
Queueing Theory and Network Applications, Korea University, 2011
Stationary Queue Length at Departures
Let Xn be the stationary probability that there are n
customers in the queue at departures
Let X(z) =
n
,
X
z
å 0 n then
¥
(1- r )(z -1)A(z)
X(z) =
z - A(z)
M / G /1
X(z) = (1- r )l -1gD(z)A(z)[zI - A(z)]-1
BMAP / G /1
Queueing Theory and Network Applications, Korea University, 2011
Computing these Distributions
We compute these distributions by numerically
inverting the transforms [Choudhury, Lucantoni and
Whitt, 1994]
These algorithms are for inverting multi-dimensional
transforms which we use later for computing transient
distributions
Queueing Theory and Network Applications, Korea University, 2011
Known Results for the BMAP/G/1
Queue
Stationary distributions [Lucantoni, 1993]
Queue length distribution at arrivals, departures, arbitrary time
Waiting time distributions
Transient distributions (given the appropriate initial conditions)
[Lucantoni, Choudhury, and Whitt,1994], [Lucantoni, 1998]
P(system empty at time t)
Workload at time t
Queue length at time t
Delay of nth arrival
Queue length at nth departure
P(nth departure occurs less than or equal to time x)
Queueing Theory and Network Applications, Korea University, 2011
Modeling the Input to an Interior
Network Node
Consider the output from several nodes which all go to
the same node
Ideal candidate for the MAP/G/1 queue
Queueing Theory and Network Applications, Korea University, 2011
Multiplexing Bursty Traffic in an ATM
Network
ATM requirement is that the probability of
blocking must be less than 10-9
Tails of distributions are the most
important performance measure
Superposition of 64 on-off sources each
modeled as a 2-state MMPP
Exact Analysis
Far from Poisson
Exponential tail does not become relevant
until blocking is 10-40
Queueing Theory and Network Applications, Korea University, 2011
Advantage of MAP/G/1 Approach over MatrixGeometric Approach When service time has
Many Phases
We computed the tail of the waiting time distribution in an
MMPP64/E1024/1 queue by an by a distribution that is close to an
E1024 and matches the first three moments of the deterministic
distribution
The matrix-geometric approach would require using matrices of
order 65,536
The MAP/G/1 approach looks at the process embedded at
departures; since we don’t have to keep track of the phase of
service, the matrices are only of order 64
Queueing Theory and Network Applications, Korea University, 2011
Advantage of MAP/G/1 (Cont’d)
G satisfies
¥
G=ò e
0
D[G]x
dH(x)
When H is E1024, the integral on the right hand side is
computed by inverting a 64×64 matrix 10 times
Much easier than computing the matrix exponential
Queueing Theory and Network Applications, Korea University, 2011
IP Data Traffic is Bursty
Sample Measurements
4500
from a previous
assignment
Highly bursty
Mean = 720 pkts/sec
4000
3500
Packets
Sampled every second
Packet Counts
5000
3000
2500
2000
1500
1000
500
0
0
1
2
Hours
Queueing Theory and Network Applications, Korea University, 2011
3
4
Simulated Poisson Traffic
Packet Counts - Poisson
800
780
Poisson traffic looks
760
bursty
Packets
740
720
700
However, note the scale
of the y-axis
680
660
640
0
5
10
Minutes
Queueing Theory and Network Applications, Korea University, 2011
15
Poisson Process Compared to
Data
Packet Counts - Poisson
3000
3000
2500
2500
2000
1500
2000
1500
1000
1000
500
500
0
0
5
10
Minutes
Packet Counts - Measurements
3500
Packets
Packets
3500
15
0
0
5
10
Minutes
Queueing Theory and Network Applications, Korea University, 2011
15
Fit an D-MMPP to Data
[Heyman, Lucantoni,
2003]
Choose λ1 so that the
peak is at the 95th
percentile of a Poisson
distribution, i.e.,
l1 + 2 l1 = peak
Then
l1 =
(
)
1+ peak -1
2
Queueing Theory and Network Applications, Korea University, 2011
Fit an MMPP to Data (Cont’d)
Lower bound of data covered by λ1 is
l1 - 2 l1
Let this be the upper bound covered by λ2, i.e.,
l2 + 2 l2 = l1 - 2 l1
Then
l2 =
(
l1 - 2
)
2
Continue…
Queueing Theory and Network Applications, Korea University, 2011
Fit an MMPP to Data (Cont’d)
Compute the transition probabilities by
number _ of _ transitions _ from _ i _ to _ j
pij =
number _ of _ transitions _ out _ of _ i
For performance computations convert to a continuous
time MMPP (see paper)
Queueing Theory and Network Applications, Korea University, 2011
Fit an MMPP to Data
Q-Q plot: plots quantiles
Should be linear if both
come from same
distribution
Very good fit
Queueing Theory and Network Applications, Korea University, 2011
Simulated MMPP Traffic
Packet Counts - MMPP
3000
3000
2500
2500
2000
1500
2000
1500
1000
1000
500
500
0
0
5
10
Minutes
Packet Counts - Measurements
3500
Packets
Packets
3500
15
0
0
5
10
Minutes
Queueing Theory and Network Applications, Korea University, 2011
15
Another Example
Simulated MMPP Packet Counts
500
500
400
400
300
300
200
200
100
100
0
0
5
10
Minutes
Measurement Packet Counts
600
Packets
Packets
600
15
0
0
5
10
Minutes
Queueing Theory and Network Applications, Korea University, 2011
15
Call Acceptance for a VoIP System
A startup company was offering VoIP service to clients by buying a fixed
bandwidth virtual circuit (“fat pipe”) through the Internet
Wanted to know how big the fat pipe should be to handle a certain
number of voice calls and still meet the Quality of Service (QoS) e.g., tail
of the delay distribution
Measurements were taken for IP voice calls for different codecs, e.g.,
G.723, G.729, etc., and different measurement intervals
We developed a program where the user inputs the file containing the
measurements, the measurement interval and the target delay
Queueing Theory and Network Applications, Korea University, 2011
Call Acceptance for a VoIP System (Cont’d)
We fit an MMPP to the measurement data and by solving
the delay using an MMPP/D/1 queue
If the computed QoS is greater or less than the targeted
QoS, we adjust the service rate and re-compute; continue
adjusting the rate using a binary search until the computed
QoS is within a given tolerance
The program then outputs the size of the required fat pipe
and the required buffer size
Queueing Theory and Network Applications, Korea University, 2011
Notes on Transient Results
Generalized every result derived by Takács for the
transient M/G/1 queue [Takács, 1962]
All results are direct matrix analogues of the scalar
results
Using only probabilistic arguments, resulted in
expressions that can be computed numerically without
computing the roots of an analytic equation
Queueing Theory and Network Applications, Korea University, 2011
Sample Transient Result (emptiness function)
Consider a MAP/G/1 queue (similar results hold for the
BMAP/G/1 queue)
Let Px0(t) be an m×m matrix where the (i,j) entry is the
probability that the system is empty at time t with the arrival
process is in phase j given the at time 0 the work in the
system is x and the arrival process is in phase i
Let px0(s) be the Laplace-Stieltjes transform of Px0(t)
Queueing Theory and Network Applications, Korea University, 2011
Emptiness Function (Cont’d)
Then
-(s+ l - lG(s)) x
e
px 0 (s) =
s + l - lG(s)
M / G /1
px 0 (s) = e -(sI -D[G(s)])x (sI - D[G(s)])-1
MAP / G /1
Queueing Theory and Network Applications, Korea University, 2011
Transient Performance Measures
We compute the virtual waiting time (work in system) at
time t for a given queue length at time 0 by numerically
inverting the 2-dimensional transform, W(s1,s2)
Queueing Theory and Network Applications, Korea University, 2011
Sample Transient Results Virtual Delay Distribution
Superposition of
four identical twostate MMPP’s
Service times are
distributed as E16
Utilization = 0.7
queue length = 0 at
time 0
queue length = 32
at time 0
Queueing Theory and Network Applications, Korea University, 2011
Sample Transient Results Unstable System
Utilization (ρ) = 2.0
Queue length = 0 at
time 0
Queueing Theory and Network Applications, Korea University, 2011
Potential Use of Transient Results
Call acceptance algorithms
Current call acceptance algorithms are based on stationary
probabilities
Given the number of calls in the system, the time to reach a
stationary distribution can be MUCH longer than the interarrival and inter-departure times
Call acceptance tables could be computed a-priori based on
decision time intervals, the current state of the system, the
average and peak bandwidth and burstiness parameter of the
arriving call
Queueing Theory and Network Applications, Korea University, 2011
References
1. V. Ramaswami, D. Poole, S. Ahn, S. Byers, and A. Kaplan, "Assuring emergency
services access: .providing dial tone in the presence of long holding time internet dialup calls," Interfaces, vol. 35, pp. 411-22, 2005
2. V. Ramaswami and J. L. Wang, "Analysis of the Link Error Monitoring Protocols in the
Common Channel Signaling Network," IEEE/ACM Transactions on Networking, vol.
1, pp. 34-47, 1993
3. Ghosh, R. Jana, V. Ramaswami, J. Rowland, and N. K. Shankaranarayanan, "Modeling
and characterization of large-scale Wi-Fi traffic in public hot-spots," in IEEE
INFOCOM, Shanghai, China, 2011
4. D. M. Lucantoni, "The BMAP/G/1 queue: A tutorial," in Models and techniques for
Performance Evaluation of Computer and Communications Systems, L. D. a. R.
Nelson, Ed., ed: Springer Verlag, 1993, pp. 330-58
5. D. M. Lucantoni, G. L. Choudhury, and W. Whitt, "The transient BMAP/G/1 queue,"
Stoch. Mod., vol. 10, pp. 145-82, 1994
6. D. M. Lucantoni, "Further transient analysis of the BMAP/G/1 queue," Stoch. Mod.,
vol. 14, pp. 461-78, 1998
Queueing Theory and Network Applications, Korea University, 2011
References (Cont’d)
7.
Heymen and Lucantoni, “Modeling Multiple IP Traffic Streams with Rate Limits,”
IEEE/ACM Trans. On Networking, 11, No. 6, 2003
8.
Choudhury, Lucantoni and Whitt, “Multidimensional transform inversion with
applications to the transient M/G/1 queue," Ann. Appl. Prob ., 4, No. 3, 719-740,
1994
7.
L. Takács, Introduction to the Theory of Queues. New York: Oxford University
Press, 1962
Queueing Theory and Network Applications, Korea University, 2011
Where to Find Additional References
Some Lucantoni papers can be downloaded from
www.DLTconsulting.com
Follow the links to my Publications page (or just Google
my name)
More references are listed in those papers
Queueing Theory and Network Applications, Korea University, 2011