Engineering for QoS and the limits of service differentiation
Download
Report
Transcript Engineering for QoS and the limits of service differentiation
Tequila Workshop
Jan 2001
The statistical nature of
traffic and its impact on
the realisability of QoS
guarantees
Jim Roberts, France Telecom R&D
([email protected])
France Télécom R&D
Quality of service: a commodity?
Example SLS:
Scope: N/N
Flow identification: EF-valued DSCP, set of destination prefixes
Traffic conformance: token bucket (r,b)
Excess treatment: drop
Service schedule: Oct 3, 9:00 - 11:00
Performance parameters: 0% loss
The role of traffic engineering:
What is the relation between (r,b) and user traffic characteristics ?
How can the network guarantee 0% loss ?
How much does this service cost ?
Maybe these questions don’t have a satisfactory answer...
depending on the statistical nature of traffic and the realisability of QoS
guarantees
France Télécom R&D
Outline
What is “Quality of Service” ?
Characterising IP traffic
Performance for stream applications
Performance for elastic applications
QoS and pricing
France Télécom R&D
QoS and reservation
users express their demand in terms of aggregates
different classes (EF, AF1-4, ...)
different scopes : point to point,..., point to world, (world to point?)
e.g., 2 Mb/s “class 1” from A to B, 5 Mb/s “class 3” from A to C or D,...
network filters traffic at ingress
packets are “in” or “out” ... or “nearly in”
e.g., token bucket, sliding window,...
network “reserves” bandwidth
admission control / traffic engineering
using policy servers, signalling,...
resource provisioning
relies on “adequate provisioning”
e.g., service differentiation through different overbooking factors
France Télécom R&D
Doubts about aggregates
traffic characterization
can a user choose its filter parameters?
how can the network reserve enough resources?
what about the small user?
end-to-end performance
what absolute quality of service?
what relative quality of service?
pricing
pricing for value...
...or pricing for cost?
France Télécom R&D
QoS and end-to-end performance
transparency for streaming applications
audio and video: interactive or playback
QoS low packet loss and delay
scope for differentiation: real time/non-real time, hi-fi / lo-fi,...
response time for elastic applications
Web, e-mail, file transfer, MP3,...
QoS high throughput
scope for differentiation: interactive/background, large flows/small flows,...
QoS is a statistical phenomenon
probabilities, averages,...
...depending on available capacity
...and traffic demand
QoS is often binary
“good enough”...
...or “too bad” !
France Télécom R&D
Outline
What is Quality of Service?
Characterising IP traffic
Performance for stream applications
Performance for elastic applications
QoS and pricing
France Télécom R&D
Internet traffic is self-similar
a self-similar process
variability at all time scales
due to:
infinite variance of flow size
TCP induced burstiness
Ethernet traffic, Bellcore 1989
France Télécom R&D
Internet traffic is self-similar
a self-similar process
variability at all time scales
due to:
infinite variance of flow size
TCP induced burstiness
a practical consequence
difficult to characterise a traffic
aggregate
10 s
Ethernet traffic, Bellcore 1989
France Télécom R&D
Traffic on a US backbone link
(Thomson et al, 1997)
traffic intensity is predictable ...
... and stationary in the busy hour
France Télécom R&D
Traffic on a French backbone link
traffic intensity is predictable ...
... and stationary in the busy hour
tue
12h
France Télécom R&D
wed
thu
18h
fri
sat
00h
sun
mon
06h
IP flows
a flow = one instance of a given application
a "continuous flow" of packets
basically two kinds of flow, stream and elastic
stream flows
audio and video, real time and playback
rate and duration are intrinsic characteristics
highly variable rate and duration
Poisson arrival process (?)
elastic flows
digital documents ( Web pages, files, ...)
rate and duration are measures of performance
highly variable size
Poisson arrivals (?)
95% of packets are in elastic flows
France Télécom R&D
Modelling traffic demand
stream traffic demand
arrival rate x bit rate x duration
elastic traffic demand
arrival rate x size
a stationary process in the "busy hour"
e.g., Poisson flow arrivals, independent flow size
traffic
demand
Mbit/s
busy hour
France Télécom R&D
time of day
Outline
What is Quality of Service?
Characterising IP traffic
Performance for stream applications
Performance for elastic applications
QoS and pricing
France Télécom R&D
Open loop control for stream
traffic
buffered of bufferless multiplexing ?
jitter control ?
admission control or adaptive applications ?
reservation or implicit admission control ?
scope for service differentiation ?
user-network
interface
France Télécom R&D
network-network
interface
user-network
interface
Buffered multiplexing
performance
a buffer to absorb rate overload
admission control to ensure
Pr[buffer overflow]<e
but performance depends on complex
traffic characteristics
e.g., self-similarity
QoS of buffered multiplexing is
uncontrollable
0
buffer size
0
NB. token bucket is a virtual queue
difficult choice of r and b parameters?
no satisfactory descriptor for
more variable
variable rate flows or aggregates
less variable
log Pr[saturation]
France Télécom R&D
“Bufferless” multiplexing: alias
rate envelope multiplexing
admission control to ensure Pr [Lt>C] < e
performance depends only on stationary rate distribution
loss rate E [(Lt -C)+] / E [Lt]
performance is insensitive to self-similarity (and other correlation)
“negligible jitter” for flows shaped at the ingress (cf. INFOCOM 2001)
output
rate C
combined
input
rate Lt
time
France Télécom R&D
Efficiency of bufferless
multiplexing
low loss imposes small amplitude of rate variations ...
peak rate << link rate (eg, 1%)
... or low utilisation
overall mean rate << link rate
we may have both in an integrated network
priority to streaming traffic
residue shared by elastic flows
France Télécom R&D
Implicit admission control
accept new flow only if transparency preserved
given flow peak rate
and estimated available bandwidth
reject new flow if necessary
by discarding first packets (probes)
uncritical decision threshold if streaming traffic is light
in an integrated network
France Télécom R&D
Differentiation for stream traffic
different delays?
priority queues, WFQ, ...
but what guarantees?
different loss?
different utilisation (WFQ, ...)
"spatial queue priority"
delay
delay
loss
loss
partial buffer sharing, push out
or negligible loss and delay for all
elastic-stream integration ...
... and low stream utilisation
loss
delay
France Télécom R&D
Provisioning for negligible
blocking
"classical" teletraffic theory; assume
Poisson arrivals, rate l
constant rate per flow r
mean duration 1/m
mean demand, A = l/m r bits/s
blocking probability for capacity C
B = E(C/r,A/r)
E(m,a) is Erlang's formula:
m
E(m,a)= a /
m!
scale economies
generalizations exist:
for different rates
for variable rates
France Télécom R&D
ai
i m i !
utilization (r=a/m) for E(m,a) = 0.01
r
0.8
0.6
0.4
0.2
m
0
20
40
60
80
100
Outline
What is Quality of Service?
Characterising IP traffic
Performance for stream applications
Performance for elastic applications
QoS and pricing
France Télécom R&D
Closed loop control for elastic
traffic
impact of packet scale on flow scale response time?
performance of statistical bandwidth sharing ?
need for admission control ?
scope for service differentiation ?
user-network
interface
France Télécom R&D
network-network
interface
user-network
interface
Bandwidth and packet loss rate
congestion
avoidance
loss rate
p
B(p)
a multi-fractal arrival process
but loss and bandwidth related by TCP (cf. Padhye et al.)
Wmax
for p = 0
RTT
1
B( p )
for small p > 0
2
RTT 2 p / 3 T0 min 1,3 3p / 8 p(1 32 p )
1
for p 1
64T0
thus, p = p(B): i.e., loss rate depends on bandwidth share
France Télécom R&D
Bandwidth sharing
reactive control (TCP, scheduling) shares bottleneck bandwidth
unequally
depending on RTT, protocol implementation, etc.
and differentiated services parameters
optimal sharing in a network: objectives and algorithms...
max-min fairness, proportional fairness, maximal utility,...
... but response time depends more on traffic process than the static
sharing algorithm!
Example: a linear network
route 0
route 1
France Télécom R&D
route L
Flow level performance of a
bottleneck link
assume perfect fair shares
link rate C, n elastic flows
each flow served at rate C/n
assume Poisson flow arrivals
an M/G/1 processor sharing queue
load, r = arrival rate x size / C
performance insensitive to size distribution
Pr [n transfers] = rn(1-r)
E [response time] = size / C(1-r)
instability if r > 1
i.e., unbounded response time
stabilized by aborted transfers...
... or by admission control
link capacity C
fair shares
a processor sharing queue
throughput
C
r
0
0
France Télécom R&D
1
Generalizations of PS model
non-Poisson arrivals
transfer
Poisson
session
arrivals
Poisson sessions
general session structure
discriminatory processor sharing
weight fi for class i flows
service rate fi
rate limitations (same for all flows)
maximum rate per flow (eg, access rate)
minimum rate per flow (by admission control)
France Télécom R&D
flows
processor
sharing
think time
infinite
server
Admission control can be useful
France Télécom R&D
Admission control can be useful
France Télécom R&D
Admission control can be useful
...
... to prevent disasters at sea !
France Télécom R&D
Admission control can also be
useful for IP flows
improve efficiency of TCP
reduce retransmissions overhead ...
... by maintaining throughput
implicit admission control
discard packets of new flows
when available capacity is low
prevent instability
due to overload (r > 1)...
...and retransmissions
avoid aborted transfers
user impatience
"broken connections"
a means for service differentiation...
France Télécom R&D
Choosing an admission control
threshold
N = the maximum number of flows admitted
negligible blocking when r<1, maintain quality when r>1
M/G/1/N processor sharing system
bandwidth C/N; bandwidth C/N , for r>1
Pr [blocking] = rN(1 - r)/(1 - rN+1) (1 - 1/r) , for r>1
uncritical choice of threshold
eg, 1% of link capacity (N=100)
1
300
.8
Blocking probability
E [Response time]/size
200
.6
.4
r = 1.5
r = 1.5
100
.2
r = 0.9
r = 0.9
0
0
0
France Télécom
R&D100
200
N
0
100
200
N
Impact of access rate on
backbone sharing
TCP throughput is limited by access rate...
modem, DSL, cable
... and by server performance, TCP receive window, other links,...
backbone link transparent unless saturated!
ie, unless r > 1 (or r > 0.9...)
throughput
C
backbone link
(rate C)
access links
(rate<<C)
access
rate
0
0
France Télécom R&D
1
r
Differentiation for elastic traffic
different utilization
separate pipes
class based queuing
different per flow shares
WFQ
impact of RTT,...
throughput
C
access
rate
0
r
0
discrimination in overload
impact of aborts (?)
or by admission control
1
throughput
C
access
rate
0
France Télécom R&D
0
r
1
Integrating streaming and
elastic traffic
priority to packets of streaming flows
low utilization negligible loss and delay
using EF ?
elastic flows use all remaining capacity
better response times
per flow fair queuing (?)
to prevent overload
implicit admission control...
...and adaptive routing
an identical admission criterion for streaming and elastic flows
available rate > R
France Télécom R&D
Differentiation by accessibility
block class 1 when 100 flows in progress
- block class 2 when N2 flows in progress
in underload: both classes have negligible blocking (B1 B2 0)
in overload: discrimination is effective
if r1 < 1 < r1 + r2, B1 0,
B2 (r1+r2-1)/r2
if 1 < r1,
B1 (r1-1)/r1, B2 1
1
1
1
r1 = r2 = 0.4
r1 = r2 = 0.6
0
0
N2
France Télécom R&D
r1 = r2 = 1.2
B2
.33
B2B10
B2
B1
0
0
N2
B1
.17
100
0
N2
100
Provisioning for negligible
blocking for elastic flows
"elastic" teletraffic theory; assume
Poisson arrivals, rate l
mean size s
blocking probability for capacity C
utilization r = ls/C
m = admission control limit
B(r,m) = rm(1-r)/(1-rm+1)
impact of access rate
C/access rate = m
B(r,m) E(m,rm)
utilization (r) for B = 0.01
r
0.8
E(m,rm)
0.6
0.4
0.2
m
0
France Télécom R&D
20
40
60
80
100
Outline
What is Quality of Service?
Characterising IP traffic
Performance for stream applications
Performance for elastic applications
QoS and pricing
France Télécom R&D
Service differentiation and
pricing
different QoS requires different prices...
or users will always choose the best
...but streaming and elastic applications are qualitatively different
choose streaming class for transparency
choose elastic class for throughput
no need for streaming/elastic price differentiation
different prices exploit different "willingness to pay"...
bringing greater economic efficiency
...but QoS is not stable or predictable
depends on route, time of day,..
and on factors outside network control: access, server, other networks,...
network QoS is not a sound basis for price discrimination
France Télécom R&D
Pricing to pay for the network
fix a price per byte
to cover the cost of infrastructure and operation
estimate demand
at that price
provision network to handle that demand
with excellent quality of service
optimal price
revenue = cost
capacity
demand
demand
capacity
$$$
$$$
time of day
France Télécom R&D
time of day
Price differentiation
maximise value by exploiting different “willingness to pay”
business, professional, residential
price components
flat rate subscription
per byte charge ( 0)
time of day variations
price differences based on stable criteria
e.g., access rate, available services
pay for differentiated accessibility...
e.g., flat rate payment for guaranteed reliability
...but not for congestion
i.e., pay more for worse quality !
France Télécom R&D
Conclusions
a statistical characterisation of demand
a stationary random process in the busy period
a flow level characterisation (streaming and elastic flows)
transparency for streaming flows
rate envelope ("bufferless") multiplexing
the "negligible jitter conjecture"
response time for elastic flows
a "processor sharing" flow scale model
instability in overload (i.e., E[demand]>capacity)
service differentiation
distinguish streaming and elastic classes
limited scope for within-class differentiation
flow admission control in case of overload
pricing
per byte + flat rate charges
France Télécom R&D
C
r
0
0
1