Transcript Slides
Quality and Performance
Evaluation of VoIP End-points
Wenyu Jiang
Henning Schulzrinne
Columbia University
NYMAN 2002
September 3, 2002
Motivations
The quality of VoIP depends on both
the network and the end-points
Extensive QoS literature on network
performance, e.g., IntServ, DiffServ
Focus is on limiting network loss & delay
Little is known about the behavior of
VoIP end-points
Performance Metrics for VoIP
End-points
Mouth-to-ear (M2E) delay
Clock skew
Whether it causes any voice glitches
Amount of clock drift
Silence suppression behavior
C.f. network delay
Whether the voice is clipped (depends much on
hangover time)
Robustness to non-speech input, e.g., music
Robustness to packet loss
Voice quality under packet loss
Measurement Approach
Capture both original and output audio
Use “adelay” program to measure M2E delay
Assume a LAN environment by default
Serve as a baseline of reference, or lower bound
stereo
signal
PC
line in
notebook
speaker
original
audio
(mouth)
coupler
coupler
IP phone
output
audio
In
Out
ethernet
(ear)
LAN
IP phone
In
Out
ethernet
VoIP End-points Tested
Hardware End-points
Software clients
Cisco, 3Com and Pingtel IP phones
Mediatrix 1-line SIP/PSTN Gateway
Microsoft Messenger, NetMeeting (Win2K, WinXP)
Net2Phone (NT, Win2K, Win98)
Sipc (Solaris, Ultra-10)
Operating parameters:
In most cases, codec is G.711 -law, packet
interval is 20ms
Example M2E Delay Plot
3Com to Cisco, shown with gaps > 1sec
Delay adjustments correlate with gaps,
despite 3Com phone has no silence
suppression 60 experiment
1-1
experiment 1-2
silence gaps
M2E delay (ms)
55
50
45
40
35
0
50
100
150 200
time (sec)
250
300
350
Visual Illustration of M2E
Delay Drop, Snapshot #1
3Com to Cisco
1-1 case
Left/upper
channel is
original audio
Highlighted
section shows
M2E delay
(59ms)
Snapshot #2
M2E delay
drops to
49ms, at
time of
4:16
Snapshot #3
Presence of
a gap during
the delay
change
Effect of RTP M-bits on Delay
Adjustments
Cisco phone sends M-bits, whereas Pingtel
phone does not
Presence of M-bits results in more adjustments
100
Cisco to 3Com 1-1
Pingtel to 3Com 2-1
new talkspurt (M-bit=1)
90
M2E delay (ms)
80
70
60
50
40
30
20
0
50
100
150 200
time (sec)
250
300
Sender Characteristics
Certain senders may introduce delay
spikes, despite operating on a LAN
300
Mediatrix to 3Com 3-1
Mediatrix to Cisco 1-1
Mediatrix to Pingtel 1-1
250
M2E delay (ms)
200
150
100
50
0
50
100
150 200
time (sec)
250
300
Average M2E Delays for IP
phones and sipc
Averaging the M2E delay allows more compact
presentation of end-point behaviors
Receiver (especially sipc) plays an important role in
M2E delay
250
M2E delay (ms)
200
150
100
50
0
Receiver
3Com
Cisco
Mediatrix
Pingtel
sipc
Cisco G.729
Average M2E Delays for PC
Software Clients
Messenger 2000 wins the day
Its delay as receiver (68ms) is even lower than Messenger
XP, on the same hardware
It also results in slightly lower delay as sender
NetMeeting is a lot worse (> 400ms)
Messenger’s delay performance is similar to or better
than a GSM mobile phone.
A
B
AB
BA
MgrXP (pc)
MgrXP (notebook)
109ms
120ms
Mgr2K (pc)
NM2K (pc)
96.8ms 68.5ms
NM2K (notebook)
Mobile (GSM) PSTN (local number)
401ms
421ms
115ms
109ms
Delay Behaviors for PC Clients,
contd.
Net2Phone’s delay is also high
~200-500ms
V1.5 reduces PC->PSTN delay
PC-to-PC calls have fairly high delays
A
B
AB
BA
N2P v1.1 NT P-2 (pc2)
PSTN
(local number)
292ms
372ms
201ms
373ms
N2P v1.5 W2K K7 (pc)
196ms
401ms
N2P v1.5 W2K K7 (pc)
N2P v1.5 W98 P-3 525ms
(notebook2)
350ms
N2P v1.5 NT P-2 (pc2)
Effect of Clock Skew: Cisco to
3Com, Experiment 1-1
Symptom of
playout buffer
underflow
Waveforms
are dropped
Occurred at
point of delay
adjustment
Bugs in
software?
Clock Drift Rates
Mostly symmetric between two devices
Sipc has unusually high drift rates, > 300
ppm (parts per million)
Drift Rates 3Com
(in ppm)
Cisco
Mediatrix Pingtel
sipc
3Com
55.4
43.3
41.2
-333
-11.8
-12.1
-381
Cisco
-55.2
-0.4
Mediatrix
-43.1
11.7
Pingtel
-40.9
12.7
sipc
343
403
-0.8
2.8
-380
376
Drift Rates for PC Clients
Drift Rates not always symmetric!
But appears to be consistent between Messenger
2K/XP and Net2Phone on the same PC
Existence of 2 clocking circuits in sound card?
A
B
AB
BA
MgrXP (pc)
172
87.7
Mgr2K (pc)
MgrXP
(notebook)
165
85.6
NM2K (pc)
NM2K (notebook) ?
-33?
Net2Phone NT (pc2)
PSTN
290
-287
Net2Phone 2K (pc)
166
82
Mobile (GSM)
0
0
Packet Loss Concealment
Common PLC methods
Silence substitution (worst)
Packet repetition, with optional fading
Extrapolation (one-sided)
Interpolation (two-sided), best quality
Use deterministic bursty loss pattern
3/100 means 3 consecutive losses out of every
100 packets
Easier to locate packet losses
Tested 1/100, 3/100, 1/20, 5/100, etc.
PLC Behaviors
Loss tolerance (at 20ms interval)
Level of audio distortion by packet loss
By measuring loss-induced gaps in output audio
3Com and Pingtel phones: 2 packet losses
Cisco phone: 3 packet losses
Inaudible at 1/100 for all 3 phones
Inaudible at 3/100 and 1/20 for Cisco phone, yet
audible to very audible for the other two.
Cisco phone is the most robust
Probably uses interpolation
Effect of PLC on Delay
No affirmative effect on M2E delay
E.g., sipc to Pingtel
80
0/100
3/100
1/20
mouth-to-ear delay (ms)
75
70
65
60
55
50
0
10
20
30
40
time (sec)
50
60
Silence Suppression
Why?
Saves bandwidth
May reduce processing power (e.g., in
conferencing mixer)
Facilitates per-talkspurt delay adjustment
Key parameters
Silence detection threshold
Hangover time, to delay silence suppression and
avoid end clipping of speech
Usually 200ms is long enough [Brady ’68]
Hangover Time
Measured by feeding ON-OFF
waveforms and monitor RTP packets
Cisco phone’s is the longest (2.3-2.36
sec), then Messenger (1.06-1.08 sec),
then NetMeeting (0.56-0.58 sec)
A long hangover time is not necessarily
bad, as it reduces voice clipping
Indeed, no unnatural gaps are found
Does waste a bit more bandwidth
Robustness of Silence
Detectors to Music
On-hold music is often used in
customer support centers
Need to ensure music is played without
any interruption due to silence suppression
Tested with a 2.5-min long soundtrack
Messenger starts to generate many
unwanted gaps at input level of -24dB
Cisco phone is more robust, but still
fails from input level of -41.4dB
M2E Delay under Jitter
Delay properties under the LAN environment
serves as a baseline of reference
When operating over the Internet:
Fixed portion of delay adds to M2E delay as a constant
Variable portion (jitter) has a more complex effect
Preliminary results
Used typical cable modem
delay traces
Tested sipc to Cisco
No audible distortion due
to late loss
Added delay is normal
180
170
160
150
140
130
120
110
100
90
High jitter (uplink)
Low jitter (downlink)
mouth-to-ear delay (ms)
0
20 40 60 80 100 120 140 160 180
time (sec)
Conclusions
Average M2E delay:
Low (mostly < 80ms) for hardware IP phones
Software clients: low (< 120ms) for Messenger, particularly
Messenger 2000 (68.5ms)
The application (receiver) is the most vital in determining delay
Clock drift rates
Are high for some software clients (sipc, Net2Phone)
In some cases non-symmetric!
Packet loss concealment quality
Poorly implemented end-points can easily undo good network QoS
Acceptable in all 3 IP phones tested, w. Cisco being most robust
Silence detectors in Cisco phone, Messenger, NetMeeting
Long hangover time, no audible unwanted gaps for speech input
May falsely predict music as silence
Future Work
Further tests with more end-points on
how jitter influences M2E delay
Measure the sensitivity (threshold) of
various silence detectors
Investigate the non-symmetric clock
drift phenomena
Additional experiments as more brands
of VoIP end-points become available