Special University Oral Examination

Robust Low-Latency Voice and Video Communication
over Best-Effort Networks

Yi Liang
Information Systems Laboratory
Department of Electrical Engineering

Wednesday, March 12, 2003
1:00 PM, CIS-X Auditorium
(Refreshments at 12:45 PM)

Abstract

The quality of service limitation of today's best-effort networks poses a major challenge for low-latency multimedia communication. Excessive delay, packet loss, variations in throughput, and high delay jitter all impair the performance of communication. In this work, we address all these challenges from the client side, the transport, and source coding.


On the client side, a new playout scheduling scheme is proposed to improve the trade-off between buffering delay and packet loss for real-time voice communication. In this scheme the network delay is estimated from past statistics and the playout time of the media packet is adaptively adjusted in a highly dynamic way. Proper reconstruction of continuous output audio is achieved by scaling individual media packets using a time-scale modification technique based on the Waveform Similarity Overlap-Add algorithm. Results of subjective listening tests show that this operation does not impair audio quality. Simulation results based on Internet measurements show that the trade-off between buffering delay and late loss can be significantly improved.


On the transport, we further improve the performance of communication by exploiting diversity of multiple transmission channels, where the source media are coded into multiple redundant streams and sent over independent network paths. The playout time of the received media packets is determined based on an adaptive multi-stream playout scheduling technique that uses a Lagrangian cost function to trade off delay and loss. Experiments demonstrate further gains of reduced latency and distortion, resulting from path diversity.


Compared to audio, compressed video exhibits much stronger dependency across packets. Today's Internet video streaming systems employ buffering and retransmission to guarantee the correct reception of each packet, which leads to high latency in media delivery. In this work, we present an efficient low-latency streaming system that does not require retransmission of lost packets. A source coding scheme that dynamically manages the dependency across packets is proposed using optimal reference picture selection. The optimal selection of the reference is achieved within a rate-distortion framework and is adapted to the channel conditions, which minimizes the expected end-to-end distortion under the given rate constraint. An improved trade-off between compression efficiency and error-resilience has been achieved. The increased error-resilience eliminates the need of retransmission, which makes it possible to reduce the delivery latency from 5-15 seconds to a few hundred milliseconds, with similar video quality maintained.