MPhil Thesis Defence


"Fast Video Transcoding and Graphics Hardware Acceleration"

By

Mr. Chi Wang Ho


Abstract

Video transcoding is one of the key technologies in modern multimedia
systems to enable effective access for a wide diversity of devices and
formats. In applications that deliver live videos over bandwidth limited
channel to multiple users, a transcoder is needed to provide dynamic
adaptations in the bit-rate and the format of the video bit-stream in
order to serve different users simultaneously. However, a robust and
flexible transcoder usually requires a cascaded decoding and re-encoding,
which is computational expensive. Thus, it is important to re-use the
information of the input video bit-stream as efficient as possible to
reduce the complexity while maintaining high output quality.

In addition, the transcoding process can be further speed up by making use
of computer graphics hardware. Graphics processing units (GPUs) provide
tremendous memory bandwidth and computational horsepower, and
general-purpose computation on GPUs (GPGPU) has demonstrated a great
success in recent years. Some researchers have used GPU for image and
video processing, such as transformation, motion compensation and motion
estimation, etc. Further research on using GPU for video processing, in
particular video encoding, is needed to exploit the possibilities of speed
improvement.

This thesis is divided into two parts. We first address the
algorithm-level solutions for fast transcoding by re-using the information
extracted from the input video bit-stream. Two rate control schemes are
proposed. The first proposed method, namely Zero-Residue Pre-Selection
(ZRPS), reduces the computational complexity of a well-known rate control
scheme TMN-8 by selecting a sub-set of macroblocks for the calculations.
The rate information of the input video bit-stream is used to select these
macroblocks. Another rate control scheme, Low-Complexity Rate Control
(LCRC), is proposed to use the relationship between the rate of the input
video bit-stream and the output video bit-stream on a per-row basis. Since
no explicit complexity measurement of macroblocks is needed, it is good
for H.264/AVC under the rate-distortion optimization framework. In
addition, we also propose a motion vector refinement scheme - Minimum Cost
Tendency Search (MCTS) for transcoding from H.263 to H.264/AVC. MCTS
explicitly considers the difference between their cost functions for
motion estimation and results in a more stable performance over a wide
range of output quality. In the second part of this thesis, a GPU-based
motion estimation for H.264/AVC is presented. It is flexible enough that
can apply on different graphics chips and obtain a good performance with
simple fine tuning.


Date:				Tuesday, 15 August 2006

Time:				10:00a.m.-12:00noon

Venue:				Room 5501
				Lifts 25-26

Committee Members:		Dr. Oscar Au (ECE, Supervisor)
				Dr. Gary Chan (Supervisor)
				Dr. Qian Zhang (Chairperson)
				Dr. Philip Fu


**** ALL are Welcome ****