Page 1 |
Save page Remove page | Previous | 1 of 123 | Next |
|
small (250x250 max)
medium (500x500 max)
Large (1000x1000 max)
Extra Large
large ( > 500x500)
Full Resolution
All (PDF)
|
This page
All
|
LOW COMPLEXITY AND HIGH EFFICIENCY PREDICTION TECHNIQUES FOR VIDEO CODING by Chung-Cheng Lou A Dissertation Presented to the FACULTY OF THE USC GRADUATE SCHOOL UNIVERSITY OF SOUTHERN CALIFORNIA In Partial Ful llment of the Requirements for the Degree DOCTOR OF PHILOSOPHY (ELECTRICAL ENGINEERING) December 2011 Copyright 2011 Chung-Cheng Lou
Object Description
Title | Low complexity and high efficiency prediction techniques for video coding |
Author | Lou, Chung-Cheng |
Author email | clou@usc.edu;roylou@gmail.com |
Degree | Doctor of Philosophy |
Document type | Dissertation |
Degree program | Electrical Engineering |
School | Viterbi School of Engineering |
Date defended/completed | 2011-09-07 |
Date submitted | 2011-09-28 |
Date approved | 2011-09-28 |
Restricted until | 2011-09-28 |
Date published | 2011-09-28 |
Advisor (committee chair) | Kuo, C.-C. Jay |
Advisor (committee member) |
Ortega, Antonio Nakano, Aiichiro |
Abstract | Video compression has been extensively studied in the last two decades. The success of a coding algorithm relies on the effective removal of spatial and temporal redundancies in input video sequences. On the other hand, effective spatial and temporal prediction techniques demand high computational complexity, which makes it challenging to implement in resource-limited mobile devices. This research focuses on two topics: 1) complexity reduction of temporal prediction without significant rate-distortion (RD) performance degradation; and 2) the development of a more effective spatial prediction technique to enhance the RD performance. ❧ For the first topic, complexity reduction in temporal prediction is achieved by the development of an adaptive motion search range (SR) selection algorithm. A good choice of the SR size helps reduce memory access bandwidth while maintaining the RD coding performance. To begin with, we get a motion vector predictor (MVP) for a target block based on motion vectors (MVs) of its spatially and temporally neighboring blocks, which form a MV prediction set. Then, we relate the variance of the MV prediction set to the SR. That is, a larger variance implies lower accuracy of the MVP and a larger SR. Finally, we derive a probability model for the motion vector prediction difference (MVPD), the difference between the optimal MV and the MVP, to quantify the probability for a chosen SR to contain the optimal MV. The superior performance of the proposed SR selection algorithm is demonstrated by experimental results. ❧ For the second topic, a novel multi-order-residual-prediction (MORP) coding approach is proposed to improve spatial prediction efficiency in video coding. We observe that the compression ratio of a video coding algorithm depends on the nature of sequences as indicated by the ratio between inter and intra blocks in the bit-stream. When the percentage of intra blocks increases, the prediction efficiency decreases, thus leading to a poorer coding gain. In other words, one bottleneck of video coding lies in poor intra prediction efficiency. To address this issue, we propose an MORP coding scheme that adopts a second-order prediction scheme after the traditional first-order prediction. Different prediction techniques are adopted in different stages to tailor to the nature of the corresponding residual signals. The proposed MORP scheme outperforms H.264/AVC for the intra block coding and, thus, improves the overall coding efficiency. ❧ Finally, we analyze prediction inefficiency of the proposed MORP scheme and present an enhanced intra prediction coding called the generalized line-based intra prediction (GLIP) to improve it. The GLIP allows partial prediction of a coding block by enabling a subset of the neighboring prediction pixels. The residual signal after the first order prediction consists of the local line structure while the GLIP is designed to exploit this feature. The vector quantization (VQ) technique is used to approximate and encode the shape of the binarized residual signal. The coded patterns of image residuals can be predicted using partial line structures based on known neighboring pixel values. A variable length coding scheme is adopted to encode the codewords of the VQ codebook to reduce the prediction overhead. The proposed GLIP algorithm effectively reduces the residual bits for both intra- and inter- blocks and outperforms H.264/AVC by a significant margin. |
Keyword | motion estimation; MV search window prediction; mobile devices; memory-constrained system; video coding; motion search range; motion vector prediction; adaptive search range selection; high efficiency video coding |
Language | English |
Part of collection | University of Southern California dissertations and theses |
Publisher (of the original version) | University of Southern California |
Place of publication (of the original version) | Los Angeles, California |
Publisher (of the digital version) | University of Southern California. Libraries |
Provenance | Electronically uploaded by the author |
Type | texts |
Legacy record ID | usctheses-m |
Contributing entity | University of Southern California |
Rights | Lou, Chung-Cheng |
Physical access | The author retains rights to his/her dissertation, thesis or other graduate work according to U.S. copyright law. Electronic access is being provided by the USC Libraries in agreement with the author, as the original true and official version of the work, but does not grant the reader permission to use the work if the desired use is covered by copyright. It is the author, as rights holder, who must provide use permission if such use is covered by copyright. The original signature page accompanying the original submission of the work to the USC Libraries is retained by the USC Libraries and a copy of it may be obtained by authorized requesters contacting the repository e-mail address given. |
Repository name | University of Southern California Digital Library |
Repository address | USC Digital Library, University of Southern California, University Park Campus MC 7002, 106 University Village, Los Angeles, California 90089-7002, USA |
Repository email | cisadmin@lib.usc.edu |
Archival file | uscthesesreloadpub_Volume71/etd-LouChungCh-301.pdf |
Description
Title | Page 1 |
Contributing entity | University of Southern California |
Repository email | cisadmin@lib.usc.edu |
Full text | LOW COMPLEXITY AND HIGH EFFICIENCY PREDICTION TECHNIQUES FOR VIDEO CODING by Chung-Cheng Lou A Dissertation Presented to the FACULTY OF THE USC GRADUATE SCHOOL UNIVERSITY OF SOUTHERN CALIFORNIA In Partial Ful llment of the Requirements for the Degree DOCTOR OF PHILOSOPHY (ELECTRICAL ENGINEERING) December 2011 Copyright 2011 Chung-Cheng Lou |