A Rate Perceptual-Distortion Optimized Video Coding HEVC

Bumshik LEE  Jae Young CHOI  

IEICE TRANSACTIONS on Information and Systems   Vol.E101-D   No.12   pp.3158-3169
Publication Date: 2018/12/01
Publicized: 2018/08/24
Online ISSN: 1745-1361
DOI: 10.1587/transinf.2018EDP7164
Type of Manuscript: PAPER
Category: Image Processing and Video Processing
HEVC,  SSIM,  perceptual video coding,  

Full Text: PDF(1.7MB)>>
Buy this Article

In this paper, a perceptual distortion based rate-distortion optimized video coding scheme for High Efficiency Video Coding (HEVC) is proposed. Structural Similarity Index (SSIM) in transform domain, which is known as distortion metric to better reflect human's perception, is derived for the perceptual distortion model to be applied for hierarchical coding block structure of HEVC. A SSIM-quantization model is proposed using the properties of DCT and high resolution quantization assumption. The SSIM model is obtained as the sum of SSIM in each Coding Unit (CU) depth of HEVC, which precisely predict SSIM values for the hierarchical quadtree structure of CU in HEVC. The rate model is derived from the entropy, based on Laplacian distributions of transform residual coefficients and is jointly combined with the SSIM-based distortion model for rate-distortion optimization in an HEVC video codec and can be compliantly applied to HEVC. The experimental results demonstrate that the proposed method achieves 8.1% and 4.0% average bit rate reductions in rate-SSIM performance for low-delay and random access configurations respectively, outperforming other existing methods. The proposed method provides better visual quality than the conventional mean square error (MSE)-based RDO coding scheme.