A novel low complexity local hybrid pseudo-SSIM-SATD distortion metric towards perceptual rate control

Joshi, Yetish, Loo, Jonathan, Shah, Purav ORCID: https://orcid.org/0000-0002-0113-5690, Rahman, Shahedur and Chang, Yoong Choon (2013) A novel low complexity local hybrid pseudo-SSIM-SATD distortion metric towards perceptual rate control. In: IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB), 2013, 05-07 June 2013, Brunel, London. (doi:10.1109/BMSB.2013.6621695)

PDF (Pre print version) - Final accepted version (with author's formatting)
Download (3MB) | Preview


The front-end block-based video encoder applies an Image Quality Assessment (IQA) as part of the distortion metric. Typically, the distortion metric applies uniform weighting for the absolute differences within a Sub-Macroblock (Sub-MB) at any given time. As video is predominately designed for Humans, the distortion metric should reflect the Human Visual System (HVS). Thus, a perceptual distortion metric (PDM), will lower the convex hull of the Rate-Distortion (R-D) curve towards the origin, by removing perceptual redundancy and retaining perceptual clues. Structured Similarity (SSIM), a perceptual IQA, has been adapted via logarithmic functions to measure distortion, however, it is restricted to the Group of Picture level and hence unable to adapt to the local Sub-MB changes. This paper proposes a Local Hybrid Pseudo-SSIM-SATD (LHPSS) Distortion Metric, operating at the Sub-MB level and satisfying the Triangle Equality Rule (≤). A detailed discussion of LHPSS's Psuedo-SSIM model will illustrate how SSIM can be perceptually scaled within the distortion metric space of SATD using non-logarithmic functions. Results of HD video encoded across different QPs will be presented showing the competitive bit usage under IbBbBbBbP prediction structure for similar image quality. Finally, the mode decision choices superimposed on the Intra frame will illustrate that LHPSS lowers the R-D curve as homogeneous regions are represented with larger block size.

Item Type: Conference or Workshop Item (Paper)
Research Areas: A. > School of Science and Technology > Computer and Communications Engineering
Item ID: 17113
Notes on copyright: © 2013 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Useful Links:
Depositing User: Purav Shah
Date Deposited: 30 Jun 2015 13:26
Last Modified: 01 Jun 2019 22:43
ISBN: 9781467360470
URI: https://eprints.mdx.ac.uk/id/eprint/17113

Actions (login required)

Edit Item Edit Item

Full text downloads (NB count will be zero if no full text documents are attached to the record)

Downloads per month over the past year