Hostname: page-component-89b8bd64d-5bvrz Total loading time: 0 Render date: 2026-05-11T01:14:26.152Z Has data issue: false hasContentIssue false

Residual sign prediction in transform domain for next-generation video coding

Published online by Cambridge University Press:  11 October 2019

Alexey Filippov
Affiliation:
Huawei Technologies, Moscow Research Center, Media Algorithms laboratory, 17/2, Krylatskaya str., Moscow 121614, Russia
Vasily Rufitskiy
Affiliation:
Huawei Technologies, Moscow Research Center, Media Algorithms laboratory, 17/2, Krylatskaya str., Moscow 121614, Russia
Alexander Karabutov
Affiliation:
Huawei Technologies, Moscow Research Center, Media Algorithms laboratory, 17/2, Krylatskaya str., Moscow 121614, Russia
Jianle Chen*
Affiliation:
Huawei Technologies, 2330 Central Expy, Santa Clara, CA 95050, USA
*
Corresponding author: J. Chen, Email: jianle.chen@huawei.com

Abstract

In this paper, we present a technique that is known as Residual Sign Prediction in Transform Domain (TDRSP) and is aimed at increasing compression performance by reducing the bits overhead of residue sign. These signs are typically coded by entropy coders in bypass mode that results in the high cost of sign bins which require 1 bit per bin in a bitstream. TDRSP allows us to reduce this cost by predicting residue signs so that the probability of a guess is significantly higher than 50%. Hence, arithmetic coding with contexts becomes applicable to the signs. In contrast to Residual Sign Prediction (RSP) performed in spatial domain, TDRSP avoids switching between domains and carries out calculations completely in transform domain to efficiently decrease the computational complexity of RSP. Simulations performed on top of Versatile Video Coding test model reference software (VTM-1.0) in accordance with the Joint Video Experts Team common test conditions show that more than 2.0% and up to 1.8% of the Bjøntegaard Delta rate can be achieved for All Intra and Random Access configurations, respectively.

Information

Type
Industrial Technology Advances
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
Copyright © The Authors, 2019
Figure 0

Fig. 1. Deblocking filtering.

Figure 1

Fig. 2. Definitions of d1, i, j and d2, i, j [13].

Figure 2

Fig. 3. Previous neighboring blocks used in improved DC prediction [15].

Figure 3

Fig. 4. Exploring inter-row/column redundancy for dc [17].

Figure 4

Fig. 5. Three patterns over a block boundary for two horizontally neighboring blocks [16].

Figure 5

Fig. 6. Overview of sign estimation [21].

Figure 6

Fig. 7. Comparison of conventional sign coding with the technique proposed in [21].

Figure 7

Table 1. Steps of technique proposed in [21].

Figure 8

Fig. 8. Pixels used to derive cost value in [22].

Figure 9

Table 2. Hypotheses check for three coefficients by reusing templates.

Figure 10

Fig. 9. Usage of two lists of coefficients for CABAC context determination.

Figure 11

Fig. 10. Frequency-domain sign prediction concept.

Figure 12

Table 3. TDRSP performance over VTM1.0.

Figure 13

Table 4. TDRSP performance over VTM1.0, TDRSP applied only to intra-coded blocks.

Figure 14

Table 5. TDRSP performance over BMS 1.0.

Figure 15

Table 6. TDRSP performance over BMS1.0, TDRSP applied only to intra-coded blocks.

Figure 16

Table 7. TDRSP performance over IFVC1.0, TDRSP applied only to intra-coded blocks.

Figure 17

Table 8. Per-sequence estimation of probability of correct sign prediction TDRSP performance in Random access mode.