Hostname: page-component-89b8bd64d-mmrw7 Total loading time: 0 Render date: 2026-05-09T07:37:50.020Z Has data issue: false hasContentIssue false

Region-of-interest-based rate control scheme for high-efficiency video coding

Published online by Cambridge University Press:  17 December 2014

Marwa Meddeb
Affiliation:
Departement of TSI, Institut Mines-Télécom, Télécom ParisTech, CNRS LTCI, F-75634 Paris Cedex 13, France
Marco Cagnazzo*
Affiliation:
Departement of TSI, Institut Mines-Télécom, Télécom ParisTech, CNRS LTCI, F-75634 Paris Cedex 13, France
Béatrice Pesquet-Popescu
Affiliation:
Departement of TSI, Institut Mines-Télécom, Télécom ParisTech, CNRS LTCI, F-75634 Paris Cedex 13, France
*
Corresponding author: Marco Cagnazzo Email: cagnazzo@telecom-paristech.fr

Abstract

This paper presents a novel rate control scheme designed for the newest high efficiency video coding (HEVC) standard, and aimed at enhancing the quality of regions of interest (ROI) for a videoconferencing system. It is designed to consider the different regions at both frame level and coding tree unit (CTU) level. The proposed approach allocates a higher bit rate to the region of interest while keeping the global bit rate close to the assigned target value. The ROIs, typically faces in this application, are automatically detected and each CTU is classified in a region of interest map. This binary map is given as input to the rate control algorithm and the bit allocation is made accordingly. The algorithm is tested, first, using the initial version of the controller introduced in HEVC test model (HM.10), then, extended in HM.13. In this work, we first investigate the impact of differentiated bit allocation between the two regions using a fixed bit rate ratio in intra-coded frames (I-frames) and Bidirectionally predicted frames (B-frames). Then, unit quantization parameters (QPs) are computed independently for CTUs of different regions. The proposed approach has been compared to the reference controller implemented in HM and to a ROI-based rate control algorithm initially proposed for H.264 that we adopted to HEVC and implemented in HM.9. Experimental results show that our scheme has comparable performances with the ROI-based controller proposed for H.264. It achieves accurate target bit rates and provides an improvement in region of interest quality, both in objective metrics (up to 2 dB in PSNR) and based on subjective quality evaluation.

Information

Type
Original Paper
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - NCCreative Common License - SA
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike licence (http://creativecommons.org/licenses/by-nc-sa/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the same Creative Commons licence is included and the original work is properly cited. The written permission of Cambridge University Press must be obtained for commercial re-use.
Copyright
Copyright © The Authors, 2014
Figure 0

Fig. 1. RC scheme for HEVC.

Figure 1

Fig. 2. R–D performances of the R-λ algorithm, compared URQ model.

Figure 2

Fig. 3. Comparison of bit fluctuation per frame of R-λ and URQ models for sequence Johnny.

Figure 3

Table 1. R-D performance of R-λ algorithm using hierarchical and adaptive bit allocation, compared to equal bit allocation.

Figure 4

Table 2. Intra bit allocation refinement weights.

Figure 5

Fig. 4. ROI-based rate control scheme for HEVC.

Figure 6

Fig. 5. Test sequences and ROI maps (a) Johnny (ROI represents 13%), (b) KristenAndSara (ROI represents 14%), (c) FourPeople (ROI represents 10%).

Figure 7

Table 3. Control accuracy comparison of the reference and the proposed controller for inter frames using HM.10.

Figure 8

Table 4. Control accuracy comparison of the reference and the proposed controller for intra frames using HM.13.

Figure 9

Fig. 6. ΔPSNR ROI and non-ROI (dB) for the last 25 GOPs of FourPeople at 128 kbps and using hierarchical bit allocation.

Figure 10

Table 5. Control accuracy comparison of the reference and the proposed controller for inter frames using HM.13.

Figure 11

Fig. 7. Comparison of QP repartition at the CTU level of Johnny.

Figure 12

Table 6. Control accuracy comparison of the reference and the proposed controller in HM.13

Figure 13

Fig. 8. Subjective comparison of Johnny coded at 128 kbps for an I frame. (a) Reference RC, (b) Proposed RC.

Figure 14

Fig. 9. Subjective comparison of Johnny coded at 128 kbps for a B frame. (a) Reference RC, (b) Proposed RC.

Figure 15

Fig. 10. Subjective comparison of KristenAndSara coded at 128 kbps for an I frame. (a) Reference RC, (b) proposed RC.

Figure 16

Fig. 11. Subjective comparison of KristenAndSara coded at 128 kbps for a B frame. (a) Reference RC, (b) proposed RC.

Figure 17

Fig. 12. Subjective comparison of FourPeople coded at 128 kbps for an I frame. (a) Reference RC, (b) proposed RC.

Figure 18

Fig. 13. Subjective comparison of FourPeople coded at 128 kbps for a B frame. (a) Reference RC, (b) proposed RC.

Figure 19

Fig. 14. SSIM map comparison Johnny. (a) Original frame, (b) SSIM maps.

Figure 20

Fig. 15. SSIM map comparison KristenAndSara (a) Original Frame, (b) SSIM maps.

Figure 21

Fig. 16. SSIM map comparison FourPeople. (a) Original frame, (b) SSIM maps.

Figure 22

Fig. 17. R-D performances of R-λ ROI-based algorithm and URQ ROI-based model compared to URQ reference RC algorithm.

Figure 23

Table 7. RC results using URQ model at 128 kbps.

Figure 24

Fig. 18. Comparison of bit fluctuation per GOP of R-λ and URQ ROI-based models at low and high bit rate for sequence Johnny.

Figure 25

Fig. 19. Comparative ROI-based R-D performances of different methods. (a) Johnny, (b) KristenAndSara, and (c) FourPeople.