
High dynamic range image compression based on visual saliency

Published online by Cambridge University Press:  27 May 2020

Jin Wang*
Affiliation:
Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China; Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Beijing, China
Shenda Li
Affiliation:
Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China
Qing Zhu
Affiliation:
Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China
* Corresponding author: Jin Wang. Email: ijinwang@bjut.edu.cn

Abstract

With a wider luminance range than conventional low dynamic range (LDR) images, high dynamic range (HDR) images are more consistent with the human visual system (HVS). Recently, the JPEG committee released a new HDR image compression standard, JPEG XT. It decomposes an input HDR image into a base layer and an extension layer. The base layer code stream provides JPEG (ISO/IEC 10918) backward compatibility, while the extension layer code stream helps to reconstruct the original HDR image. However, this method does not make full use of the HVS, wasting bits on regions that are imperceptible to human eyes. In this paper, a visual saliency-based HDR image compression scheme is proposed. The saliency map of the tone-mapped HDR image is first extracted and then used to guide the encoding of the extension layer, so that the compression quality adapts to the saliency of each coding region. Extensive experimental results show that our method outperforms JPEG XT profiles A, B, and C as well as other state-of-the-art methods, while retaining JPEG backward compatibility.
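
The pipeline outlined in the abstract can be summarized in code: a tone-mapped base layer is coded as ordinary JPEG, and the residual extension layer is coded with quality driven by saliency. The following is a minimal sketch of that flow, not the paper's implementation; the tone-mapping operator is a simple Reinhard-style stand-in, and jpeg_codec, extract_saliency, and encode_blocks are hypothetical placeholders for the actual base-layer codec, saliency extractor, and block coder.

```python
import numpy as np

def tone_map(hdr, gamma=2.2):
    """Reinhard-style global operator: a stand-in for the tone mapper
    that produces the JPEG-compatible base layer."""
    l = hdr / (hdr + 1.0)
    return np.clip(255.0 * l ** (1.0 / gamma), 0.0, 255.0).astype(np.uint8)

def inverse_tone_map(ldr, gamma=2.2):
    """Approximate inverse of tone_map, used to predict the HDR image
    from the decoded base layer before computing the residual."""
    l = np.clip((ldr.astype(np.float64) / 255.0) ** gamma, 0.0, 0.999)
    return l / (1.0 - l)

def encode_hdr(hdr, jpeg_codec, extract_saliency, encode_blocks):
    """Two-layer encoding sketch: the base stream is plain JPEG of the
    tone-mapped image; the extension (residual) layer is coded block by
    block with quality driven by the saliency map."""
    ldr = tone_map(hdr)
    base_stream, ldr_rec = jpeg_codec(ldr)       # encode + local decode
    residual = hdr - inverse_tone_map(ldr_rec)   # info the base layer lost
    saliency = extract_saliency(ldr)             # per-pixel, in [0, 1]
    ext_stream = encode_blocks(residual, saliency)
    return base_stream, ext_stream
```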

Information

Type
Original Paper
Creative Commons
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike licence (http://creativecommons.org/licenses/by-nc-sa/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the same Creative Commons licence is included and the original work is properly cited. The written permission of Cambridge University Press must be obtained for commercial re-use.
Copyright
Copyright © The Author(s), 2020. Published by Cambridge University Press in association with the Asia Pacific Signal and Information Processing Association.
Fig. 1. Framework of JPEG XT.

Fig. 2. Framework of our proposed method. (a) Encoder (b) Decoder.

Table 1. Correlation between residual image and saliency map.
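
Table 1's correlation is what motivates using saliency to steer the residual coder. As a rough illustration (the paper's exact correlation measure is not restated here), a Pearson coefficient between the absolute residual and the saliency map can be computed as follows:

```python
import numpy as np

def residual_saliency_correlation(residual, saliency):
    """Pearson correlation between |residual| and the saliency map.
    Both inputs are 2-D arrays of identical shape."""
    r = np.abs(residual).ravel().astype(np.float64)
    s = np.asarray(saliency, dtype=np.float64).ravel()
    return np.corrcoef(r, s)[0, 1]
```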

Fig. 3. Visualization of residual layers and saliency maps from different methods for test images Memorial, AtriumNight, Tree, and Nave. For each row, the first image is the residual layer; the second to fifth images are the saliency maps extracted by Cheng's method [13], Hou's method [14], Achanta's method [15], and Goferman's method [16], respectively. (a) Memorial, (b) AtriumNight, (c) Tree, (d) Nave.

Fig. 4. Visualization of residual layers and saliency maps from different methods for test images Rosette, BigFogMap, Rend06, and Rend09. For each row, the first image is the residual layer; the second to fifth images are the saliency maps extracted by Cheng's method [13], Hou's method [14], Achanta's method [15], and Goferman's method [16], respectively. (a) Rosette, (b) BigFogMap, (c) Rend06, (d) Rend09.

Fig. 5. Example of extension layer quality distribution matrix. (a) Reconstructed LDR image, (b) Saliency map, (c) Extension layer quality distribution.

Algorithm 1. Extension layer quality calculation
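
Algorithm 1 itself is given in the listing above; purely as an illustration of the idea, a block-wise quality calculation might map each block's mean saliency linearly into a quality range, as in the hypothetical sketch below (the block size and the [q_min, q_max] range are illustrative choices, and the saliency map is assumed normalized to [0, 1]):

```python
import numpy as np

def extension_quality_matrix(saliency, block=16, q_min=30, q_max=95):
    """Per-block quality factors for the extension layer.
    Each block's mean saliency is mapped linearly into [q_min, q_max],
    so salient regions receive higher coding quality."""
    h, w = saliency.shape
    rows, cols = h // block, w // block
    q = np.empty((rows, cols), dtype=np.int32)
    for i in range(rows):
        for j in range(cols):
            patch = saliency[i * block:(i + 1) * block,
                             j * block:(j + 1) * block]
            q[i, j] = int(round(q_min + (q_max - q_min) * patch.mean()))
    return q
```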

Algorithm 2. Compression algorithm

Algorithm 3. Decompression algorithm

Fig. 6. Test image set.

Table 2. Details of test images.

Fig. 7. Different quality ranges.

Fig. 8. Bitstream balancing.

Fig. 9. HDR-VDP-2 results comparison with different methods.

Fig. 10. SSIM results comparison with different methods.

Fig. 11. Visual quality comparison of Memorial with different methods at 3.2 bpp. (On the left is the original image; the top two rows on the right show non-salient regions of the reconstructed images, and the bottom two rows show salient ones. From left to right, the first row shows the original image and the official implementations of JPEG XT profiles A, B, and C; the second row shows Choi's method [10], Wei's method [21], Feyiz's method [22], and our proposed method, respectively.)

Fig. 12. Visual quality comparison of AtriumNight with different methods at 2.0 bpp. (On the left is the original image; the top two rows on the right show non-salient regions of the reconstructed images, and the bottom two rows show salient ones. From left to right, the first row shows the original image and the official implementations of JPEG XT profiles A, B, and C; the second row shows Choi's method [10], Wei's method [21], Feyiz's method [22], and our proposed method, respectively.)

Fig. 13. Visual quality comparison of BigFogMap with different methods at 2.7 bpp. (On the left is the original image; the top two rows on the right show non-salient regions of the reconstructed images, and the bottom two rows show salient ones. From left to right, the first row shows the original image and the official implementations of JPEG XT profiles A, B, and C; the second row shows Choi's method [10], Wei's method [26], Feyiz's method [27], and our proposed method, respectively.)