Hostname: page-component-89b8bd64d-sd5qd Total loading time: 0 Render date: 2026-05-09T06:08:43.433Z Has data issue: false hasContentIssue false

Rate-dependent seam carving and its application to content-aware image coding

Published online by Cambridge University Press:  27 February 2013

Yuichi Tanaka*
Affiliation:
Graduate School of Engineering, Utsunomiya University, Utsunomiya, Tochigi 321-8585, Japan Graduate School of BASE, Tokyo University of Agriculture and Technology, Koganei, Tokyo 184-8588, Japan
Taichi Yoshida
Affiliation:
Department of Electronics and Electrical Engineering, Keio University, Yokohama, Kanagawa 223-8522, Japan
Madoka Hasegawa
Affiliation:
Graduate School of Engineering, Utsunomiya University, Utsunomiya, Tochigi 321-8585, Japan
Shigeo Kato
Affiliation:
Graduate School of Engineering, Utsunomiya University, Utsunomiya, Tochigi 321-8585, Japan
Masaaki Ikehara
Affiliation:
Department of Electronics and Electrical Engineering, Keio University, Yokohama, Kanagawa 223-8522, Japan
*
Y. Tanaka email: ytnk@cc.tuat.ac.jp

Abstract

Content-aware image resizing (CAIR) is desired because it preserves prominent regions in a resized image. However, CAIR requires high computational complexity to perform in mobile devices, though it is desired for these displays. Moreover, transmitting the side information for CAIR from the encoder is a problem since it usually requires high bitrates compared with those for image data. In this paper, we present a rate-dependent CAIR method that produce various retargeting results based on the bitrates for side information. Furthermore, we apply the proposed technique to wavelet-based image coding. Our proposed content-aware image coding method provides good performances for both CAIR and image coding.

Information

Type
Original Article
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - NCCreative Common License - SA
The online version of this article is published within an Open Access environment subject to the conditions of the Creative Commons Attribution-NonCommercial-ShareAlike license . The written permission of Cambridge University Press must be obtained for commercial re-use.
Copyright
Copyright © The Authors, 2013.
Figure 0

Fig. 1. Visual comparisons of the CAIR methods (Mt. Evans image). Black colored regions represent pixels to be pruned. The original size is 1024 × 768 pixels and it is resized to 1024 × 512. From left to right: (a) original image, (b) seam paths calculated by SC, (c) seam paths calculated by RD-SC, (d) CAIR result by SC and (e) CAIR result by RD-SC.

Figure 1

Table 1. Notation with Respect to an Image I.

Figure 2

Fig. 2. Structure of RD-SC.

Figure 3

Fig. 3. Relationship between s and a. Blue solid and red dashed lines represent s and a, respectively.

Figure 4

Fig. 4. fk, g(ik) where L(k) = 32. The number in legend is corresponding to those in (4–8).

Figure 5

Algorithm 1 Optimization of L(k) with piecewise functions

Figure 6

Fig. 5. Recursive seam approximation process. The red lines represent the division points of so, sub.

Figure 7

Fig. 6. Illustrative example of (13). White and black dots represent core (remaining) and seam pixels, respectively.

Figure 8

Algorithm 2 Iterative optimization of a

Figure 9

Fig. 7. Illustrative example of Algorithm 2.

Figure 10

Table 2. Codeword Assignment for fk, g(i) of RD-SC

Figure 11

Algorithm 3 Multiple seam pruning

Figure 12

Fig. 8. Lifting-based seamlet. Top: transformation of I(i, a(i)). Bottom: illustrative example of seamlet for one seam.

Figure 13

Fig. 9. Encoder structure of the proposed CAIC.

Figure 14

Fig. 10. New parent–children structure of the proposed CAIC. An arrow with a solid line represents the relationship.

Figure 15

Fig. 11. Test images.

Figure 16

Table 3. Sizes of Test Images (HI × WI)

Figure 17

Table 4. Parameter Settings for RD-SC.

Figure 18

Fig. 12. Bitrates for side information. The solid lines represent required bitrates of SPI by SC.

Figure 19

Fig. 13. Approximated seams by RD-SC. The enlarged portions of Pisa Tower are shown. Black lines represent approximated seam paths.

Figure 20

Fig. 14. Approximated seams by RD-SC for various ω. A total of 128 seams are removed regardless of τ for comparison purpose.

Figure 21

Table 5. Linear interpolation results for Crew image.

Figure 22

Fig. 15. Approximated seams by RD-SC for different Nm (Tm = 4). A total of 128 seams are removed regardless of τ for comparison purpose.

Figure 23

Fig. 16. PSNRs of interpolated seams and bitrates for SPI (Crew image, 128 seams).

Figure 24

Fig. 17. Enlarged portions of retargeted Crew image.

Figure 25

Fig. 18. Approximated seams by RD-SC for different Tm (Nm = 1). A total of 128 seams are removed regardless of τ for comparison purpose.

Figure 26

Fig. 19. Resizing results. The left, center, and right columns correspond to the resized images by scaling, SC, and RD-SC, respectively. From top to bottom: Park Joy, Pisa Tower, Beach, and Crew.

Figure 27

Table 6. Execution time (s) of Beach image.

Figure 28

Fig. 20. R-D curves of reconstructed image at full resolution. From left to right, top to bottom: Park Joy, Beach, Pisa Tower, and Crew.

Figure 29

Fig. 21. Reconstructed Park Joy images. Each image is encoded at 1.0 bpp.

Figure 30

Fig. 22. Reconstructed Pisa Tower images. Each image is encoded at 1.0 bpp.

Figure 31

Fig. 23. Reconstructed Beach images. Each image is encoded at 1.0 bpp.

Figure 32

Fig. 24. Reconstructed Crew images. Each image is encoded at 1.0 bpp.

Figure 33

Fig. 25. R-D curves of reconstructed core images for various Δ.