Hostname: page-component-6766d58669-zlvph Total loading time: 0 Render date: 2026-05-14T15:55:13.988Z Has data issue: false hasContentIssue false

Two-stage pyramidal convolutional neural networks for image colorization

Published online by Cambridge University Press:  08 October 2021

Yu-Jen Wei
Affiliation:
Department of Electrical Engineering, National Taipei University of Technology, Taipei, Taiwan, R.O.C
Tsu-Tsai Wei
Affiliation:
Department of Electrical Engineering, National Taipei University of Technology, Taipei, Taiwan, R.O.C
Tien-Ying Kuo*
Affiliation:
Department of Electrical Engineering, National Taipei University of Technology, Taipei, Taiwan, R.O.C
Po-Chyi Su
Affiliation:
Department of Computer Science and Information Engineering, National Central University, Taoyuan City, Taiwan, R.O.C
*
Corresponding author: Tien-Ying Kuo Email: tykuo@ntut.edu.tw

Abstract

The development of colorization algorithms through deep learning has become the current research trend. These algorithms colorize grayscale images automatically and quickly, but the colors produced are usually subdued and have low saturation. This research addresses this issue of existing algorithms by presenting a two-stage convolutional neural network (CNN) structure with the first and second stages being a chroma map generation network and a refinement network, respectively. To begin, we convert the color space of an image from RGB to HSV to predict its low-resolution chroma components and therefore reduce the computational complexity. Following that, the first-stage output is zoomed in and its detail is enhanced with a pyramidal CNN, resulting in a colorized image. Experiments show that, while using fewer parameters, our methodology produces results with more realistic color and higher saturation than existing methods.

Information

Type
Original Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
Copyright © The Author(s), 2021 published by Cambridge University Press in association with Asia Pacific Signal and Information Processing Association
Figure 0

Fig. 1. Flowchart of proposed architecture.

Figure 1

Fig. 2. Consequences of using low-resolution chroma maps. (a) Original images. (b) Images created using low-resolution chroma maps.

Figure 2

Fig. 3. Low-resolution chroma map generation network (LR-CMGN).

Figure 3

Fig. 4. Refinement colorization network (RCN).

Figure 4

Fig. 5. Problems in using objective image quality assessment metrics to evaluate colorization results. (a) Ground truth. (b) Iizuka et al. [23]. (c) Zhang et al. [25]. (d) Proposed method.

Figure 5

Fig. 6. Images produced by colorization methods with different stages. (a) Only first-stage. (b) Proposed two-stage method.

Figure 6

Fig. 7. Original design of refinement network.

Figure 7

Fig. 8. Results of using different refinement network structures. (a) Original design (13 convolutional layers). (b) Pyramidal structure.

Figure 8

Fig. 9. Comparisons of colorization results using different color spaces. (a) YUV. (b) Lab. (c) HSV.

Figure 9

Fig. 10. Comparisons of different colorization methods used on indoor images. (a) Ground truth. (b) [23]. (c) [25]. (d) [8]. (e) [32]. (f) Proposed method.

Figure 10

Fig. 11. Comparisons of different colorization methods used on outdoor images. (a) Ground truth. (b) [23]. (c) [25]. (d) [8]. (e) [32]. (f) Proposed method.

Figure 11

Fig. 12. More results by applying our method to grayscale images. (a) Grayscale image. (b) Ground truth. (c) Proposed method.

Figure 12

Table 1. Comparisons of the number of models parameters.

Figure 13

Fig. 13. Failure cases of our colorization method. (a) Ground truth. (b) Proposed method.