Hostname: page-component-89b8bd64d-ksp62 Total loading time: 0 Render date: 2026-05-08T20:57:47.121Z Has data issue: false hasContentIssue false

Discriminating multiple JPEG compressions using first digit features

Published online by Cambridge University Press:  22 December 2014

Simone Milani*
Affiliation:
Dipartimento di Elettronica, Informazione, e Bioingegneria (DEIB), Politecnico di Milano, P.za Leonardo da Vinci 32, 20133 Milano, Italy
Marco Tagliasacchi
Affiliation:
Dipartimento di Elettronica, Informazione, e Bioingegneria (DEIB), Politecnico di Milano, P.za Leonardo da Vinci 32, 20133 Milano, Italy
Stefano Tubaro
Affiliation:
Dipartimento di Elettronica, Informazione, e Bioingegneria (DEIB), Politecnico di Milano, P.za Leonardo da Vinci 32, 20133 Milano, Italy
*
Corresponding author: S. Milani simone.milani@polimi.it

Abstract

The analysis of JPEG double-compressed images is a problem largely studied by the multimedia forensics community, as it might be exploited, e.g., for tampering localization or source device identification. In many practical scenarios, like photos uploaded on blogs, on-line albums, and photo sharing web sites, images might be JPEG compressed several times. However, the identification of the number of compression stages applied to an image remains an open issue. We proposes a forensic method based on the analysis of the distribution of the first significant digits of the discrete cosine transform coefficients, which follow Benford's law in images compressed just once. Then, the detector is optimized and extended in order to identify accurately the number of compression stages applied to an image. The experimental validation considers up to four consecutive compression stages and shows that the proposed approach extends and outperforms the previously-published algorithms for double JPEG compression detection.

Information

Type
Original Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
Copyright © The Authors, 2015
Figure 0

Fig. 1. Bock diagram for multiple JPEG compression.

Figure 1

Fig. 2. Probability mass functions of FDs and frequency spectra of their differences χ(yN) with respect to a Benford's model at different coding stages. The parameter F denotes the normalized frequency for DFT coefficients of χ. The graphs are related to coefficient at position (1, 0) for image house. (a) P[m1 = k]. (b) P[m2 = k]. (c) P[m3 = k]. (d) P[m4 = k]. (e) DFT(χ(m1)). (f) DFT(χ(m2)). (g) DFT(χ(m3)). (h) DFT(χ(m4)).

Figure 2

Fig. 3. Accuracy of single versus double compression detector for different feature arrays. (a) three p(m) values, (b) two p(m) values, and (c) four p(m) values.

Figure 3

Fig. 4. Block diagram of the proposed detector.

Figure 4

Table 1. Confusion matrix for QFN = 75 and dataset D0. (a) Proposed method, (b) classifier in [18] (adapted).

Figure 5

Table 2. Confusion matrix for QFN = 80 and dataset D0. (a) Proposed method, (b) classifier in [18] (adapted).

Figure 6

Table 3. Confusion matrix for QFN = 90 and dataset D0. (a) Proposed method, (b) classifier in [18] (adapted).

Figure 7

Table 4. Confusion matrix with QFN = 80 on dataset D0 for the detection of five compression stages (proposed algorithm).

Figure 8

Fig. 5. ROC curves of different double compression detectors for different QFs on dataset D0. (a) QF = 70, (b) QF = 80.

Figure 9

Table 5. Computational time for the proposed method and the classifier in [18] (adapted).

Figure 10

Fig. 6. Accuracy versus dQF for images coded with QFN = 70 on dataset D0. The graph also reports the average PSNR decrement (dB) for each point.

Figure 11

Table 6. Confusion matrix for QFN = 80 from datasets D1. (a) Proposed method, (b) classifier in [18] (adapted).

Figure 12

Table 7. Confusion matrix for QFN = 80 from datasets D2. (a) Proposed method, (b) classifier in [18] (adapted).