Hostname: page-component-6766d58669-76mfw Total loading time: 0 Render date: 2026-05-16T08:02:58.607Z Has data issue: false hasContentIssue false

An Overview of Coding Tools in AV1: the First Video Codec from the Alliance for Open Media

Published online by Cambridge University Press:  24 February 2020

Yue Chen*
Affiliation:
Google, USA
Debargha Mukherjee*
Affiliation:
Google, USA
Jingning Han
Affiliation:
Google, USA
Adrian Grange
Affiliation:
Google, USA
Yaowu Xu
Affiliation:
Google, USA
Sarah Parker
Affiliation:
Google, USA
Cheng Chen
Affiliation:
Google, USA
Hui Su
Affiliation:
Google, USA
Urvang Joshi
Affiliation:
Google, USA
Ching-Han Chiang
Affiliation:
Google, USA
Yunqing Wang
Affiliation:
Google, USA
Paul Wilkins
Affiliation:
Google, USA
Jim Bankoski
Affiliation:
Google, USA
Luc Trudeau
Affiliation:
Mozilla, USA
Nathan Egge
Affiliation:
Mozilla, USA
Jean-Marc Valin
Affiliation:
Amazon, USA
Thomas Davies
Affiliation:
Cisco, UK and Norway
Steinar Midtskogen
Affiliation:
Cisco, UK and Norway
Andrey Norkin
Affiliation:
Netflix, USA
Peter de Rivaz
Affiliation:
Argon Design, UK
Zoe Liu
Affiliation:
Visionular, USA
*
Corresponding authors: Y. Chen and D. Mukherjee E-mails: yuec@google.com and debargha@google.com
Corresponding authors: Y. Chen and D. Mukherjee E-mails: yuec@google.com and debargha@google.com

Abstract

In 2018, the Alliance for Open Media (AOMedia) finalized its first video compression format AV1, which is jointly developed by the industry consortium of leading video technology companies. The main goal of AV1 is to provide an open source and royalty-free video coding format that substantially outperforms state-of-the-art codecs available on the market in compression efficiency while remaining practical decoding complexity as well as being optimized for hardware feasibility and scalability on modern devices. To give detailed insights into how the targeted performance and feasibility is realized, this paper provides a technical overview of key coding techniques in AV1. Besides, the coding performance gains are validated by video compression tests performed with the libaom AV1 encoder against the libvpx VP9 encoder. Preliminary comparison with two leading HEVC encoders, x265 and HM, and the reference software of VVC is also conducted on AOM's common test set and an open 4k set.

Information

Type
Original Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
Copyright © The Authors, 2020
Figure 0

Fig. 1. Partition tree in VP9 and AV1 [32].

Figure 1

Fig. 2. Recursive-filter-based intra predictor.

Figure 2

Fig. 3. Outline of the operations required to build the proposed CfL prediction [20].

Figure 3

Fig. 4. Example of multi-layer structure of a golden-frame group [32].

Figure 4

Fig. 5. Motion field estimation [32].

Figure 5

Fig. 6. Overlapping regions defined for AV1 OBMC.

Figure 6

Fig. 7. Affine warping in two shears [32].

Figure 7

Fig. 8. Wedge codebooks for square and rectangular blocks [32].

Figure 8

Fig. 9. Line number k following direction 0 to 7 in an $8\times 8$ block [29].

Figure 9

Fig. 10. The CDEF constraint function [29].

Figure 10

Fig. 11. In-loop filtering pipeline with optional super-resolution [32].

Figure 11

Fig. 12. Film grain estimation and synthesis framework [31].

Figure 12

Table 1. Encoder configurations.

Figure 13

Table 2. BDRate(%) of libvpx-vp9, x265, HM, and VTM in comparison with libaom AV1 encoder on the objective-1-fast set and the sjtu4k set.

Figure 14

Table 3. Mutual Avg-PSNR BDRates(%) between libvpx-vp9, x265, libaom-AV1, HM, VTM on the objective-1-fast set.

Figure 15

Table 4. Mutual Avg-PSNR BDRates(%) between libvpx-vp9, x265, libaom-AV1, HM, VTM on the sjtu4k set.

Figure 16

Table 5. Encoding and decoding complexity of libvpx-vp9, x265, HM, VTM using libaom-AV1 as the baseline.