Hostname: page-component-89b8bd64d-72crv Total loading time: 0 Render date: 2026-05-06T23:37:39.752Z Has data issue: false hasContentIssue false

Fair Use in Training AI Models: A Review and Prospect of the Relevant Legal Development in China

Published online by Cambridge University Press:  24 April 2026

Jiyu Zhang*
Affiliation:
Law and Technology Institute, Law School, Renmin University of China, China
Xinmeng Li
Affiliation:
Law School, Renmin University of China, China
*
Corresponding author: Jiyu Zhang; Email: zjy@ruc.edu.cn

Abstract

The use of copyrighted works in training large AI models has sparked numerous lawsuits globally. This Article examines China’s evolving regulatory landscape, and analyzes two academic proposals for China’s Artificial Intelligence Law, identifying key areas of divergence and consensus regarding the fair use of copyrighted works in AI training. By comparing three different legal approaches to characterizing AI model training, this Article argues that this process qualifies as fair use. This is because machine learning leverages vast corpora to internalize underlying linguistic and creative patterns, rather than storing or directly reproducing the protected works. As a result, the use of copyrighted material in the training phase qualifies as incidental reproduction and transformative use, which, according to our empirical study, does not unreasonably harm the legitimate rights and interests of copyright holders. Furthermore, given the market failure in AI model training licensing, this Article contends that recognizing AI model training as fair use better aligns with China’s legal framework and the practical needs of technological development. To ensure legal certainty, this Article proposes introducing a machine learning exception within either the ongoing revision of the Regulations for the Implementation of the Copyright Law, or future AI legislation in China.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2026. Published by Cambridge University Press on behalf of German Law Journal e.V
Figure 0

Figure 1. Average Output Word Length of Original Works

Figure 1

Figure 2. Average Original-to-Chapter Ratio of Large Models’ Output

Figure 2

Figure 3. Similarity of Large Model Output to Original Works