Research Article
Open access
Published on 24 April 2025
Download pdf
Ding,Y. (2025). Fair Use in AI Data Training: Judgment Criteria and Balancing Mechanisms. Theoretical and Natural Science,101,69-77.
Export citation

Fair Use in AI Data Training: Judgment Criteria and Balancing Mechanisms

Yaorunyang Ding *,1,
  • 1 Institute of Foreign Languages, Heilongjiang University

* Author to whom correspondence should be addressed.

https://doi.org/10.54254/2753-8818/2025.CH22295

Abstract

Against the backdrop of the rapid development of artificial intelligence technology, the issue of fair use in AI data training has sparked widespread attention and discussion. Large model training relies on massive amounts of data, whose technical characteristics differ significantly from how traditional works are used. However, market failures are prevalent, with high licensing costs and difficulties in obtaining rights holders' permissions hindering the effective operation of traditional authorization mechanisms. Therefore, the use of works in large-scale model training should be considered as fair use, since it has a limited impact on the legitimate rights of copyright holders while offering significant social and public benefits. In addition, within the framework of copyright law, it is essential to clarify the rules and criteria for fair use in machine learning and to define the obligations and responsibilities of AI data trainers. This will help balance the interests of copyright holders, society, and data trainers, thereby promoting the healthy and sustainable development of AI technology and the adaptive evolution of copyright law.

Keywords

AI data training, Fair use, Interest balancing, Machine learning types

[1]. Washington Law Review, [M]. How Copyright Law Can Fix Artificial Intelligence’s Implicit Bias Problem, 2018 ,93(2), pp. 583-584. the Journal of Legal studies, [J]. An Economic Analysis of Copyright Law, ,1989,18(2), pp.326-361

[2]. Liang Zhiwen. On Legal Protection of Artificial Intelligence Creations [J]. Legal Science (Journal of Northwestern University of Politics and Law), 2017, 35 (05):156-165

[3]. L.Ray Patterson, Stanley W. Lindberg, The Nature of Copyright: A Law of Users’ Right, 1991

[4]. Liu Chuntian: Intellectual Property Law, [M] Higher Education Press, 2015 5th Edition, p. 122

[5]. Sounds of Science: Copyright Infringement in AI Music Generator Outputs, [J] Cath. U. J. L. & Tech, 2021,29(2), pp.185-218.

[6]. Wu Han Dong. A Question on Copyright Law of Artificially Generated Works [J]. Chinese and Foreign Jurisprudence, 2020, 32 (03): 653-673.

[7]. Du Jialu. Behavioral Interpretation and Benefit Balance: Fair Use of Artificial Intelligence Training Copyright [J]. Electronic intellectual property, 2024, (06): 27-39.

[8]. Article 47 of the Copyright Law of the People's Republic of China

[9]. Google Books Library Project,Books[EB/OL]. [2014-05-06].

[10]. Campbell v. Acuff-Rose Music, Inc., 510 U.S. 569, 577 (1994).

[11]. Artificial Intelligence’s Fair Use Crisis, COLUM.J.L. & ARTS ,2017,41, pp.45-97.

[12]. United States Copyright Act, Article 17, Section 107

[13]. Authors Guild,Inc. v.Google,Inc. 804 F.3d 202(2nd Cir. 2015).

Cite this article

Ding,Y. (2025). Fair Use in AI Data Training: Judgment Criteria and Balancing Mechanisms. Theoretical and Natural Science,101,69-77.

Data availability

The datasets used and/or analyzed during the current study will be available from the authors upon reasonable request.

Disclaimer/Publisher's Note

The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of EWA Publishing and/or the editor(s). EWA Publishing and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

About volume

Volume title: Proceedings of CONF-MPCS 2025 Symposium: Mastering Optimization: Strategies for Maximum Efficiency

ISBN:978-1-80590-017-7(Print) / 978-1-80590-018-4(Online)
Conference date: 21 March 2025
Editor:Anil Fernando, Marwan Omar
Series: Theoretical and Natural Science
Volume number: Vol.101
ISSN:2753-8818(Print) / 2753-8826(Online)

© 2024 by the author(s). Licensee EWA Publishing, Oxford, UK. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license. Authors who publish this series agree to the following terms:
1. Authors retain copyright and grant the series right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this series.
2. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the series's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this series.
3. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See Open access policy for details).