Tagged / centre for intellectual property and policy management

Conversation article: How a New York Times copyright lawsuit against OpenAI could potentially transform how AI and copyright work

Professor Dinusha Mendis writes for The Conversation about the potential copyright implications of AI as a lawsuit is lodged by the New York Times against the creator of ChatGPT…

How a New York Times copyright lawsuit against OpenAI could potentially transform how AI and copyright work

Stas Malyarevsky / Shutterstock

Dinusha Mendis, Bournemouth University

On December 27, 2023, the New York Times (NYT) filed a lawsuit in the Federal
District Court in Manhattan against Microsoft and OpenAI, the creator of ChatGPT,
alleging that OpenAI had unlawfully used its articles to create artificial intelligence (AI) products.

Citing copyright infringement and the importance of independent journalism to democracy, the newspaper further alleged that even though the defendant, OpenAI, may have “engaged in wide scale copying from many sources, they gave Times content particular emphasis” in training generative artificial intelligence (GenAI) tools such as Generative Pre-Trained Transformers (GPT). This is the kind of technology that underlies products such as the AI chatbot ChatGPT.

The complaint by the New York Times states that OpenAI took millions of copyrighted news articles, in-depth investigations, opinion pieces, reviews, how-to guides and more in an attempt to “free ride on the Times’s massive investment in its journalism”.

In a blog post published by OpenAI on January 8, 2024, the tech company responded to the allegations by emphasising its support of journalism and partnerships with news organisations. It went on to say that the “NYT lawsuit is without merit”.

In the months prior to the complaint being lodged by the New York Times, OpenAI had entered into agreements with large media companies such as Axel-Springer and the Associated Press, although notably, the Times failed to reach an agreement with the tech company.

The NYT case is important because it is different to other cases involving AI and copyright, such as the case brought by the online photo library Getty Images against the tech company Stability AI earlier in 2023. In this case, Getty Images alleged that Stability AI processed millions of copyrighted images using a tool called Stable Diffusion, which generates images from text prompts using AI.

The main difference between this case and the New York Times one is that the newspaper’s complaint highlighted actual outputs used by OpenAI to train its AI tools. The Times provided examples of articles that were reproduced almost verbatim.

Use of material

The defence available to OpenAI is “fair use” under the US Copyright Act 1976, section 107. This is because the unlicensed use of copyright material to train generative AI models can serve as a “transformative use” which changes the original material. However, the complaint from the New York Times also says that their chatbots bypassed the newspaper’s paywalls to create summaries of articles.

Even though summaries do not infringe copyright, their use could be used by the New York Times to try to demonstrate a negative commercial impact on the newspaper – challenging the fair use defence.

ChatGPT
Giulio Benzin / Shutterstock

This case could ultimately be settled out of court. It is also possible that the Times’ lawsuit was more a negotiating tactic than a real attempt to go all the way to trial. Whichever way the case proceeds, it could have important implications for both traditional media and AI development.

It also raises the question of the suitability of current copyright laws to deal with AI. In a submission to the House of Lords communications and digital select committee on December 5, 2023, OpenAI claimed that “it would be impossible to train today’s leading AI models without copyrighted materials”.

It went on to say that “limiting training data to public domain books and drawings created more than a century ago might yield an interesting experiment but would not provide AI systems that meet the needs of today’s citizens”.

Looking for answers

The EU’s AI Act –- the world’s first AI Act –- might give us insights into some future directions. Among its many articles, there are two provisions particularly relevant to copyright.

The first provision titled, “Obligations for providers of general-purpose AI
models” includes two distinct requirements related to copyright. Section 1(C)
requires providers of general-purpose AI models to put in place a policy to respect EU copyright law.

Section 1(d) requires providers of general purpose AI systems to draw up and make publicly available a detailed summary about content used for training AI systems.

While section 1(d) raises some questions, section 1(c) makes it clear that any use of copyright protected content requires the authorisation of the rights holder concerned unless relevant copyright exceptions apply. Where the rights to opt out has been expressly reserved in an appropriate manner, providers of general purpose AI models, such as OpenAI, will need to obtain authorisation from rights holders if they want to carry out text and data mining on their copyrighted works.

Even though the EU AI Act may not be directly relevant to the New York Times complaint against OpenAI, it illustrates the way in which copyright laws will be designed to deal with this fast-moving technology. In future, we are likely to see more media organisations adopting this law to protect journalism and creativity. In fact, even before the EU AI Act was passed, the New York Times blocked OpenAI from trawling its content. The Guardian followed suit in September 2023 – as did many others.

However, the move did not allow material to be removed from existing training
data sets. Therefore, any copyrighted material used by the training models up until then would have been used in OpenAI’s outputs –- which led to negotiations between the New York Times and OpenAI breaking down.

With laws such as those in the EU AI Act now placing legal obligations on general purpose AI models, their future could look more constrained in the way that they use copyrighted works to train and improve their systems. We can expect other jurisdictions to update their copyright laws reflecting similar provisions to that of the EU AI Act in an attempt to protect creativity. As for traditional media, ever since the rise of the internet and social media, news outlets have been challenged in drawing readers to their sites and generative AI has simply exacerbated this issue.

This case will not spell the end of generative AI or copyright. However, it certainly raises questions for the future of AI innovation and the protection of creative content. AI will certainly continue to grow and develop and we will continue to see and experience its many benefits. However, the time has come for policymakers to take serious note of these AI developments and update copyright laws, protecting creators in the process.The Conversation

Dinusha Mendis, Professor of Intellectual Property and Innovation Law; Director Centre for Intellectual Property Policy and Managament (CIPPM), Bournemouth University, Bournemouth University

This article is republished from The Conversation under a Creative Commons license. Read the original article.

Two new copyright papers by Business School Professors

Professor Ruth TowseProfessor Ruth Towse’s article What we know, what we don’t know, and what policy-makers would like us to know about the economics of copyright, published in the Review of Economic Research on Copyright Issues (2011, vol. 8(2), pp.101-120) was recently listed on Social Science Research Network’s (SSRN) Top Ten Download list for: Intellectual Property: Copyright Law eJournal.

Ruth Towse is Professor of Economics of Creative Industries in the Business School, and is Centre for Intellectual Property and Policy Management (CIPPM) co-director (economics).

Professor Paul Heald’s ongoing study exploring the public domain effects Professor Paul Healdof copyright law was reported on The Atlantic, among other places. It shows there are twice as many newly published books available on Amazon from 1850 as there are from 1950.

Paul Heald is Professor of Law at the University of Illinois and Professorial Fellow at the CIPPM, Bournemouth University. You can read more about Professor Heald’s work here.

Copyright levies & market growth: Kretschmer presents in Brussels

Professor Martin KretschmerBU’s Professor Martin Kretschmer presented his latest research on copyright levies to over 70 representatives from the European Commission, European Parliament and international organisations and firms including Google, Nokia and Apple in Brussels last week.

The event saw speakers thrash out the role of intellectual property (IP) in digital markets and particularly the barrier copyright levies pose to market growth. (The levy system adds a tariff to blank CDs, MP3 players, printers, PCs and other copying devices, and the money is given as compensation to the IP owner for loss of sale).

Professor Kretschmer’s research reported the results of three product CDsstudies (printer / scanners, portable music / video / game devices and tablet computers) and analysed the relationship between VAT, levy tariffs and retail prices in 20 levy and non-levy countries.

He argued that reproduction of files for personal use, storage or back up should fall under a (non-compensated) copyright exception as there is no harm due to loss of sale, but that file sharing, performance or social network activities will need a licensing solution.

Speaking alongside Kretschmer was Professor Ian Hargreaves; author of the ‘Hargreaves review’, which was conducted in 2011 for Prime Minister David Cameron, recommending an IP framework to support innovation and economic growth in the digital age.

Audio recordings and slides from the event, ‘Intellectual Property for Growth in Digital Markets’, can be accessed via the Bruegel website.

BU Professor of Law presents in Geneva

WIPO logoBournemouth University’s Director for the Centre for Intellectual Property Policy and Management (CIPPM), Professor Martin Kretschmer, has been invited to speak at the World Intellectual Property Organization (WIPO) in Geneva.

Professor Kretschmer will open the ‘Economics of Intellectual Property’ seminar series on 15 February 2012, with a presentation entitled ‘Private copying and fair compensation: An empirical study of copyright levies in Europe’.

His work is the first independent empirical assessment of the European levy system as a whole. It consolidates the evidence on levy setting, collection and distribution and reviews the scope of consumer permissions associated with levy payments.

Professor Kretschmer will present the results of three studies into printer/scanners, portable music/video/game devices, and tablet computers, including his analysis of the relationship between VAT, levy tariffs and retail prices in 20 levy and non-levy countries.

The full seminar series sees six presentations in Geneva between now and November, each by one of the world’s top Intellectual Property researchers. The full programme, including presentations by Professors of Stanford University and the University of Tokyo can be accessed online.

A video version of Professor Kretschmer’s and other ‘Economics of Intellectual Property’ presentations will be available after the event through the WIPO website.