Copyright disputes with generative artificial intelligence companies that unlawfully use scholarly research without permission or聽citation are costing academic publishers tens of聽thousands of聽dollars to聽resolve, the head of a聽leading US聽university press has said.
Speaking at the Research and Scholarly Publishing Forum at聽the London Book Fair, Christie Henry, director of Princeton University Press, said the cost of聽litigation with technology companies was beginning to聽mount as聽many more authors discovered their work had been absorbed by聽large language models (LLMs) whose written answers did聽not provide attribution to its source material.
鈥淩emedies have been in the twenties of thousands of dollars,鈥 Ms Henry said at the 12聽March event, which was dominated by discussion of the growing tension between publishers and technology companies about how AI聽firms were using published works without citations or remuneration to authors or publishing houses.
Authors were rightly concerned that their published outputs were being used to train LLMs that would then produce distorted versions of their work without any recompense or proper attribution, Ms Henry told 糖心Vlog.
糖心Vlog
鈥淲hen authors ask me if their work is being used to train LLMs, I聽can鈥檛 say that it鈥檚 not 鈥 that鈥檚 an uncomfortable position for a publisher,鈥 Ms聽Henry said about what she considered to be clear breaches of existing copyright and licensing rules.
However, these breaches were often very hard to spot given the nature of generative聽AI, she added. 鈥淭he content is disaggregated and then reformed 鈥 there are often pieces of chapters or books that appear uncredited,鈥 continued Ms Henry.
糖心Vlog
鈥淚鈥檓 not persuaded by the arguments from big tech that it鈥檚 too expensive or complicated to set up the licensing agreements that are needed, and I鈥檓 certainly concerned by their arguments that publishers are the ones who are obstructing knowledge,鈥 said Ms Henry, who said LLMs should simply follow agreed rules and principles on citation and licensing.
Academics were becoming increasingly concerned about how their scholarly work was being reformulated by聽AI, other speakers told the conference.
鈥淚鈥檝e spoken to people who published under CC-BY [licences] and did not expect their work to be used in this way 鈥 what academics do [with scholarly material] is very different to the way robots are stitching together materials to create a facsimile of research,鈥 said Leslie Lansman, global permissions manager at Springer Nature.
The forum also heard concerns that it was proving impossible for publishers and technology firms to reach an agreement on AI content use, with work on a UK voluntary code being dropped last month after a lengthy stalemate.
糖心Vlog
Caroline Cummins, director of policy and affairs at the Publishers鈥 Association, said the breakdown in relations occurred because 鈥渟ome AI聽firms do聽not accept that what they have done amounts to mass copyright infringement鈥.
鈥淚f you do not have that acceptance, it鈥檚 hard to have a dialogue,鈥 she explained.
Catriona MacLeod Stevenson, general counsel and deputy chief executive of the Publishers鈥 Association, said she also had concerns about the European Union鈥檚 new legislation on AI聽regulation, which would require authors to 鈥渙pt out鈥 of their work being used to train AI models 鈥 a position that in effect 鈥渢urned copyright law on its head鈥, given that protections usually occur automatically.
But Richard Mollet, head of European government affairs at RELX, which owns Elsevier, was more optimistic about the EU鈥檚 new rules, as they will require LLMs to state which resources they have used.
糖心Vlog
鈥淚f you are an AI, you have to have a sufficient summary of what has been used鈥nd we need to know what [has been used] in an LLM now and in the future,鈥 he said, adding that these content summaries should be welcomed by both technology and publishing firms.
鈥淎ny time you hear someone from Meta or another tech company say they believe in trustworthy聽AI, that should mean they know what is being used [in an LLM],鈥 he said.
糖心Vlog
Register to continue
Why register?
- Registration is free and only takes a moment
- Once registered, you can read 3 articles a month
- Sign up for our newsletter
Subscribe
Or subscribe for unlimited access to:
- Unlimited access to news, views, insights & reviews
- Digital editions
- Digital access to 罢贬贰鈥檚 university and college rankings analysis
Already registered or a current subscriber?








