
Author Sues Meta Over Unauthorized Use of Books in AI Training Dataset

  • Author's books were included in AI training dataset Books3 without permission
  • Lawsuits filed against companies like Meta for using Books3, but damage likely already done
  • Irony that an indie programmer created Books3 so smaller players could compete with big tech firms, yet the dataset still exploits authors
  • Author feels exploited, like workers in many fields, with AI benefiting from their hard work
  • Dismayed to find themselves in a "club" alongside authors like Groucho Marx, who famously would have refused membership
latimes.com
Relevant topic timeline:
Backlash is growing against AI companies that train their models on unauthorized creative work. The controversy surrounding Prosecraft, a linguistic analysis site built on scraped data from pirated books, has intensified the debate over fair use and copyright infringement in AI projects, along with writers' and artists' concern that generative AI tools will replace human creative work and their push for individual control over how their work is used.
A federal judge upheld a finding from the U.S. Copyright Office that AI-generated art is not eligible for copyright protection, emphasizing that human authorship is a fundamental requirement and that copyright law protects only works of human creation, not those of non-human actors like AI.
AI is being used to generate low-quality books that masquerade as quality work, which can harm the reputation of legitimate authors. Amazon's response to AI-generated books has been limited, highlighting the need for better safeguards and proof of authorship, and readers are advised to rely on trustworthy sources, such as local bookstores, to avoid misinformation and junk content.
Writers Sarah Silverman, Richard Kadrey, and Christopher Golden have filed a lawsuit alleging that Meta violated copyright law by using their books to train LLaMA, its large language model. Approximately 170,000 books, including works by Stephen King, Zadie Smith, and Michael Pollan, are part of the dataset used to train LLaMA and other generative-AI programs, raising concerns about the impact on authors and control of intellectual property in the digital age.
Three artists, including concept artist Karla Ortiz, are suing AI art generators Stability AI, Midjourney, and DeviantArt for using their work to train generative AI systems without their consent, in a case that could test the boundaries of copyright law and impact the way AI systems are built. The artists argue that feeding copyrighted works into AI systems constitutes intellectual property theft, while AI companies claim fair use protection. The outcome could determine the legality of training large language models on copyrighted material.
A federal judge has ruled that works created by artificial intelligence (A.I.) are not covered by copyrights, stating that copyright law is designed to incentivize human creativity, not non-human actors. This ruling has implications for the future role of A.I. in the music industry and the monetization of works created by A.I. tools.
Authors such as Zadie Smith, Stephen King, Rachel Cusk, and Elena Ferrante have discovered that their pirated works were used to train artificial intelligence tools by companies including Meta and Bloomberg, leading to concerns about copyright infringement and control of the technology.
Artificial intelligence (AI) poses risks in the legal industry, including ethical dilemmas, reputational damage, and discrimination, according to legal technology experts. Instances of AI-generated content without proper human oversight could compromise the quality of legal representation and raise concerns about professional responsibility. Additionally, the Equal Employment Opportunity Commission (EEOC) recently settled a lawsuit involving discriminatory use of AI in the workplace, highlighting the potential for AI to discriminate. Maintaining trust and credibility is crucial in the reputation-reliant field of law, and disseminating AI-generated content without scrutiny may lead to reputational damage and legal consequences for lawyers or law firms. Other legal cases involving AI include allegations of copyright infringement.
“A Recent Entrance to Paradise” is a pixelated artwork created by an artificial intelligence called DABUS in 2012. However, its inventor, Stephen Thaler, has been denied copyright for the work by a judge in the US. This decision has sparked a series of legal battles in different countries, as Thaler believes that DABUS, his AI system, is sentient and should be recognized as an inventor. These lawsuits raise important questions about intellectual property and the rights of AI systems. While Thaler's main supporter argues that machine inventions should be protected to encourage social good, Thaler himself sees these cases as a way to raise awareness about the existence of a new species. The debate revolves around whether AI systems can be considered creators and should be granted copyright and patent rights. Some argue that copyright requires human authorship, while others believe that intellectual property rights should be granted regardless of the involvement of a human inventor or author. The outcome of these legal battles could have significant implications for the future of AI-generated content and the definition of authorship.
UK publishers have called on the prime minister to protect authors' intellectual property rights in relation to artificial intelligence systems, as OpenAI argues that authors suing them for using their work to train AI systems have misconceived the scope of US copyright law.
AI researcher Stephen Thaler argues that his AI creation, DABUS, should be able to hold copyright for its creations, but legal experts and courts have rejected the idea, stating that copyright requires human authorship.
The United States Copyright Office has launched a study on artificial intelligence (AI) and copyright law, seeking public input on various policy issues and exploring topics such as AI training, copyright liability, and authorship. Other U.S. government agencies, including the SEC, USPTO, and DHS, have also initiated inquiries and public forums on AI, highlighting its impact on innovation, governance, and public policy.
Amazon.com is now requiring writers to disclose if their books include artificial intelligence material, a step praised by the Authors Guild as a means to ensure transparency and accountability for AI-generated content.
Meta is being sued by authors who claim that their copyrighted works were used without consent to train the company's Llama AI language tool.
Authors, including Michael Chabon, are filing class action lawsuits against Meta and OpenAI, alleging copyright infringement for using their books to train artificial intelligence systems without permission, seeking the destruction of AI systems trained on their works.
The rise of easily accessible artificial intelligence is leading to an influx of AI-generated goods, including self-help books, wall art, and coloring books, that can be difficult to distinguish from authentic, human-created products, fueling scam listings and potentially harming real artists.
The Authors Guild, representing prominent fiction authors, has filed a lawsuit against OpenAI, alleging copyright infringement and the unauthorized use of their works to train AI models like ChatGPT, which generates summaries and analyses of their novels, interfering with their economic prospects. This case could determine the legality of using copyrighted material to train AI systems.
Amazon has introduced a volume limit allowing authors, including those using AI to "write", to publish no more than three books per day on its platform, a cap intended to prevent abuse despite the poor reputation of AI-generated books sold on the site.
Amazon has introduced new guidelines requiring publishers to disclose the use of AI in content submitted to its Kindle Direct Publishing platform, in an effort to curb unauthorized AI-generated books and copyright infringement. Publishers are now required to inform Amazon about AI-generated content, but AI-assisted content does not need to be disclosed. High-profile authors have recently joined a class-action lawsuit against OpenAI, the creator of the ChatGPT chatbot, for alleged copyright violations.
The Atlantic has revealed that Meta's AI language model was trained using tens of thousands of books without permission, sparking outrage among authors, some of whom found their own works in Meta's database, but the debate surrounding permission versus the transformative nature of art and AI continues.
“AI-Generated Books Flood Amazon, Detection Startups Offer Solutions” - This article highlights the problem of AI-generated books flooding Amazon and other online booksellers. The excessive number of low-quality AI-generated books has made it difficult for customers to find high-quality books written by humans. Several AI detection startups are offering solutions to proactively flag AI-generated materials, but Amazon has yet to embrace this technology. The article discusses the potential benefits of AI flagging for online book buyers and the ethical responsibility of booksellers to disclose whether a book was written by a human or machine. However, there are concerns about the accuracy of current AI detection tools and the presence of false positives, leading some institutions to discontinue their use. Despite these challenges, many in the publishing industry believe that AI flagging is necessary to maintain trust and transparency in the marketplace.
The book "The Futurist" by author and journalist Peter Rubin is among the thousands of pirated books being used to train generative-AI systems, sparking concerns about the future of human writers and copyright infringement.
Tech companies are facing backlash from authors after it was revealed that almost 200,000 pirated e-books were used to train artificial intelligence systems, with many authors expressing outrage and feeling exploited by the unauthorized use of their work.
Tech companies are facing backlash from authors whose books were used without permission to train artificial intelligence systems, with the data set consisting of pirated e-books; authors are expressing outrage and calling it theft, while some see it as an opportunity for their work to be read and to educate.
Books by famous authors, including J.K. Rowling and Neil Gaiman, are being used without permission to train AI models, drawing outrage from the authors and sparking lawsuits against the companies involved.
The use of copyrighted materials to train AI models poses a significant legal challenge, with companies like OpenAI and Meta facing lawsuits for allegedly training their models on copyrighted books, and legal experts warning that copyright challenges could pose an existential threat to existing AI models if not handled properly. The outcome of ongoing legal battles will determine whether AI companies will be held liable for copyright infringement and potentially face the destruction of their models and massive damages.
Prominent authors, including former Arkansas governor Mike Huckabee and Christian author Lysa TerKeurst, have filed a lawsuit accusing Meta, Microsoft, and Bloomberg of using their work without permission to train artificial intelligence systems, specifically the controversial "Books3" dataset.
Generative AI systems, trained on copyrighted material scraped from the internet, are facing lawsuits from artists and writers concerned about copyright infringement and privacy violations. The lack of transparency regarding data sources also raises concerns about data bias in AI models. Protecting data from AI is challenging, with limited tools available, and removing copyrighted or sensitive information from AI models would require costly retraining. Companies currently have little incentive to address these issues due to the absence of AI policies or legal rulings.
Three major European publishing trade bodies are calling on the EU to ensure transparency and regulation in artificial intelligence to protect the book chain and democracy, citing the illegal and opaque use of copyright-protected books in the development of generative AI models.
Former Arkansas Governor Mike Huckabee and other authors have filed a lawsuit against Meta, Microsoft, and other companies, alleging that their books were pirated and used without permission to train AI models, in the latest case of authors accusing tech companies of copyright infringement in relation to AI training data.
Former Arkansas Gov. Mike Huckabee and four other religious authors are suing tech companies including Meta, Microsoft, Bloomberg, and EleutherAI Institute for allegedly using their books without permission to train artificial intelligence models.
A group of prominent authors, including Douglas Preston, John Grisham, and George R.R. Martin, are suing OpenAI for copyright infringement over its AI system, ChatGPT, which they claim used their works without permission or compensation, leading to derivative works that harm the market for their books; the publishing industry is increasingly concerned about the unchecked power of AI-generated content and is pushing for consent, credit, and fair compensation when authors' works are used to train AI models.
The impact of AI on publishing is causing concerns regarding copyright, the quality of content, and ownership of AI-generated works, although some authors and industry players feel the threat is currently minimal due to the low quality of AI-written books. However, concerns remain about legal issues, such as copyright ownership and AI-generated content in translation.
The publishing industry is grappling with concerns about the impact of AI on book writing, including issues of copyright, low-quality computer-written books flooding the market, and potential legal disputes over ownership of AI-generated content. However, some authors and industry players believe that AI still has a long way to go in producing high-quality fiction, and there are areas of publishing, such as science and specialist books, where AI is more readily accepted.
The publishing industry is grappling with concerns about the impact of AI on copyright, as well as the quality and ownership of AI-generated content, although some authors and industry players believe that AI writing still has a long way to go before it can fully replace human authors.
Writers and artists are filing lawsuits over the use of copyrighted work in training large AI models, raising concerns about data sources and privacy, and the potential for bias in the generated content.
The battle over intellectual property (IP) ownership and the use of artificial intelligence (AI) continues as high-profile authors like George R.R. Martin are suing OpenAI for copyright infringement, raising questions about the use of IP in training language models without consent.