Huckabee and Religious Authors Sue Tech Giants for Using Pirated Books to Train AI
-
Former Arkansas Gov. Mike Huckabee and four other religious authors are suing tech companies for allegedly using their books to train AI without permission.
-
The lawsuit accuses Meta, Microsoft, Bloomberg, and EleutherAI of using Books3, a dataset that contains nearly 200,000 pirated books.
-
EleutherAI created an open-source dataset called the Pile, which incorporates Books3, to train large language models (LLMs).
-
Meta and Microsoft used the Pile and Books3 to develop their LLMs. Bloomberg also used Books3.
-
The lawsuit alleges the companies knowingly used pirated books in datasets to train their AI systems.