Meta, Microsoft and Bloomberg Sued for Allegedly Using Pirated Books to Train AI
-
A group of authors including Mike Huckabee and Lysa TerKeurst filed a lawsuit accusing Meta, Microsoft, and Bloomberg of copyright infringement for allegedly using their books without permission to train AI systems.
-
The lawsuit centers around the Books3 dataset which contains thousands of pirated books. The authors say their books are in this dataset which was used to train models like Meta's Llama 2 and Bloomberg's BloombergGPT.
-
The authors argue the companies gained significant value from their books without authorization and are seeking monetary damages.
-
This lawsuit follows other recent copyright cases against tech companies over using copyrighted content like visual art to train AI models without permission.
-
The companies argue their use of the data is protected under fair use provisions of copyright law, but the authors say it was outright theft.