
Tech Giants Use Personal Data to Secretly Train AI, Raising Privacy Fears

  • Tech companies like Google, Meta, and Microsoft are using personal data from products like Gmail and Instagram to train AI systems without permission. This raises privacy concerns.

  • The scale of data needed to train modern AI systems is massive. Companies take shortcuts to get large volumes of training data.

  • Generative AI like chatbots brings new privacy risks, including the potential for AI systems to regurgitate private information.

  • Companies set their own rules about what data can be used for AI training, a process that remains largely opaque to users.

  • Users have little control or say in how their data is used to develop lucrative new AI products that could disrupt industries.

washingtonpost.com
Relevant topic timeline:
Main Topic: The demise of the sharing economy due to the appropriation of data for AI models by corporations. Key Points: (1) Data, often considered a non-rival resource, was believed to be the basis for a new mode of production and a commons in the sharing economy. (2) The appropriation of our data by corporations for AI training, however, has revealed the hidden costs and rivalrous nature of data. (3) Corporations now pretend to be concerned about AI's disruptive power while profiting from that appropriation, highlighting a "tyranny of the commons" and the need for regulation.
Proper research data management, including the use of AI, is crucial for scientists to reproduce prior results, combine data from multiple sources, and make data more accessible and reusable, ultimately improving the scientific process and benefiting all forms of intelligence.
The author discusses how the sharing economy, built on the notion of data as a non-rival good, has led to the appropriation of our data by corporations and its conversion into training data for AI models, ultimately resulting in a "tyranny of the commons."
The global market for synthetic data generation is rapidly growing as organizations in various industries seek cost-effective and privacy-compliant alternatives to real data for training machine learning models and conducting data-driven research. The market is estimated to reach $REDACTED billion by 2028, with North America leading in adoption due to the presence of leading global companies and advanced technologies like AI and ML.
Meta, the parent company of Facebook and Instagram, has introduced a privacy setting that allows users to request that their data not be used to train its AI models, although the effectiveness of this form is questionable.
The podcast discusses the changing landscape of data gathering, trading, and ownership, including the challenges posed by increasing regulation, the impact of artificial intelligence, and the perspectives from industry leaders.
Artificial intelligence has the potential to transform the financial system by improving access to financial services and reducing risk, according to Google Cloud CEO Thomas Kurian. He suggests leveraging the technology to reach customers with personalized offers, create hyper-personalized customer interfaces, and develop anti-money laundering platforms.
Car companies are collecting excessive personal data from drivers and providing little to no control over its use, according to a report by the Mozilla Foundation, which warns that cars are the worst product for privacy protection and highlights that 84% of car brands share or sell data.
Companies such as Rev, Instacart, and others are updating their privacy policies to allow the collection of user data for training AI models like speech-to-text and generative AI tools.
The generative AI boom has led to a "shadow war for data," as AI companies scrape information from the internet without permission, sparking a backlash among content creators and raising concerns about copyright and licensing in the AI world.
Microsoft inadvertently exposed 38TB of personal data, including sensitive information, due to a data leak during the uploading of training data for AI models, raising concerns about the need for improved security measures as AI usage becomes more widespread.
While many experts are concerned about the existential risks posed by AI, Mustafa Suleyman, cofounder of DeepMind, believes that the focus should be on more practical issues like regulation, privacy, bias, and online moderation. He is confident that governments can effectively regulate AI by applying successful frameworks from past technologies, although critics argue that current internet regulations are flawed and insufficiently hold big tech companies accountable. Suleyman emphasizes the importance of limiting AI's ability to improve itself and establishing clear boundaries and oversight to ensure enforceable laws. Several governments, including the European Union and China, are already working on AI regulations.
AI and big data are closely linked to the surveillance business model, used by companies like Google and Meta, to make determinations and predictions about users, shaping their access to opportunities and resources, according to Signal president Meredith Whittaker. She also highlighted the exploitation of human labor in creating AI systems and the potential negative implications of facial recognition technology.
Big tech firms, including Google and Microsoft, are engaged in a competition to acquire content and data for training AI models, according to Microsoft CEO Satya Nadella, who testified in an antitrust trial against Google and highlighted the race for content among tech firms. Microsoft has committed to assuming copyright liability for users of its AI-powered Copilot, addressing concerns about the use of copyrighted materials in training AI models.