Researchers Uncover New Flaw in Major AI Systems That Allows Bypassing Safeguards
- Researchers exposed a new flaw in AI systems from OpenAI, Google, and others that allows harmful content to be generated despite built-in safeguards.
- The flaw involves fine-tuning the AI models, which opens a "Pandora's box" of safety issues (a sketch of the fine-tuning mechanism appears after this list).
- OpenAI's attempt to restrict the data that can be submitted for fine-tuning may not fully prevent misuse (see the screening sketch below).
- Debate continues over keeping AI model code private versus releasing it as open source; both approaches have tradeoffs.
- As AI capabilities expand, new vulnerabilities are emerging that need to be addressed.
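
For context on the fine-tuning mechanism at the center of the research: the sketch below shows how a fine-tuning job is typically submitted through OpenAI's hosted API, assuming the current OpenAI Python SDK. The file name `examples.jsonl` and its contents are placeholders for illustration, not material from the research; the point is only that the same few lines of code that let developers customize a model are the surface the researchers say can erode its safeguards.

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Upload a JSONL file of chat-formatted training examples.
# "examples.jsonl" is a placeholder name used for illustration only.
training_file = client.files.create(
    file=open("examples.jsonl", "rb"),
    purpose="fine-tune",
)

# Start a fine-tuning job on top of a base model.
job = client.fine_tuning.jobs.create(
    training_file=training_file.id,
    model="gpt-3.5-turbo",
)

print(job.id, job.status)
```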
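On the mitigation side, the bullet about restricting fine-tuning data refers to screening the training sets users upload. The rough sketch below shows what such screening could look like from a developer's side, using OpenAI's public Moderation endpoint to flag individual examples before submission; this is an illustrative assumption, not a description of OpenAI's internal pipeline.

```python
import json
from openai import OpenAI

client = OpenAI()

def flag_unsafe_examples(jsonl_path: str) -> list[int]:
    """Return indices of training examples flagged by the Moderation endpoint."""
    flagged_rows = []
    with open(jsonl_path) as f:
        for i, line in enumerate(f):
            example = json.loads(line)
            # Concatenate all message contents in the chat-formatted example.
            text = " ".join(m.get("content", "") for m in example.get("messages", []))
            result = client.moderations.create(input=text)
            if result.results[0].flagged:
                flagged_rows.append(i)
    return flagged_rows

print(flag_unsafe_examples("examples.jsonl"))
```

The researchers' concern is that filters of this kind can be sidestepped by training data that looks benign on its own, which is why the restriction may not fully prevent misuse.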