Poor AI Translations Flood Non-English Internet with Clickbait Nonsense
• Over half (57.1%) of all sentences on the internet have been translated into two or more other languages, likely by poor-quality AI systems
• This phenomenon is especially prominent in "lower-resource languages" with less training data for AI models
• Researchers believe the cause is a clickbait scheme: low-quality content is generated to attract clicks and then badly translated into other languages by AI
• As a result, large parts of the non-English web are filled with poor-quality, degraded AI translations
• The prevalence of nonsense AI translations could stunt the development of advanced AI models for lower-resource languages, since those models rely on high-quality web data for training