Sunday, 6 April

Saturday, 5 April 2025

Study Suggests OpenAI's AI Models Memorize Copyrighted Content

OpenAI's AI models, including GPT-4, may have memorized portions of copyrighted materials. Researchers from the University of Washington, the University of Copenhagen, and Stanford developed a method to detect "memorized" content by analyzing the models' ability to predict uncommon words in text snippets. The findings suggest that GPT-4 retained passages from fiction books and New York Times articles, raising concerns about the use of copyrighted data in AI training.

Read full story at TechCrunch
