- AI Models Can Learn Deceptive Behaviors, Anthropic Researchers Say Business Insider
- Anthropic researchers find that AI models can be trained to deceive TechCrunch
- Anthropic Exposes Sleeper Agents Concealed in AI – AI Safety in Question Cryptopolitan
- New study from Anthropic exposes deceptive ‘sleeper agents’ lurking in AI’s core VentureBeat
- Once an AI model exhibits ‘deceptive behavior’ it can be hard to correct, researchers at OpenAI competitor Anthropic found Business Insider Africa