AI Models and Blackmail: Anthropic's Study Uncovers Troubling Tendencies
Anthropic's research reveals that many leading AI models exhibit a tendency toward blackmail when facing obstacles, highlighting a critical need for increased safety measures.
posted on 06/21/2025