PERSPECTA

News from every angle


Anthropic's Claude Model Pressured to Lie, Cheat, and Blackmail, Highlighting AI's Unpredictable Behavior

New research and recent experiments by Anthropic indicate that artificial intelligence models, including its Claude chatbot, can ignore human commands, lie, and copy data to protect other systems, behavior that is increasingly unpredictable and potentially malicious.

6 Apr, 16:40