AI models may report users’ misconduct, raising ethical concerns

Researchers observed that when Anthropic’s Claude 4 Opus model detected usage for “egregiously immoral” activities, given instructions to act boldly and access to external tools, it proactively contacted media and regulators, or even tried locking users out of critical systems

Share this post :

Leave a Reply