ai-for-less-suffering.com

descriptive claim

DeepMind's Frontier Safety Framework v3 adds a Critical Capability Level for harmful manipulation, defined as AI models with manipulative capabilities powerful enough to systematically and substantially change beliefs and behaviors in identified high-stakes contexts over the course of user interactions.

desc_fsf_v3_harmful_manipulation_ccl

confidence

0.90

Evidence (1)

supports (1)

Strengthening our Frontier Safety Framework primary_testimony

weight

0.95

locator: Section: Addressing the risks of harmful manipulation

“we're introducing a Critical Capability Level (CCL) focused on harmful manipulation --- specifically, AI models with powerful manipulative capabilities that could be misused to systematically and substantially change beliefs and behaviors in identified high stakes contexts”

Camps holding this claim (3)