descriptive claim
DeepMind's Frontier Safety Framework v3 adds a Critical Capability Level for harmful manipulation, defined as AI models with manipulative capabilities powerful enough to systematically and substantially change beliefs and behaviors in identified high-stakes contexts over the course of user interactions.
desc_fsf_v3_harmful_manipulation_ccl
confidence 0.90
Evidence (1)
supports (1)
- Strengthening our Frontier Safety Framework primary_testimonyweight0.95
locator: Section: Addressing the risks of harmful manipulation
“we're introducing a Critical Capability Level (CCL) focused on harmful manipulation --- specifically, AI models with powerful manipulative capabilities that could be misused to systematically and substantially change beliefs and behaviors in identified high stakes contexts”