ai-for-less-suffering.com

descriptive claim

Anthropic's Responsible Scaling Policy ties required safeguards (ASL-3 Security and Deployment Standards) to pre-specified capability thresholds, including CBRN uplift for moderately resourced state programs and two distinct AI R&D thresholds (full automation of entry-level AI research; dramatic acceleration of effective scaling).

desc_anthropic_rsp_capability_thresholds

confidence

0.90

Evidence (1)

supports (1)

Anthropic's Responsible Scaling Policy primary_testimony

weight

0.95

locator: March 31, 2025 update

“we have added a new capability threshold related to CBRN development... we have disaggregated our existing AI R&D capability thresholds, separating them into two distinct levels (the ability to fully automate entry-level AI research work, and the ability to cause dramatic acceleration in the rate of effective scaling)”

Camps holding this claim (3)