descriptive claim
Anthropic's Responsible Scaling Policy ties required safeguards (ASL-3 Security and Deployment Standards) to pre-specified capability thresholds, including CBRN uplift for moderately resourced state programs and two distinct AI R&D thresholds (full automation of entry-level AI research; dramatic acceleration of effective scaling).
desc_anthropic_rsp_capability_thresholds
confidence 0.90
Evidence (1)
supports (1)
- Anthropic's Responsible Scaling Policy primary_testimony weight 0.95
locator: March 31, 2025 update
“we have added a new capability threshold related to CBRN development... we have disaggregated our existing AI R&D capability thresholds, separating them into two distinct levels (the ability to fully automate entry-level AI research work, and the ability to cause dramatic acceleration in the rate of effective scaling)”