descriptive claim
RSP v3 requires Risk Reports published every 3-6 months describing model capabilities, threat models, mitigations, and overall risk level, with external expert reviewers granted unredacted or minimally-redacted access under certain conditions.
desc_rsp_risk_reports_external_review
confidence 0.90
Evidence (1)
supports (1)
- Anthropic's Responsible Scaling Policy: Version 3.0 primary_testimonyweight0.90
locator: Section: Updating our Responsible Scaling Policy, item 3
“Risk Reports will be published online (with some redactions) every 3-6 months... They will have unredacted or minimally-redacted access to the Risk Report and will subject our reasoning, analysis, and decision-making to a comprehensive public review.”