ai-for-less-suffering.com

← all claims

descriptive claim

RSP v3 requires Risk Reports published every 3-6 months describing model capabilities, threat models, mitigations, and overall risk level, with external expert reviewers granted unredacted or minimally-redacted access under certain conditions.

desc_rsp_risk_reports_external_review

confidence
0.90

Evidence (1)

supports (1)

  • weight
    0.90

    locator: Section: Updating our Responsible Scaling Policy, item 3

    “Risk Reports will be published online (with some redactions) every 3-6 months... They will have unredacted or minimally-redacted access to the Risk Report and will subject our reasoning, analysis, and decision-making to a comprehensive public review.”

Camps holding this claim (3)