ai-for-less-suffering.com

descriptive claim

Anthropic has implemented preliminary egress bandwidth controls as part of ASL-3 security, exploiting the large size of model weights so that rate limits on outbound network traffic can detect and block weight exfiltration even if an attacker has otherwise significantly compromised internal systems.

desc_asl3_egress_bandwidth_control

confidence

0.85

Evidence (1)

supports (1)

Activating AI Safety Level 3 protections primary_testimony

weight

0.90

locator: Security section

“By limiting the rate of outbound network traffic, these controls can leverage model weight size to create a security advantage... we expect to get to the point where rate limits are low enough that exfiltrating model weights before being detected is very difficult—even if an attacker has otherwise significantly compromised our systems.”

Camps holding this claim (4)