Anthropic’s Responsible Scaling Policy: Version 3.0

Post Details

Company

Anthropic

Date Published

Feb. 26, 2026

Author

Anthropic Team

Word Count

2,379

Language

English

Hacker News Points

-

Source URL

www.anthropic.com/news/responsible-scaling-policy-v3

Summary

Anthropic's third version of its Responsible Scaling Policy (RSP) aims to address the evolving risks of AI systems by reinforcing effective measures, improving transparency, and introducing new measures for accountability. Initially introduced in September 2023, the RSP was designed to manage risks that could emerge rapidly with advancing AI technology, using a principle of conditional commitments tied to capability levels. While the RSP has successfully incentivized Anthropic and other companies to adopt stronger safeguards and inform early AI policy, challenges remain, particularly in achieving consensus on AI risks and implementing higher-level safeguards unilaterally. The updated RSP includes a clearer separation of company plans from industry recommendations, introduces a Frontier Safety Roadmap to outline risk mitigation strategies, and mandates regular Risk Reports with external reviews to enhance safety and transparency. As the AI landscape continues to evolve, Anthropic commits to revising its policy to adapt to new capabilities while encouraging multilateral action to address industry-wide risks.