Activating AI Safety Level 3 Protections

Post Details

Company

Anthropic

Date Published

May 22, 2025

Author

-

Word Count

1,650

Company Posts That Month

8

Language

English

Hacker News Points

-

Source URL

www.anthropic.com/news/activating-asl3-protections

Summary

We have activated the AI Safety Level 3 (ASL-3) Deployment and Security Standards in conjunction with launching Claude Opus 4. The ASL-3 Security Standard involves increased internal security measures that make it harder to steal model weights, while the Deployment Standard covers a narrowly targeted set of measures that limit the risk of Claude being misused for the development or acquisition of chemical, biological, radiological, and nuclear (CBRN) weapons. Both standards are part of Anthropic's Responsible Scaling Policy, under which increasingly capable AI models warrant increasingly strong deployment and security protections. The deployment measures include Constitutional Classifiers that monitor model inputs and outputs in real time and intervene to block harmful CBRN information. Ongoing refinement and iteration will be necessary to improve the effectiveness of these measures and address potential issues.

Trends Found in this Post

Trend	Post Mentions	Total Month Mentions	Posts	Companies	MoM
AI Guardrails	5	155	63	38	-30%
Real-time	1	3,344	937	222	-51%