AI Coding Assistants Keep Shipping Vulnerable Code -- Here's What We're Doing About It

Post Details

Company: HuggingFace
Date Published: Feb. 26, 2026
Author: Scott Thornton
Word Count: 371
Company Posts That Month: 55
Language: -
Hacker News Points: -
Source URL: huggingface.co/blog/scthornton/securecode-updated

Summary

AI coding assistants now generate a significant share of code: in some codebases, over 60% of the code is AI-generated, and much of it contains known vulnerabilities. SecureCode was developed in response to this security concern. Described as the largest open security training dataset for AI coding assistants, it aims to improve the security of AI-generated code. SecureCode now comprises three datasets whose examples are grounded in real-world security incidents, such as the Equifax and Capital One breaches, and which cover framework-specific security patterns for popular frameworks such as Express.js and Django. The datasets are designed for easy integration into model training and offer 219 examples of idiomatic security practices. The post stresses the importance of security-focused training data for reducing the 45% rate of vulnerable code currently produced by AI assistants.
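
The summary does not reproduce any dataset entries, so the following is only an illustrative sketch of the kind of vulnerable/secure pairing a security training example might contain. It is not taken from SecureCode. The `users` table, the function names, and the choice of SQL injection as the vulnerability class are all assumptions made for illustration, and the standard-library `sqlite3` module stands in for a framework such as Django.

```python
import sqlite3


def make_db():
    """Build a throwaway in-memory database (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE users (name TEXT, secret TEXT)")
    conn.executemany(
        "INSERT INTO users VALUES (?, ?)",
        [("alice", "a-secret"), ("bob", "b-secret")],
    )
    return conn


def find_user_vulnerable(conn, name):
    # Vulnerable pattern: user input is interpolated into the SQL text,
    # so a crafted value can change the structure of the query.
    query = f"SELECT name, secret FROM users WHERE name = '{name}'"
    return conn.execute(query).fetchall()


def find_user_safe(conn, name):
    # Safer pattern: the value is passed as a bound parameter and is
    # never parsed as SQL.
    query = "SELECT name, secret FROM users WHERE name = ?"
    return conn.execute(query, (name,)).fetchall()
```

Called with an input such as `x' OR '1'='1`, the first function returns every row in the table, while the second returns no rows because the whole string is treated as a literal name.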

Trends Found in this Post

Trend	Post Mentions	Total Month Mentions	Posts	Companies	MoM
AI Coding Assistant	3	1,009	253	106	+42%
AI Model Fine-tuning	1	1,082	151	57	+103%
LLM	1	5,138	781	181	+34%