Custom Kernels for All from Codex and Claude
Blog post from Hugging Face
A new agent skill teaches coding agents to write production-ready CUDA kernels, a common lever for squeezing performance out of GPU workloads. The skill supplies the domain knowledge agents typically lack: GPU architecture specifics, memory access patterns, strategies for integrating custom kernels with libraries like PyTorch, and guidance on benchmarking kernels against baseline implementations.

To validate the skill, it was pointed at real-world targets: a diffusers pipeline and a transformers model. In both cases, agents generated and optimized kernels that outperformed the baseline PyTorch implementations.

The skill also integrates with the Hugging Face Kernel Hub, which distributes pre-compiled kernels so they can be loaded directly rather than compiled from source. This simplifies deployment and means custom kernels are not just easier to develop but also easy to share and reuse across platforms.
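Benchmarking a candidate kernel against a baseline generally follows a warmup-then-measure pattern. The helper below is a minimal, framework-agnostic sketch of that loop (the example functions are illustrative); note that real GPU timing must account for asynchronous kernel launches, e.g. by synchronizing the device with `torch.cuda.synchronize()` or using CUDA events, which this CPU-only sketch omits.

```python
import time
import statistics

def benchmark(fn, *args, warmup=5, iters=20):
    """Time fn(*args), discarding warmup runs; return the median in seconds.

    For real GPU kernels, launches are asynchronous, so you would
    synchronize the device before reading the clock; this sketch only
    illustrates the measurement pattern itself.
    """
    for _ in range(warmup):  # warm caches, JIT compilers, autotuners
        fn(*args)
    times = []
    for _ in range(iters):
        start = time.perf_counter()
        fn(*args)
        times.append(time.perf_counter() - start)
    return statistics.median(times)

# Illustrative comparison of two implementations of the same operation.
baseline = lambda xs: [x * x for x in xs]
optimized = lambda xs: list(map(lambda x: x * x, xs))

data = list(range(10_000))
print(f"baseline:  {benchmark(baseline, data):.6f}s")
print(f"optimized: {benchmark(optimized, data):.6f}s")
```

Taking the median over several iterations, rather than a single run, reduces the influence of clock jitter and one-off stalls, which matters when kernel speedups are small percentages.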
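Loading a pre-compiled kernel from the Kernel Hub goes through the `kernels` Python package. The sketch below follows the library's published usage (the `kernels-community/activation` repo and `gelu_fast` function come from its documentation, but treat the specific names as assumptions); it is guarded so it degrades gracefully on machines without the package or a CUDA device.

```python
# Sketch of loading a pre-built kernel from the Hugging Face Kernel Hub.
# Requires: pip install kernels, plus PyTorch with a CUDA device.
# Repo id and function name follow the library's documented example,
# but should be treated as illustrative.
loaded = False
try:
    import torch
    from kernels import get_kernel

    # Downloads a pre-compiled binary for this platform; no local
    # compilation from source is needed.
    activation = get_kernel("kernels-community/activation")

    x = torch.randn(10, 10, dtype=torch.float16, device="cuda")
    y = torch.empty_like(x)
    activation.gelu_fast(y, x)  # writes the activation output into y
    loaded = True
except Exception:
    print("Install `kernels` and PyTorch with CUDA to run this example.")
```

Because the Hub serves pre-compiled binaries, the same loading code works for downstream users who never set up a CUDA toolchain, which is what makes distribution of these agent-written kernels practical.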