ClawHub Security Signals: Large Corpus Multi-Scanner Dataset for Agent Skill Security Research

Post Details

Company

Hugging Face

Date Published

June 1, 2026

Author

Vincent Koc, Patrick Erichsen, Jacob Tomlinson, Agustin Rivera, Mike Appel, and Nir Paz

Word Count

1,400

Company Posts That Month

94

Language

-

Hacker News Points

-

Post removed?

No

Source URL

huggingface.co/blog/OpenClaw/clawhub-security-signals

Summary

ClawHub Security Signals is a dataset comprising 67,453 public agent skills from the ClawHub registry, designed to aid research on agent supply-chain security and multi-signal triage. It integrates data from three scanner families—VirusTotal, static heuristic analysis, and NVIDIA SkillSpector—to produce registry verdicts without human annotations. The dataset reveals significant disagreement among scanners, highlighting the need for ensemble approaches to assess malware reputation, static patterns, and semantic risks associated with agent skills. SkillSpector, with a broader scope, often identifies advisory signals regarding authority and data flow, while VirusTotal excels in detecting malicious content. The dataset is structured into four splits for training, validation, testing, and evaluation, providing sanitized data and redacted sensitive information. It aims to facilitate the development of safe agentic systems by examining scanner disagreements and advancing research areas like multi-signal triage, prompt-injection detection, and least-privilege policy learning.

Trends Found in this Post

Trend	Post Mentions	Total Month Mentions	Posts	Companies	MoM
OpenClaw	12	340	57	28	+3%
MCP	4	7,418	806	202	+5%
Serverless	3	970	223	91	-46%
Secrets Management	2	2,464	377	128	+14%
AI Agents	1	5,835	1,302	257	+18%
AI Guardrails	1	481	149	58	+123%

Use This Data

Use this post, company, and trend context to find content marketing opportunities, perform competitive analysis, or address product feature gaps via the Plushcap MCP server or the Plushcap API.