Raw Robot Video to VLA-Ready Training Data: Annotating LeRobot Datasets with Nomadic and HuggingFace Buckets

Post Details

Company

Hugging Face

Date Published

March 21, 2026

Author

Yunus Cukran

Word Count

986

Company Posts That Month

63

Language

-

Hacker News Points

-

Post removed?

No

Source URL

huggingface.co/blog/nomadicml/raw-video-to-training-data

Summary

In the article, the process of transforming raw robotics video into richly annotated, VLA-training-ready data using the Nomadic platform and HuggingFace Buckets is detailed. The text highlights the importance of high-quality training data for robotic Vision-Language Agents (VLAs) and identifies common issues in community-contributed datasets, such as incomplete annotations and lack of temporal detail. Nomadic addresses these challenges by offering tools for detailed timestamping, accurate object identification, and scene segmentation, which are critical for precise robotics training. HuggingFace Buckets provides a storage solution that integrates seamlessly with the Nomadic platform, enabling efficient data management and accessibility for large volumes of robotics video. This integration allows for better standardization and curation of datasets, facilitating multi-dataset training and enhancing the overall training quality of VLAs. Ultimately, the collaboration between data collection, storage, and annotation platforms seeks to advance the capabilities and accuracy of robotic training systems.

Trends Found in this Post

Trend	Post Mentions	Total Month Mentions	Posts	Companies	MoM
Data Pipeline	1	732	223	82	+132%

Use This Data

Use this post, company, and trend context to find content marketing opportunities, perform competitive analysis, or address product feature gaps via the Plushcap MCP server or the Plushcap API.