
Introducing Factotum data pipeline runner

Blog post from Snowplow

Post Details
Company: Snowplow
Date Published:
Author: Snowplow Team
Word Count: 1,359
Language: English
Hacker News Points: -
Summary

Factotum is a new open-source tool from Snowplow for executing data pipeline jobs, positioned as a flexible and efficient alternative to orchestration tools such as AWS Data Pipeline, Luigi, and Airflow. Where those tools often intertwine job execution with scheduling, Factotum decouples the two, which allows job definitions to be written as straightforward, human-readable JSON. This makes jobs composable (one job can embed another) and keeps them independent of any specific scheduling system. Factotum is written in Rust, currently supports Linux/x86_64, and does not require extensive installation dependencies, which keeps it accessible and easy to manage. It executes jobs expressed as Directed Acyclic Graphs (DAGs) of tasks and reports on the tasks performed. Snowplow plans to expand it into a broader ecosystem that includes optional scheduling, audit logging, and user authentication. Users are encouraged to contribute to its development on GitHub.
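
To make the idea of a JSON-defined DAG job concrete, here is a minimal Python sketch of how a runner might order tasks by their dependencies and report on each one. It is not Factotum's implementation, and the field names (`name`, `tasks`, `dependsOn`) and the `run` helper are assumptions made for illustration rather than Factotum's actual job schema. The ordering uses Kahn's algorithm, a standard topological sort.

```python
import json
from collections import deque

# Hypothetical job definition. The field names below are illustrative
# assumptions, not Factotum's actual schema.
JOB_JSON = """
{
  "name": "example job",
  "tasks": [
    {"name": "extract",   "dependsOn": []},
    {"name": "transform", "dependsOn": ["extract"]},
    {"name": "load",      "dependsOn": ["transform"]},
    {"name": "report",    "dependsOn": ["extract"]}
  ]
}
"""


def execution_order(job):
    """Return task names in a dependency-respecting order (Kahn's algorithm)."""
    deps = {task["name"]: task["dependsOn"] for task in job["tasks"]}
    # Number of unfinished dependencies per task.
    remaining = {name: len(d) for name, d in deps.items()}
    # Reverse edges: which tasks are waiting on each task.
    dependents = {name: [] for name in deps}
    for name, d in deps.items():
        for dep in d:
            if dep not in deps:
                raise ValueError(f"unknown dependency: {dep}")
            dependents[dep].append(name)

    ready = deque(name for name, count in remaining.items() if count == 0)
    order = []
    while ready:
        current = ready.popleft()
        order.append(current)
        for child in dependents[current]:
            remaining[child] -= 1
            if remaining[child] == 0:
                ready.append(child)

    # Any task left unscheduled means the graph contains a cycle.
    if len(order) != len(deps):
        raise ValueError("cycle detected: job is not a DAG")
    return order


def run(job, execute):
    """Call `execute(task_name)` for each task in order and collect a report."""
    return [{"task": name, "result": execute(name)} for name in execution_order(job)]


if __name__ == "__main__":
    job = json.loads(JOB_JSON)
    for entry in run(job, lambda name: "ok"):
        print(entry["task"], entry["result"])
```

Because the job is plain data and `run` only takes a job and an `execute` callback, nothing in the sketch knows about scheduling: any external scheduler (cron, for instance) could invoke it, which mirrors the separation of execution from scheduling described above.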