Airflow in Action: How DoorDash Scaled for Data and ML Engineering

Post Details

Company

Astronomer

Date Published

March 24, 2026

Author

-

Word Count

1,019

Company Posts That Month

11

Language

English

Hacker News Points

-

Post removed?

No

Source URL

www.astronomer.io/blog/airflow-in-action-doordash

Summary

DoorDash's engineering team addressed the challenges of scaling their Apache Airflow deployment by developing the Orchestration Frederator, a centralized unifying layer that enables horizontal scaling across multiple Airflow instances. This solution was necessitated by operational bottlenecks experienced with a single monolithic instance, which proved difficult to manage at high scale due to memory pressure, DAG parsing delays, and API server responsiveness issues. By categorizing pipelines based on their business importance, DoorDash deployed a tiered instance structure that improved scalability, reliability, and isolation, although it introduced complexities such as managing cross-instance DAG dependencies and dynamically shifting workloads. The Frederator's centralized database and unified interface streamline operations by directing users to the correct Airflow instance and managing dependencies, while the dual-hosting migration strategy ensures smooth transitions. DoorDash's approach highlights the intricate engineering required to scale Airflow, whereas Astro offers a managed service with similar benefits out-of-the-box, including elastic auto-scaling, high availability, disaster recovery, and multi-cloud deployment, allowing teams to focus on pipeline creation rather than infrastructure management.

Trends Found in this Post

Trend	Post Mentions	Total Month Mentions	Posts	Companies	MoM
Kubernetes	1	1,840	308	106	+33%
Observability	1	3,204	716	172	+14%
Platform Engineering	1	480	172	60	+30%
Real-time	1	6,457	1,307	242	+28%
Secrets Management	1	1,488	268	99	+7%

Use This Data

Use this post, company, and trend context to find content marketing opportunities, perform competitive analysis, or address product feature gaps via the Plushcap MCP server or the Plushcap API.