Weights uses Northflank to scale to millions of users without a DevOps team
Blog post from Northflank
JonLuca DeCaro, an engineer with experience at Citadel and Pinterest, could have built Weights' infrastructure from scratch but chose Northflank to scale the AI platform into a multi-cloud, GPU-optimized system serving millions of users. On Northflank, Weights runs across nine clusters on AWS, GCP, and Azure, managing 40+ microservices and more than 250 concurrent GPUs, with container orchestration and workload scheduling automated. A small team can therefore handle work that typically requires a full DevOps team, which has enabled rapid cloud migration, aggressive cost optimization, and performance gains such as cutting model load time from 7 minutes to 55 seconds. Northflank's infrastructure-as-code capabilities and integrated tooling for deployment, monitoring, and resource management let Weights focus on product development and scaling rather than infrastructure, saving time and reducing costs.