Tencent's Weixin team has successfully integrated the open-source distributed computing engine Ray into its AI infrastructure to address the technical challenges of deploying large-scale AI applications. By combining Ray with Kubernetes, Weixin has developed AstraRay, a high-performance AI compute platform that efficiently manages resource-intensive tasks like OCR, which requires over a million CPU cores. Ray’s simplicity, robust ecosystem, and ability to scale from local development to large clusters make it an attractive choice for AI computing, enabling Weixin to streamline application deployment, reduce costs, and improve resource utilization. AstraRay's architecture leverages a shared-state scheduling system called Starlink to handle millions of nodes and heterogeneous resources, enhancing reliability and reducing complexity. These innovations have allowed Weixin to support ultra-large-scale AI workloads while maintaining high responsiveness and low costs, preparing the platform for future AI application developments.