The benefits of globally distributed infrastructure for model serving include increased compute availability by providing access to a wider range of GPUs across multiple cloud providers, cost savings through competitive pricing, redundancy for better uptime by shifting workloads to regions with capacity, lower latency by locating servers near users, and data residency compliance by specifying constraints on region and cloud provider. A globally distributed infrastructure also abstracts away complexities, leaving customers with the benefits without the headache of operating workloads on multiple platforms.