The Replit platform has undergone a significant infrastructure overhaul to improve stability and reduce downtime. The company has introduced a new failure domain for its Hacker users, separating their infrastructure into multiple independent clusters to minimize the impact of future incidents. This change requires clients to make DNS queries to determine which cluster their repls are hosted in, but an external cluster resolution service provides a seamless experience. The platform also implemented eventual consistency and tombstones to handle user transfers between clusters. With this new infrastructure in place, Replit is experimenting with provisioning different hardware configurations for each cluster to improve responsiveness and the overall Hacker experience.