We're cutting our prices in half
Blog post from Replicate
Replicate has announced a significant reduction in pricing for its public models, cutting the per-second cost by half for all users, including models from SDXL to Llama 2, without requiring any action from users. The company plans to implement similar price reductions for private models but will introduce charges for setup and idle times, applied at half the per-second rate, specifically for new users or existing users who opt-in. For users with high-volume requests on private models, this could translate to cost savings by optimizing the use of resources, while those with fewer requests may face higher expenses. The changes aim to benefit existing users of private models by maintaining current rates unless they choose to switch, ensuring the update is favorable and allows users to continue enjoying lower costs without disruption.