Photoroom has open-sourced its text-to-image model, PRX, making it available under the Apache 2.0 license through 🤗 Diffusers, with the aim of providing both a robust model and a detailed resource on the training process. The model, which includes a 1.3 billion-parameter version trained on 32 H200 GPUs, is designed to produce high-quality images at resolutions up to 1024 pixels. The release is accompanied by a blog series detailing the training pipeline, including architecture choices, training techniques, and post-training methods, with more updates planned to cover further experiments and refinements. Photoroom encourages community involvement through their Discord server and is actively seeking contributions and feedback. The project showcases extensive experimentation with various architectures, VAEs, and training optimizations, and includes contributions from a diverse team of researchers and engineers.