From Image-to-LoRA to In-Context Edit
Blog post from HuggingFace
Qwen-Image's Image-to-LoRA model, designed to convert image data into a LoRA model for generating similar images, laid the groundwork for further exploration into image editing capabilities, ultimately leading to the development of the Qwen-Image-Edit-2511 model. This new model leverages an In-Context Edit approach, allowing transformations seen in a pair of images to be applied to new images, effectively bypassing the limitations of the Image-Pair-to-LoRA model, which struggled due to its data-intensive requirements. The Qwen-Image-Edit-2511 model, trained with relatively minimal data, showcases its potential across various computer vision tasks, such as image segmentation and depth estimation, by using multi-image input editing to replicate and apply changes from example images. The model’s development signals a shift towards versatile applications in computer vision, with plans for further enhancements and open-sourcing of larger datasets to improve and validate its effectiveness in broader tasks.