InfiniteYou
InfiniteYou (InfU) is an image generation framework based on Diffusion Transformer (DiT), specifically optimized for flexibly generating and modifying images while maintaining the consistency of identity. It addresses the shortcomings of existing methods in terms of identity similarity, text-image alignment, generation quality, and aesthetics.
Key Features:
- InfuseNet: Injects identity features into the DiT model, enhancing identity similarity through residual connections while maintaining generation capabilities.
- Multistage Training Strategy: Includes pre-training and supervised fine-tuning (SFT), utilizing synthesized single-person multi-sample (SPMS) data to improve text-image alignment, enhance image quality, and mitigate issues with facial copy-pasting.
- Plug-and-Play Design: Compatible with various existing methods, such as different versions of FLUX, ControlNets, and LoRAs.
Use Cases:
- Personalized Image Generation: Generates images with user-specific identity features based on photos and textual descriptions provided by users. For example, imagining oneself in different professions or styles.
- Image Editing and Modification: Modifies attributes such as scene, clothing, and expressions while keeping the identity of the person unchanged.
- Virtual Avatar Generation: Creates virtual avatars with personal characteristics for use in social media, gaming, and other scenarios.
- High-Quality Portrait Generation: Suitable for scenarios requiring realistic, high-quality portraits, such as advertising and films.
In summary, InfiniteYou aims to become a powerful and flexible tool for generating and modifying images with specific identity features, and it is easy to integrate with other AI tools.