
Kolors
Large scale text to image generation model based on latent diffusion model
- Support bilingual input in both Chinese and English, understand and generate high-quality images.
- Provide multiple functional modules such as Inference, Checkpoints, LoRA, ControlNet, and IP Adapter.
- KolorsPrompts, a comprehensive evaluation dataset with over 1000 prompts, is used for model performance comparison.
- Achieved industry-leading standards in both human and machine evaluations.
- Provides detailed technical reports and user documentation to facilitate understanding and application by users and researchers.
- Completely open source, promoting collaborative development with the open source community.
Product Details
Kolors is a large-scale text to image generation model developed by the Kwai Kolors team. Based on the potential diffusion model, it is trained in billions of text image pairs. It outperforms both open source and closed source models in terms of visual quality, complex semantic accuracy, and rendering of Chinese and English text. Kolors supports both Chinese and English input, particularly in understanding and generating specific Chinese content.