Janus Pro - Deepseek
Janus-Series is a family of unified multimodal understanding and generation models developed by DeepSeek. It includes Janus, JanusFlow, and Janus-Pro, which integrate autoregressive language models with rectified flow for text-to-image generation and multimodal understanding. The models decouple visual encoding into separate pathways to alleviate conflicts between understanding and generation tasks. Janus-Pro features optimized training, expanded data, and larger model sizes, achieving significant advancements in both understanding and instruction-following for image generation. The models are available on Hugging Face and support both multimodal understanding (image+text input) and text-to-image generation.



