Qwen3.5-Flash
Qwen3.5-Flash is a vision-language model built on a hybrid architecture that combines linear attention with a sparse mixture-of-experts (MoE) design. It is designed for efficient inference and handles both text-only and multimodal tasks, balancing response speed against task performance. The model is a significant improvement over the Qwen3 series in both capability and efficiency.
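As a rough illustration of what "sparse mixture-of-experts" means in general, the sketch below routes an input to only the top-k experts and mixes their outputs by renormalized router weights. It is a toy example and not Qwen3.5-Flash's actual implementation: the names `route_top_k` and `moe_forward`, the scalar "experts", and the router scores are all hypothetical, chosen only to show why sparse routing keeps per-token compute low.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def route_top_k(router_logits, k):
    """Return (expert_index, weight) pairs for the k highest-scoring experts.

    Weights are a softmax over the selected logits only, so they sum to 1.
    """
    order = sorted(range(len(router_logits)),
                   key=lambda i: router_logits[i], reverse=True)
    top = order[:k]
    weights = softmax([router_logits[i] for i in top])
    return list(zip(top, weights))


def moe_forward(x, experts, router_logits, k=2):
    """Evaluate only the k selected experts and mix their outputs."""
    return sum(w * experts[i](x) for i, w in route_top_k(router_logits, k))


# Hypothetical toy experts: plain functions on a scalar input.
experts = [lambda x: x + 1, lambda x: x * 2, lambda x: x - 3, lambda x: x * x]
# Hypothetical router scores for one input; a real router would compute these.
logits = [0.1, 2.0, -1.0, 2.0]

routes = route_top_k(logits, k=2)
output = moe_forward(3.0, experts, logits, k=2)
```

Here only two of the four experts run for the input, which is the general reason sparse MoE models can hold many parameters while keeping inference cost per token comparatively small.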
Data enriched Apr 24, 2026. Pricing from OpenRouter API.