Qwen3.5-35B-A3B
All models
AlibabaAlibabaQwenReleased 2026-02-25

Qwen3.5-35B-A3B

262K context$0.163/M input$1.30/M output35B

The Qwen3.5-35B-A3B is a vision-language model with a hybrid architecture combining linear attention mechanisms and a sparse mixture-of-experts model. It is designed for higher inference efficiency and achieves performance comparable to the Qwen3.5-27B. The model supports reasoning-enabled tasks and structured outputs, making it suitable for complex applications.

What is Qwen3.5-35B-A3B?

Qwen3.5-35B-A3B is an AI model from Alibaba that Agent Mag tracks for pricing, context window, modalities, benchmarks, and API compatibility. Builders can use this page to compare Qwen3.5-35B-A3B against other models for agent workflows and production deployments.

Model ID

The Qwen3.5-35B-A3B is a vision-language model with a hybrid architecture combining linear attention mechanisms and a sparse mixture-of-experts model. It is designed for higher inference efficiency and achieves performance comparable to the Qwen3.5-27B. The model supports reasoning-enabled tasks and structured outputs, making it suitable for complex applications.

Architecture & Specifications
Architecture
Hybrid architecture with linear attention and sparse mixture-of-experts
Parameters
35B
Tokenizer
Qwen3
License
Released
2026-02-25
Modalities
Input
textimagevideo
Output
text
Supported Parameters
frequency_penaltyinclude_reasoninglogit_biaslogprobsmax_tokensmin_ppresence_penaltyreasoningrepetition_penaltyresponse_formatseedstopstructured_outputstemperaturetool_choicetoolstop_ktop_logprobstop_p
Strengths
  • Hybrid architecture for higher inference efficiency
  • Supports reasoning-enabled tasks
  • Structured output capabilities
  • Vision-language integration
  • Comparable performance to Qwen3.5-27B
Limitations
  • Low performance on research-level physics reasoning (CritPt: 0.9%)
  • Moderate hallucination rate in knowledge tasks (16.0%)
  • Limited accuracy in economically valuable tasks (GDPval-AA: 20.6%)
  • Lower coding capability compared to specialized models
  • Performance varies across benchmarks
Recommended Use Cases
Scientific reasoning tasks
Vision-language applications
Instruction-following scenarios
Agentic coding workflows
Long context reasoning evaluations

Related content

Data enriched Apr 24, 2026. Pricing from OpenRouter API.