Qwen3.5-9B
Qwen3.5-9B is a multimodal foundation model from the Qwen3.5 family with a 9-billion-parameter architecture. It is designed for strong reasoning, coding, and visual understanding, using a unified vision-language design with early fusion of multimodal tokens. This lets the model efficiently process and reason across text and images within the same context.
What is Qwen3.5-9B?
Qwen3.5-9B is an AI model from Alibaba that Agent Mag tracks for pricing, context window, modalities, benchmarks, and API compatibility. Builders can use this page to compare Qwen3.5-9B against other models for agent workflows and production deployments.
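Because API compatibility is one of the dimensions tracked here, the sketch below shows how a builder might send a combined text-and-image request to the model through an OpenAI-compatible client. The endpoint URL, model identifier, and QWEN_API_KEY environment variable are illustrative assumptions, not confirmed values for this model or any specific provider.

```python
# Minimal sketch of a multimodal request against an assumed OpenAI-compatible endpoint.
import os
from openai import OpenAI

client = OpenAI(
    api_key=os.environ["QWEN_API_KEY"],          # hypothetical environment variable
    base_url="https://example-provider.com/v1",  # assumed OpenAI-compatible endpoint
)

# Text and an image in the same request, matching the unified
# vision-language design described above.
response = client.chat.completions.create(
    model="qwen3.5-9b",  # assumed model identifier
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "What does this chart show?"},
                {"type": "image_url", "image_url": {"url": "https://example.com/chart.png"}},
            ],
        }
    ],
)
print(response.choices[0].message.content)
```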
Strengths
- Strong reasoning capabilities
- Efficient multimodal processing of text and images
- High performance in coding tasks
- Graduate-level scientific reasoning
- Instruction-following proficiency
Limitations
- Low performance in research-level physics reasoning
- Limited accuracy in economically valuable tasks
- Moderate hallucination rate in knowledge-based tasks
- Relatively low coding capability compared to leading models
- Limited long-context reasoning performance
More from Alibaba
Qwen 3.6 Plus builds on a hybrid architecture that combines efficient linear attention with sparse mixture-of-experts routing, enabling strong scalability and high-performance inference. Compared to the 3.5 series, it delivers...
The Qwen3.5 Series 35B-A3B is a native vision-language model designed with a hybrid architecture that integrates linear attention mechanisms and a sparse mixture-of-experts model, achieving higher inference efficiency. Its overall...
The Qwen3.5 122B-A10B native vision-language model is built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. In terms of...
The Qwen3.5 27B native vision-language dense model incorporates a linear attention mechanism, delivering fast response times while balancing inference speed and performance. Its overall capabilities are comparable to those of...
Related content
Compare pricing, local installs, context windows, and modality filters across the full model catalog.
Find frameworks, SDKs, and infrastructure tools that pair with this model in production workflows.
See Agent Mag coverage of model benchmarks, agent frameworks, and deployment patterns.