MiMo-V2.5 is an AI model from Xiaomi in the MiMo series. Agent Mag tracks its context window, pricing, modalities, and supported API parameters on this page.

How do you use MiMo-V2.5?

MiMo-V2.5 can be called through OpenRouter's OpenAI-compatible API using the model ID xiaomi/mimo-v2.5. This page includes a ready-to-run curl example and the supported parameter list.

MiMo-V2.5 is a paid model in the OpenRouter catalog, with separate input and output token pricing shown on this page.

MiMo-V2.5

All models

XiaomiMiMoReleased 2026-04-22

MiMo-V2.5

1.0M context

$0.400/M input

$2.00/M output

MiMo-V2.5 is Xiaomi's omnimodal AI model designed for multimodal perception tasks, including image and video understanding. It features a 1M token context window, enabling it to handle complete documents, extended conversations, and complex task contexts in a single pass. The model is optimized for cost-efficient inference while delivering strong reasoning and perception capabilities, making it suitable for integration with agent frameworks.

What is MiMo-V2.5?

MiMo-V2.5 is an AI model from Xiaomi that Agent Mag tracks for pricing, context window, modalities, benchmarks, and API compatibility. Builders can use this page to compare MiMo-V2.5 against other models for agent workflows and production deployments.

Model ID

Architecture & Specifications

Architecture

Omnimodal

Tokenizer

Other

Released

2026-04-22

Modalities

Input

textaudioimagevideo

Output

text

Supported Parameters

frequency_penaltyinclude_reasoningmax_tokenspresence_penaltyreasoningresponse_formatstoptemperaturetool_choicetoolstop_p

Strengths

Supports a 1M token context window for extended tasks
Optimized for cost-efficient inference
Strong multimodal perception across image and video tasks
Ideal for integration with agent frameworks

Recommended Use Cases

Extended conversations

Complex task contexts

Image and video understanding

Integration with agent frameworks

More from Xiaomi

MiMo-V2.5-Pro

1.0M ctx$1.00/M

MiMo-V2.5-Pro is Xiaomi’s flagship model, delivering strong performance in general agentic capabilities, complex software engineering, and long-horizon tasks, with top rankings on benchmarks such as ClawEval, GDPVal, and SWE-bench Pro....

MiMo-V2-Omni

262K ctx$0.400/M

MiMo-V2-Omni is a frontier omni-modal model that natively processes image, video, and audio inputs within a unified architecture. It combines strong multimodal perception with agentic capability - visual grounding, multi-step...

MiMo-V2-Pro

1.0M ctx$1.00/M

MiMo-V2-Pro is Xiaomi's flagship foundation model, featuring over 1T total parameters and a 1M context length, deeply optimized for agentic scenarios. It is highly adaptable to general agent frameworks like...

MiMo-V2-Flash

262K ctx$0.090/M

MiMo-V2-Flash is an open-source foundation language model developed by Xiaomi. It is a Mixture-of-Experts model with 309B total parameters and 15B active parameters, adopting hybrid attention architecture. MiMo-V2-Flash supports a...

Stay in the know

MiMo-V2.5

What is MiMo-V2.5?

More from Xiaomi

Related content