How do you use MiMo-V2-Omni?

MiMo-V2-Omni can be called through OpenRouter's OpenAI-compatible API using the model ID xiaomi/mimo-v2-omni. This page includes a ready-to-run curl example and the supported parameter list.

Is MiMo-V2-Omni free?

MiMo-V2-Omni is a paid model in the OpenRouter catalog, with separate input and output token pricing shown on this page.

MiMo-V2-Omni

All models

XiaomiMiMo

MiMo-V2-Omni

Q: What is MiMo-V2-Omni?

MiMo-V2-Omni is an AI model from Xiaomi in the MiMo series. Agent Mag tracks its context window, pricing, modalities, and supported API parameters on this page.

262K context

$0.400/M input

$2.00/M output

MiMo-V2-Omni is an AI model from Xiaomi built for agent workflows, with support for text, audio, image, video input and text output. MiMo-V2-Omni is a frontier omni-modal model that natively processes image, video, and audio inputs within a unified architecture. It combines strong multimodal perception with agentic capability - visual grounding, multi-step...

What is MiMo-V2-Omni?

MiMo-V2-Omni is an AI model from Xiaomi that Agent Mag tracks for pricing, context window, modalities, benchmarks, and API compatibility. Builders can use this page to compare MiMo-V2-Omni against other models for agent workflows and production deployments.

Model ID

MiMo-V2-Omni is a frontier omni-modal model that natively processes image, video, and audio inputs within a unified architecture. It combines strong multimodal perception with agentic capability - visual grounding, multi-step...

Modalities

Input

textaudioimagevideo

Output

text

Supported Parameters

frequency_penaltyinclude_reasoningmax_tokenspresence_penaltyreasoningresponse_formatstoptemperaturetool_choicetoolstop_p

More from Xiaomi

MiMo-V2.5-Pro

1.0M ctx$1.00/M

MiMo-V2.5-Pro is Xiaomi’s flagship model, delivering strong performance in general agentic capabilities, complex software engineering, and long-horizon tasks, with top rankings on benchmarks such as ClawEval, GDPVal, and SWE-bench Pro....

MiMo-V2.5

1.0M ctx$0.400/M

MiMo-V2.5 is a native omnimodal model by Xiaomi. It delivers Pro-level agentic performance at roughly half the inference cost, while surpassing MiMo-V2-Omni in multimodal perception across image and video understanding...

MiMo-V2-Pro

1.0M ctx$1.00/M

MiMo-V2-Pro is Xiaomi's flagship foundation model, featuring over 1T total parameters and a 1M context length, deeply optimized for agentic scenarios. It is highly adaptable to general agent frameworks like...

MiMo-V2-Flash

262K ctx$0.090/M

MiMo-V2-Flash is an open-source foundation language model developed by Xiaomi. It is a Mixture-of-Experts model with 309B total parameters and 15B active parameters, adopting hybrid attention architecture. MiMo-V2-Flash supports a...

Stay in the know

MiMo-V2-Omni

What is MiMo-V2-Omni?

More from Xiaomi

Related content