MiMo-V2-Pro
All models
XiaomiXiaomiMiMoReleased 2026-03-18

MiMo-V2-Pro

1.0M context$1.00/M input$3.00/M output1T

MiMo-V2-Pro is Xiaomi's flagship foundation model, featuring over 1 trillion parameters and a context length of 1 million tokens. It is optimized for agentic scenarios and integrates seamlessly with general agent frameworks like OpenClaw. The model is designed to orchestrate complex workflows, drive production engineering tasks, and deliver reliable results, ranking among the global top tier in benchmarks such as PinchBench and ClawBench.

What is MiMo-V2-Pro?

MiMo-V2-Pro is an AI model from Xiaomi that Agent Mag tracks for pricing, context window, modalities, benchmarks, and API compatibility. Builders can use this page to compare MiMo-V2-Pro against other models for agent workflows and production deployments.

Model ID

MiMo-V2-Pro is Xiaomi's flagship foundation model, featuring over 1 trillion parameters and a context length of 1 million tokens. It is optimized for agentic scenarios and integrates seamlessly with general agent frameworks like OpenClaw. The model is designed to orchestrate complex workflows, drive production engineering tasks, and deliver reliable results, ranking among the global top tier in benchmarks such as PinchBench and ClawBench.

Architecture & Specifications
Parameters
1T
Tokenizer
Other
Released
2026-03-18
Modalities
Input
text
Output
text
Supported Parameters
frequency_penaltyinclude_reasoningmax_tokenspresence_penaltyreasoningresponse_formatstoptemperaturetool_choicetoolstop_p
Strengths
  • Extremely large context length of 1 million tokens
  • Optimized for agentic scenarios and workflows
  • Ranks globally in top tier for benchmarks like PinchBench and ClawBench
  • Highly adaptable to general agent frameworks like OpenClaw
  • Reliable performance in complex production engineering tasks
Limitations
  • High hallucination rate (70.1%) in knowledge-based tasks
  • Low performance in research-level physics reasoning (0.3%)
  • Limited accuracy in omniscience tasks (26.8%)
  • Potentially high inference cost due to large parameter count
  • Requires user IDs for anonymity, limiting privacy options
Recommended Use Cases
Integration with agent frameworks like OpenClaw
Orchestrating complex workflows
Driving production engineering tasks
Coding and terminal use in agentic scenarios
Scientific reasoning and graduate-level tasks

Related content

Data enriched Apr 24, 2026. Pricing from OpenRouter API.