Back to Models
Xiaomi: MiMo-V2.5
mimo-v2.5
MiMo-V2.5 is a native omnimodal model by Xiaomi. It delivers Pro-level agentic performance at roughly half the inference cost, while surpassing MiMo-V2-Omni in multimodal perception across image and video understanding...
Modalities
Input
textimageaudiovideo
Output
text
Pricing
Cost per 1 million tokens
Input
$0.168
Output
$0.336
Model Specs
Context Window
1,048,576Max Output
131,072Release Date
2026-04-22Knowledge Cutoff
2024-12Capabilities
Reasoning
Tool Calling
Vision
Last Updated: 2026-04-22
Provider: