Back to Models

Xiaomi: MiMo-V2.5

mimo-v2.5

MiMo-V2.5 is a native omnimodal model by Xiaomi. It delivers Pro-level agentic performance at roughly half the inference cost, while surpassing MiMo-V2-Omni in multimodal perception across image and video understanding...

Modalities

Input

textimageaudiovideo

Output

text
Pricing
Cost per 1 million tokens
Input
$0.168
Output
$0.336
Model Specs
Context Window
1,048,576
Max Output
131,072
Release Date
2026-04-22
Knowledge Cutoff
2024-12
Capabilities
Reasoning
Tool Calling
Vision

Last Updated: 2026-04-22

Provider: