Back to Models

Z.ai: GLM 4.5V

glm-4.5v

GLM-4.5V is a vision-language foundation model for multimodal agent applications. Built on a Mixture-of-Experts (MoE) architecture with 106B parameters and 12B activated parameters, it achieves state-of-the-art results in video understanding,...

Modalities

Input

textimage

Output

text
Pricing
Cost per 1 million tokens
Input
$0.72
Output
$2.16
Model Specs
Context Window
65,536
Max Output
16,384
Release Date
2025-08-11
Knowledge Cutoff
2025-04
Capabilities
Reasoning
Tool Calling
Vision

Last Updated: 2025-08-11

Provider: