Back to Models
Google: Gemma 3 4B
gemma-3-4b-it
Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...
Modalities
Input
textimage
Output
text
Pricing
Cost per 1 million tokens
Input
$0.06
Output
$0.12
Model Specs
Context Window
131,072Max Output
16,384Release Date
2025-03-13Knowledge Cutoff
2024-08-31Capabilities
Reasoning
Tool Calling
Vision
Last Updated: 2025-03-13
Provider: