Gemma 4 26B A4B IT
gemma-4-26b-a4b-it
Google · chat · api · seen 13m ago
Context
262.1K
Max output
32.8K
Input $/1M
$0.06
Output $/1M
$0.33
Modalities
text
Released
03 Apr 2026
AI summary
● machine-written
Google releases Gemma 4 26B A4B instruction-tuned model
Gemma 4 26B A4B IT is an instruction-tuned Mixture-of-Experts model from Google DeepMind with 25.2B total parameters, of which only 3.8B activate per token during inference. The model supports a 262K token context window, native function calling, and structured output. It is available via API at $0.06 per million input tokens and $0.33 per million output tokens.
What's new
- Mixture-of-Experts architecture with 3.8B active parameters per token
- 262K token context window with 32.8K max output
- Native function calling and configurable thinking/reasoning mode
- Released April 3, 2026 under Apache 2.0 license
Best for
Cost-efficient instruction following at scaleLong-context document processingAPI-based chat and instruction completion
Source: https://ai.google.dev/gemini-api/docs/models/gemma-4-26b-a4b-it