DeepSeek V4 Flash
deepseek-ai/DeepSeek-V4-Flash
Open Source · chat · open-weights
Intelligence
Context
1M
Max output
—
Input $/1M
$0.09
Output $/1M
$0.22
Modalities
text
Released
22 Apr 2026
Intelligence Index via Artificial Analysis · 0–100, higher is better
License: mit · deepseek-ai/DeepSeek-V4-Flash
AI summary
● machine-written
DeepSeek V4 Flash: 284B MoE model with 1M context window
DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model with 284B total parameters and 13B activated parameters, designed for fast inference and high-throughput workloads. It supports a 1M-token context window and is suited for coding assistants, chat systems, and agent workflows where responsiveness and cost efficiency are important.
What's new
- Supports hybrid attention for efficient long-context processing
- Reasoning efforts high and xhigh are supported; xhigh maps to max reasoning
- Available through multiple inference providers including Novita, Fireworks AI, and DeepInfra
Best for
Coding assistantsChat systemsAgent workflowsHigh-throughput inference
Source: https://huggingface.co/deepseek-ai/DeepSeek-V4-Flash