DeepSeek V4 Flash
DeepSeek V4 Flash
All models
DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model with 284 billion total parameters and 13 billion activated parameters. It supports a 1 million-token context window and is designed for fast inference and high-throughput workloads. The model features hybrid attention for efficient long-context processing and configurable reasoning modes, making it suitable for coding assistants, chat systems, and agent workflows.
Related content
Data enriched Apr 24, 2026. Pricing from OpenRouter API.