DeepSeek V4 Pro
DeepSeek · Released April 24, 2026

1.0M context · $1.74/M input · $3.48/M output · 1.6T total, 49B activated

DeepSeek V4 Pro is a large-scale Mixture-of-Experts model with 1.6 trillion total parameters and 49 billion activated parameters. It supports a 1 million-token context window and is designed for advanced reasoning, coding, and long-horizon agent workflows. The model introduces a hybrid attention system for efficient long-context processing and supports multiple reasoning modes to balance speed and depth depending on the task, making it suitable for complex workloads such as full-codebase analysis and large-scale information synthesis.
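
For scale, the activation ratio and per-request cost implied by the figures above can be worked out directly. The token counts in this sketch are illustrative, not from this page:

```python
# Figures from the listing: 1.6T total / 49B activated parameters,
# $1.74 per million input tokens, $3.48 per million output tokens.
TOTAL_PARAMS = 1.6e12
ACTIVE_PARAMS = 49e9
PRICE_IN = 1.74 / 1e6   # USD per input token
PRICE_OUT = 3.48 / 1e6  # USD per output token

# Roughly 3% of parameters are active per token.
print(f"activation ratio: {ACTIVE_PARAMS / TOTAL_PARAMS:.1%}")

# Illustrative request: 800k input tokens (near the 1M context cap)
# plus a 4k-token response.
cost = 800_000 * PRICE_IN + 4_000 * PRICE_OUT
print(f"example request cost: ${cost:.2f}")  # ~$1.41
```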

What is DeepSeek V4 Pro?

DeepSeek V4 Pro is an AI model from DeepSeek that Agent Mag tracks for pricing, context window, modalities, benchmarks, and API compatibility. Builders can use this page to compare DeepSeek V4 Pro against other models for agent workflows and production deployments.
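
Since this page sources pricing from the OpenRouter API, a minimal sketch of calling the model through OpenRouter's OpenAI-compatible endpoint is shown below. The model slug `deepseek/deepseek-v4-pro` is a hypothetical placeholder, not a confirmed ID:

```python
# Minimal chat request via OpenRouter's OpenAI-compatible API.
import os
from openai import OpenAI

client = OpenAI(
    base_url="https://openrouter.ai/api/v1",
    api_key=os.environ["OPENROUTER_API_KEY"],
)

resp = client.chat.completions.create(
    model="deepseek/deepseek-v4-pro",  # hypothetical slug; check the provider's model list
    messages=[{"role": "user", "content": "Summarize this repository's architecture."}],
    max_tokens=1024,
)
print(resp.choices[0].message.content)
```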

Architecture & Specifications
Architecture
Mixture of Experts (MoE)
Parameters
1.6T total, 49B activated
Tokenizer
DeepSeek
Released
April 24, 2026
Modalities
Input
text
Output
text
Supported Parameters
frequency_penalty, include_reasoning, logprobs, max_tokens, presence_penalty, reasoning, response_format, seed, stop, temperature, tool_choice, tools, top_k, top_logprobs, top_p
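
These parameters map onto an OpenAI-style request body. The sketch below exercises several of them; the values, the tool schema, and the model slug are illustrative, and the shape of the `reasoning` field follows OpenRouter conventions rather than anything stated on this page:

```python
# Illustrative request body using parameters from the supported list above.
# Values, tool schema, and model slug are made up for demonstration.
request_body = {
    "model": "deepseek/deepseek-v4-pro",  # hypothetical slug, as above
    "messages": [
        {"role": "user", "content": "Plan a refactor of src/auth."}
    ],
    "temperature": 0.3,
    "top_p": 0.9,
    "top_k": 40,
    "max_tokens": 2048,
    "seed": 7,                            # reproducible sampling
    "stop": ["</plan>"],
    "frequency_penalty": 0.1,
    "presence_penalty": 0.0,
    "reasoning": {"effort": "high"},      # shape assumed from OpenRouter conventions
    "tools": [{
        "type": "function",
        "function": {
            "name": "read_file",          # hypothetical tool
            "description": "Read a file from the repository.",
            "parameters": {
                "type": "object",
                "properties": {"path": {"type": "string"}},
                "required": ["path"],
            },
        },
    }],
    "tool_choice": "auto",
}
```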
Strengths
  • Supports a 1 million-token context window
  • Advanced reasoning capabilities
  • Efficient long-context processing with hybrid attention system
  • Multiple reasoning modes for task-specific optimization
  • Well-suited for complex workloads like full-codebase analysis and multi-step automation
Limitations
  • Lower performance on research-level physics reasoning (CritPt benchmark: 12.9%)
  • Moderate accuracy on general knowledge tasks (AA-Omniscience Accuracy: 43.3%)
  • Hallucination rate of 6.0% in knowledge tasks
  • Limited coding proficiency (SciCode benchmark: 50.0%)
  • Performance variability across benchmarks
Recommended Use Cases
Full-codebase analysis
Multi-step automation workflows
Large-scale information synthesis
Advanced reasoning tasks
Scientific computing and coding

Data enriched Apr 24, 2026. Pricing from OpenRouter API.