awesome-open-source-llms/details/deepseek-v3.md at master · ever-works/awesome-open-source-llms

Overview

DeepSeek V3 is a cutting-edge open-source large language model released in December 2025 by DeepSeek AI, a Hangzhou-based AI startup. With 685 billion total parameters using mixture-of-experts architecture, it represents one of the most capable open-source models available.

Architecture

Total Parameters: 685 billion (MoE architecture)
Active Parameters: Uses adaptive routing to activate only necessary experts
Context Window: Up to 128,000 tokens
Model Type: Mixture-of-Experts Transformer
Training: Advanced training on diverse high-quality datasets

Key Features

Extended 128K token context window for analyzing large documents
Exceptional reasoning capabilities
State-of-the-art coding performance
Complex multi-step problem solving
Strong mathematical abilities
Advanced instruction following
Efficient inference through expert routing

Performance Highlights

DeepSeek V3.2 achieves top-tier results across major benchmarks:

HumanEval: 94.2 (exceptional code generation)
AIME 2025: 95.7 (advanced mathematics)
GPQA Diamond: 85.7 (doctoral-level science reasoning)
LiveCodeBench: 84.9 (real-world coding)
IFEval: 88.0 (instruction following)

Specialized Variants

DeepSeek V3.2-Speciale

Surpasses GPT-5 on reasoning benchmarks
Reaches Gemini-3.0-Pro-level performance
90% LiveCodeBench score
Optimized for agentic workloads

DeepSeek-Coder

Widely recognized as the leader for coding tasks
Specialized for software development
Excellent repository-level understanding

Deployment Options

Self-hosting on enterprise GPU infrastructure
Cloud deployment through major providers
Optimized for efficient inference despite large size
Compatible with vLLM and other frameworks
Support for quantization techniques

Use Cases

Advanced coding and software development
Mathematical and scientific reasoning
Large document analysis and summarization
Complex problem-solving and planning
Research and development
Agentic AI applications

Licensing

Released under MIT License, allowing unrestricted commercial and research use.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Overview

Architecture

Key Features

Performance Highlights

Specialized Variants

DeepSeek V3.2-Speciale

DeepSeek-Coder

Deployment Options

Use Cases

Licensing

FilesExpand file tree

deepseek-v3.md

Latest commit

History

deepseek-v3.md

File metadata and controls

Overview

Architecture

Key Features

Performance Highlights

Specialized Variants

DeepSeek V3.2-Speciale

DeepSeek-Coder

Deployment Options

Use Cases

Licensing