add deepseek v3 transformers integration #1445
base: main
SUPPORTED_ARCHITECTURES += ("glm", "mistral-nemo", "minicpm3", "phi3-moe")
# openvino 2025.0 required for disabling check_trace
if is_openvino_version(">=", "2025.0"):
    SUPPORTED_ARCHITECTURES += ("deepseek",)
This entry is actually the remote-code deepseek_v3 model: katuni4ka/tiny-random-deepseek-v3
Please add tests for export and inference.
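A minimal sketch of what such a test could look like, not the test the PR ends up adding. It assumes `optimum-intel` (with OpenVINO) and `transformers` are installed; the helper name `check_export_and_inference`, the prompt, and the token budget are hypothetical and chosen only for illustration.

```python
def check_export_and_inference(model_id, trust_remote_code=False, max_new_tokens=4):
    """Export a checkpoint to OpenVINO and run a short generation (illustrative only)."""
    # Heavy dependencies are imported lazily so the helper can be defined
    # even where optimum-intel is not installed.
    from optimum.intel import OVModelForCausalLM
    from transformers import AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=trust_remote_code)
    # export=True converts the original checkpoint to OpenVINO IR on the fly.
    model = OVModelForCausalLM.from_pretrained(
        model_id, export=True, trust_remote_code=trust_remote_code
    )

    inputs = tokenizer("Hello", return_tensors="pt")
    output = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # Inference check: one sequence comes back and it is at least as long as the prompt.
    assert output.shape[0] == 1
    assert output.shape[1] >= inputs["input_ids"].shape[1]
    return output
```

With the remote-code checkpoint mentioned above, the call would presumably be `check_export_and_inference("katuni4ka/tiny-random-deepseek-v3", trust_remote_code=True)`; a checkpoint for the native transformers implementation would be passed without `trust_remote_code`.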
@register_in_tasks_manager(
    "deepseek_v3", *["text-generation", "text-generation-with-past"], library_name="transformers"
)
class DeepseekVOpenVINOConfig(LlamaOpenVINOConfig):
Before we can merge, we need to fix rope_interleave for export: https://github.com/huggingface/transformers/blob/v4.55.4/src/transformers/models/deepseek_v3/modeling_deepseek_v3.py#L402
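For context, here is a plain-Python sketch of what an interleaved rotary embedding path is generally understood to do. It assumes the `rope_interleave` branch first reorders each head vector from a pairwise layout to a half-split layout and then applies the usual rotate-half formula. The function names below are hypothetical and the lists stand in for the last dimension of a query or key tensor; this is not the transformers implementation.

```python
def deinterleave(values):
    """Reorder [x0, y0, x1, y1, ...] into [x0, x1, ..., y0, y1, ...]."""
    if len(values) % 2 != 0:
        raise ValueError("head dimension must be even")
    # Even positions first, then odd positions.
    return list(values[0::2]) + list(values[1::2])


def rotate_half(values):
    """Map [a, b] (two halves) to [-b, a], as in the standard RoPE formulation."""
    half = len(values) // 2
    return [-v for v in values[half:]] + list(values[:half])


def apply_rope(values, cos, sin, interleave=False):
    """Apply rotary position embedding to one head vector (illustrative only)."""
    if interleave:
        # Assumed behaviour of the rope_interleave flag: fix the layout first.
        values = deinterleave(values)
    rotated = rotate_half(values)
    return [v * c + r * s for v, c, r, s in zip(values, cos, rotated, sin)]
```

For example, `deinterleave([0, 1, 2, 3, 4, 5])` gives `[0, 2, 4, 1, 3, 5]`. In a tensor implementation this reordering is typically expressed as a view, transpose, and reshape on the last dimension, which is the kind of operation that has to survive export intact.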
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
No description provided.