Skip to content

Fix DeepSeekV3Model for Configurable build pattern#2725

Draft
sanketpurandare wants to merge 1 commit intosanketpurandare/stack/3from
sanketpurandare/stack/4
Draft

Fix DeepSeekV3Model for Configurable build pattern#2725
sanketpurandare wants to merge 1 commit intosanketpurandare/stack/3from
sanketpurandare/stack/4

Conversation

@sanketpurandare
Copy link
Contributor

@sanketpurandare sanketpurandare commented Mar 27, 2026

stack-info: PR: #2725, branch: sanketpurandare/stack/4
@sanketpurandare sanketpurandare changed the base branch from sanketpurandare/stack/3 to main March 27, 2026 01:13
@sanketpurandare sanketpurandare force-pushed the sanketpurandare/stack/4 branch from b0f0cc3 to cd1af1a Compare March 27, 2026 01:14
@sanketpurandare sanketpurandare changed the base branch from main to sanketpurandare/stack/3 March 27, 2026 01:14


# Wire Configurable pattern: build() calls DeepSeekV3Model(config=...)
DeepSeekV3ModelArgs._owner = DeepSeekV3Model
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The "proper" way is to define DeepSeekV3ModelArgs inside DeepSeekV3Model, and you don't need this wiring.

@sanketpurandare sanketpurandare marked this pull request as draft March 27, 2026 02:22
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants