[Cherry-Pick][BugFix] fix paddle optional get assert in sm103 (#7816)#7820
Conversation
|
Thanks for your contribution! |
PaddlePaddle-bot
left a comment
There was a problem hiding this comment.
🤖 Paddle-CI-Agent | pr_review |
2026-05-14 18:21:40
📋 Review 摘要
PR 概述:Cherry-pick #7816 到 release/2.6,修复在 SM103 GPU 上编译时触发的 optional get assert 错误,为 SM103 架构添加正确的 gencode 编译标志及 nvcc 版本门控。
变更范围:custom_ops/setup_ops.py
影响面 Tag:[OP]
📝 PR 规范检查
PR 标题格式符合 Cherry-Pick 规范([Cherry-Pick][BugFix] + 原 PR 号 #7816)。描述中 ## Modifications 段落内容为空(仅留有模板注释),不符合描述模板要求。
PR 描述建议(可直接复制):
## Motivation
cherry-pick #7816 到 release/2.6 分支,修复在 SM103 GPU 上编译时触发的 optional get assert 错误。
## Modifications
- `custom_ops/setup_ops.py`:在 `get_gencode_flags()` 中新增 `cc_val == 103` 分支,arch_code 设为 `103a`,生成 `-gencode arch=compute_103a,code=sm_103a` 编译参数
- `custom_ops/setup_ops.py`:新增 `has_sm103`(要求 nvcc >= 13.0)变量,将 SM103 从 `has_generic_fp8` 路径剔除,并纳入 Blackwell SM100/SM103 优化编译路径
## Usage or Command
N/A
## Accuracy Tests
N/A
## Checklist
- [x] Add at least a tag in the PR title.
- Tag list: [`[FDConfig]`,`[APIServer]`,`[Engine]`, `[Scheduler]`, `[PD Disaggregation]`, `[Executor]`, `[Graph Optimization]`, `[Speculative Decoding]`, `[RL]`, `[Models]`, `[Quantization]`, `[Loader]`, `[OP]`, `[KVCache]`, `[DataProcessor]`, `[BugFix]`, `[Docs]`, `[CI]`, `[Optimization]`, `[Feature]`, `[Benchmark]`, `[Others]`, `[XPU]`, `[HPU]`, `[GCU]`, `[DCU]`, `[Iluvatar]`, `[Metax]`]
- You can add new tags based on the PR content, but the semantics must be clear.
- [x] Format your code, run `pre-commit` before commit.
- [ ] Add unit tests. Please write the reason in this PR if no unit tests.
- [ ] Provide accuracy results.
- [x] If the current PR is submitting to the `release` branch, make sure the PR has been submitted to the `develop` branch, then cherry-pick it to the `release` branch with the `[Cherry-Pick]` PR tag.问题
未发现阻塞性问题。
总体评价
代码变更逻辑清晰,SM103 架构支持遵循了 SM90/SM100 的既有模式,103a gencode 处理和 nvcc 版本门控实现正确。建议补全 PR 描述 Modifications 段落。
CI报告基于以下代码生成(30分钟更新一次): 1 任务总览CI 仍在进行中,Required 任务暂无失败:4 个运行中,2 个等待中,4 个已通过。可选任务有 2 个失败(不阻塞合并),建议关注。
2 任务状态汇总2.1 Required任务 : 4/10 通过
2.2 可选任务 — 21/25 通过
3 失败详情(仅 required)无 required 失败任务。 |
Codecov Report✅ All modified and coverable lines are covered by tests. Additional details and impacted files@@ Coverage Diff @@
## release/2.6 #7820 +/- ##
==============================================
Coverage ? 72.45%
==============================================
Files ? 381
Lines ? 54139
Branches ? 8456
==============================================
Hits ? 39229
Misses ? 12153
Partials ? 2757
Flags with carried forward coverage won't be shown. Click here to find out more. ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
18cab83
into
PaddlePaddle:release/2.6
Motivation
cherry-pick #7816 to release/2.6
Modifications
Usage or Command
Accuracy Tests
Checklist
[FDConfig],[APIServer],[Engine],[Scheduler],[PD Disaggregation],[Executor],[Graph Optimization],[Speculative Decoding],[RL],[Models],[Quantization],[Loader],[OP],[KVCache],[DataProcessor],[BugFix],[Docs],[CI],[Optimization],[Feature],[Benchmark],[Others],[XPU],[HPU],[GCU],[DCU],[Iluvatar],[Metax]]pre-commitbefore commit.releasebranch, make sure the PR has been submitted to thedevelopbranch, then cherry-pick it to thereleasebranch with the[Cherry-Pick]PR tag.