[Others] update flash mask version #7819

Open
BingooYang wants to merge 1 commit into PaddlePaddle:develop from BingooYang:up_flashmaske

Contributor

@BingooYang BingooYang commented May 14, 2026

Motivation

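Upgrading flashinfer to version 0.6.11 requires nvidia-cutlass-dsl>=4.4.2 (https://github.com/PaddlePaddle/FastDeploy/pull/7799), while the old flash mask version pins nvidia-cutlass-dsl==4.4.2, which causes a conflict. This PR therefore upgrades the flash mask version.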

Modifications

Upgrade the flash mask version.
Version information is recorded at: https://ku.baidu-int.com/knowledge/HFVrC7hq1Q/pKzJfZczuc/YeqWcBGW4m/EUBpKxHfTurV5G

Usage or Command

NA

Accuracy Tests

NA

Checklist

  • Add at least a tag in the PR title.
    • Tag list: [[FDConfig],[APIServer],[Engine], [Scheduler], [PD Disaggregation], [Executor], [Graph Optimization], [Speculative Decoding], [RL], [Models], [Quantization], [Loader], [OP], [KVCache], [DataProcessor], [BugFix], [Docs], [CI], [Optimization], [Feature], [Benchmark], [Others], [XPU], [HPU], [GCU], [DCU], [Iluvatar], [Metax]]
    • You can add new tags based on the PR content, but the semantics must be clear.
  • Format your code, run pre-commit before commit.
  • Add unit tests. Please write the reason in this PR if no unit tests.
  • Provide accuracy results.
  • If the current PR is submitting to the release branch, make sure the PR has been submitted to the develop branch, then cherry-pick it to the release branch with the [Cherry-Pick] PR tag.

paddle-bot Bot commented May 14, 2026

Thanks for your contribution!

@PaddlePaddle-bot PaddlePaddle-bot left a comment

🤖 Paddle-CI-Agent | pr_review | 2026-05-14 17:54:28

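📋 Review Summary

PR overview: Upgrades the flash_mask wheel version to resolve the nvidia-cutlass-dsl dependency conflict with flashinfer 0.6.11
Scope of changes: requirements.txt
Affected tag: [Others]

📝 PR Convention Check

The two sections `## Usage or Command` and `## Accuracy Tests` contain only HTML comments and do not explicitly state N/A, which does not meet the description template's requirement (the template asks to "fill in according to the actual situation, or N/A if none").

Title suggestion: the title format is compliant; no change is needed.

PR description suggestion (can be copied directly; it must reproduce the complete structure of the checklist §D2 template):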

## Motivation
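Upgrading flashinfer to version 0.6.11 requires nvidia-cutlass-dsl>=4.4.2 (https://github.com/PaddlePaddle/FastDeploy/pull/7799), while the old flash mask version pins nvidia-cutlass-dsl==4.4.2, which causes a conflict. This PR therefore upgrades the flash mask version.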

## Modifications
Upgrade the `flash_mask` wheel version in `requirements.txt` from `4.0.0+g4c84f74` to `4.0.0+g4a17261`, resolving the `nvidia-cutlass-dsl` dependency conflict with flashinfer 0.6.11

## Usage or Command
N/A

## Accuracy Tests
N/A

## Checklist

- [x] Add at least a tag in the PR title.
  - Tag list: [`[FDConfig]`,`[APIServer]`,`[Engine]`, `[Scheduler]`, `[PD Disaggregation]`, `[Executor]`, `[Graph Optimization]`, `[Speculative Decoding]`, `[RL]`, `[Models]`, `[Quantization]`, `[Loader]`, `[OP]`, `[KVCache]`, `[DataProcessor]`, `[BugFix]`, `[Docs]`, `[CI]`, `[Optimization]`, `[Feature]`, `[Benchmark]`, `[Others]`, `[XPU]`, `[HPU]`, `[GCU]`, `[DCU]`, `[Iluvatar]`, `[Metax]`]
  - You can add new tags based on the PR content, but the semantics must be clear.
- [x] Format your code, run `pre-commit` before commit.
- [ ] Add unit tests. Please write the reason in this PR if no unit tests.
- [ ] Provide accuracy results.
- [ ] If the current PR is submitting to the `release` branch, make sure the PR has been submitted to the `develop` branch, then cherry-pick it to the `release` branch with the `[Cherry-Pick]` PR tag.

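Issues

No blocking issues found.

Overall Assessment

The change is reasonable: upgrading the flash_mask wheel version resolves the dependency conflict between flashinfer 0.6.11 and nvidia-cutlass-dsl. The placeholders in two sections of the PR description were not replaced; it is recommended to explicitly fill in N/A as the description template requires.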

@PaddlePaddle-bot

🤖 Paddle-CI-Agent | ci_status_monitor | 2026-05-14 18:31:07

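The CI report is generated from the following code (updated every 30 minutes):


1 Task Overview

Required tasks in progress: 3 required tasks are still running, 7 have passed, and 0 have failed. Please wait for them to finish.

| Total runs (reruns) | Total tasks | ✅ Passed | ❌ Failed | ⏳ Running | ⏸️ Waiting | Skipped |
| --- | --- | --- | --- | --- | --- | --- |
| 37(0) | 37 | 31 | 2 | 3 | 1 | 0 |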

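2 Task Status Summary

2.1 Required tasks: 7/10 passed

Required tasks block merging; failures must be handled first.

| Status | Task | Duration | Root cause | Suggested fix | Log | Rerun |
| --- | --- | --- | --- | --- | --- | --- |
| | Run FastDeploy Unit Tests and Coverage / run_tests_with_coverage | - | Running | - | Job | - |
| | Extracted partial CE model tasks to run in CI. / run_ce_cases | - | Running | - | Job | - |
| | xpu_4cards_case_test / run_xpu_4cards_cases | - | Running | - | Job | - |
| | The remaining 7 required tasks passed | - | - | - | - | - |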

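2.2 Optional tasks: 24/27 passed

Optional tasks do not block merging; failures are for reference only.

| Status | Task | Duration | Log | Rerun |
| --- | --- | --- | --- | --- |
| | Run iluvatar Tests / run_iluvatar_cases | 13m24s | Job | - |
| | Check PR Template | 18s | Job | - |
| ⏸️ | CI_HPU | - | - | - |
| | The remaining 24 optional tasks passed | - | - | - |

3 Failure Details (required only)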

codecov-commenter commented May 14, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.
⚠️ Please upload report for BASE (develop@cb2d7c0). Learn more about missing BASE report.

Additional details and impacted files
@@            Coverage Diff             @@
##             develop    #7819   +/-   ##
==========================================
  Coverage           ?   63.17%           
==========================================
  Files              ?      461           
  Lines              ?    64083           
  Branches           ?     9806           
==========================================
  Hits               ?    40484           
  Misses             ?    20821           
  Partials           ?     2778           
| Flag | Coverage Δ |
| --- | --- |
| GPU | 72.30% <ø> (?) |
| XPU | 7.14% <ø> (?) |
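
As a quick sanity check on the report above, the overall percentage can be recomputed from the raw counts. This is a minimal sketch that assumes coverage is defined as hits divided by total tracked lines (hits + misses + partials); the helper name `coverage_percent` is hypothetical and not part of Codecov.

```python
# Figures copied from the Codecov diff table in this comment.
HITS, MISSES, PARTIALS = 40484, 20821, 2778
LINES = 64083


def coverage_percent(hits: int, lines: int) -> float:
    """Line coverage as a percentage, rounded to two decimals.

    Assumption: coverage = hits / total tracked lines.
    """
    return round(100 * hits / lines, 2)


# Each tracked line is counted as exactly one of hit, miss, or partial.
assert HITS + MISSES + PARTIALS == LINES

print(f"{coverage_percent(HITS, LINES):.2f}%")
```

Under that assumption the counts are internally consistent and reproduce the 63.17% shown in the Coverage row.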


Collaborator

@zoooo0820 zoooo0820 left a comment

LGTM

4 participants