Test #1

zyfncg · 2025-11-12T12:54:47Z

PR Category

PR Types

Description

…Paddle#75708)

)

* fix comparison warning * fix

…e#75665) * 【CUDA Kernel No.39】collect_fpn_proposals算子Kernel修复 * fix index path

* Add moe_unpermute_kernel.h * 修复typo

…addlePaddle#75711)

* refractor & fix moe_permute * refractor

* fix: prevent memcpy over-read in im2col_sh1sw1dh1dw1ph1pw1 NCHW branches - Add bounds clamping for all memcpy operations in the specialized fast path - Add zero-fill for shortfall cases to ensure complete output tensor coverage - Maintain performance by using memcpy when safe, falling back to element-wise operations only when necessary * fix: prevent memcpy over-read in filter_width==1 case of im2col_sh1sw1dh1dw1ph1pw1 - Fix unsafe memcpy in NCHW path when filter_width == 1 - Prevent negative size_t conversion when output_width < plw + prw - Clamp copy size to available source span (im_width) to avoid over-read - Add zero-fill for shortfall cases to ensure complete output coverage * fix: enhance im2col_common to prevent overflow in arithmetic operations - Convert dimensions to 64-bit integers to avoid overflow during calculations - Update index calculations for col and im arrays to use 64-bit arithmetic - Ensure safe access to tensor data by checking bounds before indexing

…dlePaddle#75572)

--------- Co-authored-by: copilot-swe-agent[bot] <[email protected]> Co-authored-by: Copilot <[email protected]>

…dle#75746)

…addle#75747) --------- Co-authored-by: Nyakku Shigure <[email protected]>

…5476)

…5724) * add log * support dynamic_shape

* clean py3.8 in dockerfile - part * fix

* fix: using latest API * switch check_prim_pir ON * fix: Code Style Issue * remove: useless whitelist. * fix: code-style issue. * Update test/legacy_test/test_dropout_op.py Co-authored-by: Nyakku Shigure <[email protected]> * fix: code-style issue. --------- Co-authored-by: Nyakku Shigure <[email protected]>

* fix * fix * fix dcu

* feat: debugging info * fix: non-cuda device’s logging error. * remove: cuda version checking useless * fix: syntax error * fix: code-style issue. * fix: build error * fix: syntax error * feat: ctcloss.zero_infinity * Remove zero_infinity parameter from ctc_loss Removed the 'zero_infinity' parameter from the ctc_loss function call. * fix: code-style issue. * fix: code-style issue. ? * fix: code-style issue.

* support hf checkpoint fix support cast add id macro fix * add test and fix some bug * fix full param bug * add full param cast test --------- Co-authored-by: xingmingyyj <[email protected]>

…75642) * Add partial_concat_grad_kernel.h * Change to gpu * 修改目录 * Fix

…ddlePaddle#76027)

)

* sharding stage3 bugfix * sharding stage3 bugfix * sharding stage3 bugfix * sharding stage3 bugfix * sharding stage3 bugfix * sharding stage3 bugfix

)" (PaddlePaddle#76090)

…addlePaddle#76084) This reverts commit 9f19eef.

… dev/flashep

…4284)" (PaddlePaddle#76090) This reverts commit e2a8155.

…Paddle#74284)" (PaddlePaddle#76090)" This reverts commit e5f8345.

…into dev/flashep

Le-soleile and others added 30 commits October 10, 2025 20:12

【CUDA Kernel No.122】expand_modality_expert_id算子Kernel修复 -part (Paddle…

0f34ab6

…Paddle#75708)

【CUDA Kernel No.57】global_scatter算子Kernel修复 -part (PaddlePaddle#75699)

451814c

[XPU] Auto bump XHPC to 20251007 (PaddlePaddle#75688)

129fab3

【CUDA Kernel No.53】fused_token_prune算子Kernel修复 -part (PaddlePaddle#75701

ee159d0

)

del deprecated uts part2 (PaddlePaddle#75726)

d0c2788

[Test] Remove deprecated uts (part3) (PaddlePaddle#75730)

3c407fa

fix comparison warning (PaddlePaddle#75652)

474d1ab

* fix comparison warning * fix

【CUDA Kernel No.39】collect_fpn_proposals算子Kernel修复 -part (PaddlePaddl…

3dd52c5

…e#75665) * 【CUDA Kernel No.39】collect_fpn_proposals算子Kernel修复 * fix index path

【CUDA Kernel No.81】moe_unpermute算子Kernel修复 -part (PaddlePaddle#75644)

d30a353

* Add moe_unpermute_kernel.h * 修复typo

python2.7 change to python in pyCov_multithreading (PaddlePaddle#75669)

abb153b

add python3.13 in build_utils.sh (PaddlePaddle#75723)

fabaa95

【CUDA Kernel No.132】moe_gate_dispatch_permute_grad算子Kernel修复 -part (P…

fe2a8fc

…addlePaddle#75711)

refractor & fix moe_permute (PaddlePaddle#75725)

ea2cc97

* refractor & fix moe_permute * refractor

[XPU] support index_elementwise_get kernel (PaddlePaddle#75486)

f556d04

rename test_mkldnn_matmul_elementwise_add_fuse_pass [fluid_ops] (Pad…

4f3effe

…dlePaddle#75572)

[Test] Move cpp unittests to test directory (PaddlePaddle#75632)

2c02b6c

--------- Co-authored-by: copilot-swe-agent[bot] <[email protected]> Co-authored-by: Copilot <[email protected]>

Replace mkldnn with onednn in test_build_strategy.py (PaddlePad…

6759447

…dle#75746)

[SOT] Support builtin dispatch for is_compiled_with_onednn (PaddleP…

08fe857

…addle#75747) --------- Co-authored-by: Nyakku Shigure <[email protected]>

[CI] Add Report Preview URLs Workflow (PaddlePaddle#75687)

cf92c0c

Disable CUBLAS TF32 for default for better precision. (PaddlePaddle#7…

fcf3c3f

…5476)

【pipeparellal】 PipelineParallel support dynamic_shape (PaddlePaddle#7…

8224888

…5724) * add log * support dynamic_shape

[XPU] Auto bump XHPC to 20251010 (PaddlePaddle#75751)

402b977

cuda13 almalinux trt (PaddlePaddle#75695)

7975faf

replace mkldnn to onednn in strings (PaddlePaddle#75745)

7a86836

clean py3.8 in dockerfile (PaddlePaddle#75732)

5beed39

* clean py3.8 in dockerfile - part * fix

time string format in progress bar (PaddlePaddle#75736)

290c4da

[DeepEP] support M2N (PaddlePaddle#75582)

1990bcc

[深度对齐] dot (PaddlePaddle#75717)

a02d1aa

* fix * fix * fix dcu

aztice and others added 30 commits October 27, 2025 17:25

Pr support load hf checkpoint (PaddlePaddle#75928)

1fd2b5a

* support hf checkpoint fix support cast add id macro fix * add test and fix some bug * fix full param bug * add full param cast test --------- Co-authored-by: xingmingyyj <[email protected]>

【CUDA Kernel No.89】partial_concat_grad算子Kernel修复 -part (PaddlePaddle#…

70a0660

…75642) * Add partial_concat_grad_kernel.h * Change to gpu * 修改目录 * Fix

add SetDataType INT64 (PaddlePaddle#76017)

e05b3b1

Fix ComparePriority to satisfy strict weak ordering for std::sort (Pa…

efc6b44

…ddlePaddle#76027)

Temporary fix of moe_gat_dispatch_w_permute optest. (PaddlePaddle#76039)

feeef7e

fix test_incubate_fused_loss (PaddlePaddle#76068)

310e746

clean CUDA_ARCH_FP16_SUPPORTED - part (PaddlePaddle#76024)

d768c1a

clean CUDA_ARCH_FP16_SUPPORTED - part (PaddlePaddle#76022)

9f19eef

clean CUDA_ARCH_FP16_SUPPORTED(__CUDA_ARCH__) - part (PaddlePaddle#76021

ccdfb90

)

clean CUDA_VERSION >= 7050 (PaddlePaddle#76020)

168742e

fix typo load_static_dict (PaddlePaddle#75739)

3dbac78

Fix some tests for custom device (PaddlePaddle#76063)

3313952

sharding stage3 bugfix (PaddlePaddle#76005)

5c2e29e

* sharding stage3 bugfix * sharding stage3 bugfix * sharding stage3 bugfix * sharding stage3 bugfix * sharding stage3 bugfix * sharding stage3 bugfix

[Dy2St] Remove import of ast2 in gast.py (PaddlePaddle#76057)

f07eb72

fix cinn 0size dynshape bug (PaddlePaddle#76093)

8e10916

Revert "Update deep_ep intranode & internode kernels (PaddlePaddle#74284

e2a8155

)" (PaddlePaddle#76090)

Revert "clean CUDA_ARCH_FP16_SUPPORTED - part (PaddlePaddle#76022)" (P…

7263266

…addlePaddle#76084) This reverts commit 9f19eef.

[CUDAGraph] Remove CUDAGraph legacy unitest (PaddlePaddle#76043)

316ec54

add notify_dispatch api in deepep

7072d8a

add python api in buffer

10684bf

fix param

3896d23

add test file

873e4db

modify nvshmem

8edacce

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

6e0a5a1

… dev/flashep

Reapply "Update deep_ep intranode & internode kernels (PaddlePaddle#7…

e5f8345

…4284)" (PaddlePaddle#76090) This reverts commit e2a8155.

Add kernel of notify_combine

0b9ca97

Revert "Reapply "Update deep_ep intranode & internode kernels (Paddle…

a1c6383

…Paddle#74284)" (PaddlePaddle#76090)" This reverts commit e5f8345.

update code

1c3a399

Merge branch 'dev/flashep' of https://github.com/zhangyuqin1998/Paddle …

36be241

…into dev/flashep

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Test #1

Test #1

Uh oh!

zyfncg commented Nov 12, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

20 participants

Test #1

Are you sure you want to change the base?

Test #1

Uh oh!

Conversation

zyfncg commented Nov 12, 2025

PR Category

PR Types

Description

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

20 participants