Commit 6730b0e
Fix fused rope kernel and update paddle (PaddlePaddle#2268)
* fix fused rope kernel and update paddle
* fix flag
* [Metax] Fix fused compile bug and remove some patch (PaddlePaddle#198)
* fix flags
* update disabled tests
---------
Co-authored-by: duqimeng <[email protected]>1 parent 8e2c4f8 commit 6730b0e
File tree
13 files changed
+101
-942
lines changed- backends
- iluvatar_gpu
- common
- kernels/cuda_kernels
- tests
- metax_gpu
- kernels/fusion
13 files changed
+101
-942
lines changed- paddle/cinn/backends/codegen_cuda_dev.cc+3-5
- paddle/cinn/backends/compiler.cc+404-23
- paddle/cinn/backends/compiler.h+58-2
- paddle/cinn/backends/llvm/execution_engine.cc+133-4
- paddle/cinn/backends/llvm/execution_engine.h+15-3
- paddle/cinn/cinn.h+10
- paddle/cinn/hlir/framework/graph_compiler_util.h+1
- paddle/cinn/hlir/framework/pir/compilation_cache.h+10
- paddle/cinn/hlir/framework/pir/compilation_task.cc+9-3
- paddle/cinn/hlir/framework/pir/fusion_info.cc+45-8
- paddle/cinn/hlir/framework/pir/fusion_info.h+3
- paddle/cinn/hlir/framework/pir/op_lowering_group.h+1
- paddle/cinn/hlir/framework/pir_compiler.cc+297-11
- paddle/cinn/runtime/arch_device.h+3-3
- paddle/cinn/runtime/cuda/cuda_util.cc+22-3
- paddle/cinn/runtime/cuda/cuda_util.h+6-1
- paddle/common/flags.cc+28
- paddle/fluid/pir/dialect/operator/interface/infer_symbolic_shape/unary_infer_sym.cc+68-11
- paddle/fluid/pir/dialect/operator/interface/infer_symbolic_shape/unary_infer_sym.h+9-2
- paddle/fluid/pir/serialize_deserialize/CMakeLists.txt+2-2
- paddle/fluid/pir/serialize_deserialize/patch/4.yaml
- paddle/fluid/pybind/eager_method.cc+3-1
- paddle/fluid/pybind/eager_properties.cc+4-16
- paddle/phi/kernels/funcs/blas/blas_impl.cu.h+48-18
- paddle/phi/kernels/fusion/gpu/fused_rope_grad_kernel.cu+133-127
- paddle/phi/kernels/fusion/gpu/fused_rope_kernel.cu+160-183
- paddle/phi/kernels/fusion/gpu/fused_rope_utils.h+273-448
- paddle/phi/kernels/impl/matmul_kernel_impl.h+81-24
- paddle/phi/kernels/stride/reduce_stride_base.cu.h+20
- paddle/phi/ops/yaml/ops.yaml+5
- paddle/pir/include/core/type.h+9-1
- python/paddle/distributed/flex_checkpoint/dcp/load_state_dict.py+25-7
- python/paddle/distributed/flex_checkpoint/dcp/reshard_comm.py+27-12
- python/paddle/distributed/flex_checkpoint/dcp/save_state_dict.py+2-2
- python/paddle/tensor/manipulation.py+6
- python/paddle/utils/cpp_extension/extension_utils.py+5-1
- test/auto_parallel/hybrid_strategy/semi_flexcheckpoint_merge.py+7-7
- test/ir/pir/cinn/CMakeLists.txt+3
- test/ir/pir/cinn/test_enable_cinn_kernel_cache.py+114
- test/ir/pir/cinn/test_enable_cinn_kernel_cache_2.py+114
- test/legacy_test/test_fused_rotary_position_embedding.py+47-4
- test/legacy_test/test_tensor.py+122-1
- test/legacy_test/test_tile_op.py+24
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
112 | 112 | | |
113 | 113 | | |
114 | 114 | | |
115 | | - | |
| 115 | + | |
116 | 116 | | |
117 | 117 | | |
118 | 118 | | |
| |||
128 | 128 | | |
129 | 129 | | |
130 | 130 | | |
| 131 | + | |
131 | 132 | | |
132 | 133 | | |
133 | 134 | | |
| |||
876 | 877 | | |
877 | 878 | | |
878 | 879 | | |
879 | | - | |
| 880 | + | |
880 | 881 | | |
881 | 882 | | |
882 | 883 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
277 | 277 | | |
278 | 278 | | |
279 | 279 | | |
| 280 | + | |
| 281 | + | |
| 282 | + | |
| 283 | + | |
| 284 | + | |
| 285 | + | |
| 286 | + | |
| 287 | + | |
| 288 | + | |
| 289 | + | |
| 290 | + | |
| 291 | + | |
| 292 | + | |
| 293 | + | |
| 294 | + | |
| 295 | + | |
Lines changed: 23 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
Lines changed: 0 additions & 200 deletions
This file was deleted.
Lines changed: 24 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
0 commit comments