Commit 9e8f089

Varun Sundar Rabindranath (varun-sundar-rabindranath) authored and committed
[Kernels] LoRA - Retire SGMV and BGMV Kernels (#14685)
Signed-off-by: Varun Sundar Rabindranath <[email protected]>
Co-authored-by: Varun Sundar Rabindranath <[email protected]>
1 parent 16e9064 commit 9e8f089

15 files changed: 247 additions & 2094 deletions

benchmarks/kernels/benchmark_lora.py

Lines changed: 63 additions & 373 deletions
Large diffs are not rendered by default.

tests/lora/test_punica_ops.py

Lines changed: 84 additions & 359 deletions
Large diffs are not rendered by default.
Lines changed: 6 additions & 10 deletions
@@ -1,15 +1,11 @@
 # SPDX-License-Identifier: Apache-2.0
 
-from vllm.lora.ops.triton_ops.bgmv_expand import bgmv_expand
-from vllm.lora.ops.triton_ops.bgmv_expand_slice import bgmv_expand_slice
-from vllm.lora.ops.triton_ops.bgmv_shrink import bgmv_shrink
-from vllm.lora.ops.triton_ops.sgmv_expand import sgmv_expand
-from vllm.lora.ops.triton_ops.sgmv_shrink import sgmv_shrink  # noqa: F401
+from vllm.lora.ops.triton_ops.lora_expand import lora_expand
+from vllm.lora.ops.triton_ops.lora_kernel_metadata import LoRAKernelMeta
+from vllm.lora.ops.triton_ops.lora_shrink import lora_shrink
 
 __all__ = [
-    "bgmv_expand",
-    "bgmv_expand_slice",
-    "bgmv_shrink",
-    "sgmv_expand",
-    "sgmv_shrink",
+    "lora_expand",
+    "lora_shrink",
+    "LoRAKernelMeta",
 ]
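
Because this hunk drops five names from `__all__`, downstream code that still imports them will fail at import time. The following is a minimal sketch of a checker for such leftovers. It is not part of this commit: the helper name `find_retired_imports` is hypothetical, the sketch assumes the retired names were importable either from their own submodules or from the `vllm.lora.ops.triton_ops` package, and it only inspects `from ... import ...` statements.

```python
import ast

# Names removed from __all__ in the hunk above.
RETIRED = {
    "bgmv_expand",
    "bgmv_expand_slice",
    "bgmv_shrink",
    "sgmv_expand",
    "sgmv_shrink",
}
# Assumption: the package that re-exported the retired kernels.
PACKAGE = "vllm.lora.ops.triton_ops"


def find_retired_imports(source: str) -> list[tuple[int, str]]:
    """Return sorted (line number, name) pairs for imports of retired kernels.

    Hypothetical helper for illustration; plain `import a.b.c` statements
    are not inspected.
    """
    hits = []
    for node in ast.walk(ast.parse(source)):
        if not isinstance(node, ast.ImportFrom) or not node.module:
            continue
        if node.module != PACKAGE and not node.module.startswith(PACKAGE + "."):
            continue
        # First path component after the package, e.g. "bgmv_expand".
        tail = node.module[len(PACKAGE):].lstrip(".").split(".")[0]
        if tail in RETIRED:
            # Import from a retired submodule.
            hits.append((node.lineno, tail))
            continue
        # Import of a retired name from the package or a surviving submodule.
        for alias in node.names:
            if alias.name in RETIRED:
                hits.append((node.lineno, alias.name))
    return sorted(hits)
```

Running the helper over each Python file in a downstream repository and reporting any non-empty result gives a quick list of call sites to move to `lora_shrink` and `lora_expand`.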

vllm/lora/ops/triton_ops/bgmv_expand.py

Lines changed: 0 additions & 188 deletions
This file was deleted.
