This repository was archived by the owner on Sep 10, 2025. It is now read-only.

Commit db88dc8

larryliu0820 authored and metascroy committed
Add import for quantized decomposed ops
Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:
1 parent 4ce0de8 commit db88dc8

1 file changed: +3 -0 lines changed

torchchat/model.py

Lines changed: 3 additions & 0 deletions
@@ -932,6 +932,9 @@ def apply_rotary_emb(x: Tensor, freqs_cis: Tensor) -> Tensor:
 from executorch.extension.pybindings import portable_lib as exec_lib
 
 # ET changed the way it's loading the custom ops so it's not included in portable_lib but has to be loaded separately.
+# For quantized_decomposed ops
+from executorch.kernels import quantized # no-qa
+# For llama::sdpa_with_kv_cache.out, preprocess ops
 from executorch.extension.llm.custom_ops import sdpa_with_kv_cache # no-qa
 
 class PTEModel(nn.Module):
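
The imports added in this hunk are never referenced by name; they are there only for the side effect of loading the op libraries. A minimal sketch of that pattern follows, assuming a hypothetical helper `load_op_libraries` (not part of torchchat or ExecuTorch) that mirrors `from package import name` and reports which op modules could not be imported rather than failing outright. Only the two module paths are taken from the diff; everything else is illustrative.

```python
import importlib


def load_op_libraries(targets):
    """Import each (package, name) pair purely for its side effects.

    Hypothetical helper: mirrors `from package import name`, which first
    looks for an attribute on the package and otherwise imports the
    submodule `package.name`. Returns (loaded, missing) lists of dotted names.
    """
    loaded, missing = [], []
    for package, name in targets:
        dotted = f"{package}.{name}"
        try:
            pkg = importlib.import_module(package)
            if not hasattr(pkg, name):
                # Fall back to importing the submodule explicitly.
                importlib.import_module(dotted)
            loaded.append(dotted)
        except ImportError:
            # Only a missing package or module is handled here; other
            # failures (for example a shared library that fails to load)
            # would still propagate.
            missing.append(dotted)
    return loaded, missing


# Module paths as they appear in the diff above.
OP_TARGETS = [
    ("executorch.kernels", "quantized"),
    ("executorch.extension.llm.custom_ops", "sdpa_with_kv_cache"),
]

loaded, missing = load_op_libraries(OP_TARGETS)
print("loaded:", loaded)
print("missing:", missing)
```

Whether a soft failure like this is preferable to a hard import error depends on the caller; the commit itself uses plain import statements.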
