This repository was archived by the owner on Sep 10, 2025. It is now read-only.

Commit d92366f

Add import for quantized decomposed ops
Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:
1 parent 713b430 commit d92366f

File tree

1 file changed

+3
-0
lines changed

torchchat/model.py

Lines changed: 3 additions & 0 deletions
@@ -961,6 +961,9 @@ def apply_rotary_emb(x: Tensor, freqs_cis: Tensor) -> Tensor:
 from executorch.extension.pybindings import portable_lib as exec_lib
 
 # ET changed the way it's loading the custom ops so it's not included in portable_lib but has to be loaded separately.
+# For quantized_decomposed ops
+from executorch.kernels import quantized # no-qa
+# For llama::sdpa_with_kv_cache.out, preprocess ops
 from executorch.extension.llm.custom_ops import sdpa_with_kv_cache # no-qa
 
 class PTEModel(nn.Module):
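
The imports added in this hunk are used only for their side effects: per the code comment, the ops are no longer bundled with portable_lib and have to be loaded separately. A minimal sketch of that pattern follows, assuming both names in the diff resolve as importable submodules. The helper `load_op_libraries` and the constant `OP_MODULES` are hypothetical names for illustration and are not part of torchchat or ExecuTorch.

```python
import importlib

# Module paths copied from the diff; whether they import cleanly depends on
# the installed ExecuTorch build (an assumption, not verified here).
OP_MODULES = (
    "executorch.kernels.quantized",  # quantized_decomposed ops
    "executorch.extension.llm.custom_ops.sdpa_with_kv_cache",  # llama::sdpa_with_kv_cache.out
)


def load_op_libraries(module_names=OP_MODULES):
    """Import each module for its side effects.

    Returns a dict mapping module name to None on success, or to the
    exception raised when the import failed.
    """
    results = {}
    for name in module_names:
        try:
            # The import itself is the point: the module is expected to
            # register its ops when it is loaded.
            importlib.import_module(name)
            results[name] = None
        except (ImportError, OSError) as exc:
            # ImportError: module missing; OSError: a shared library failed to load.
            results[name] = exc
    return results


if __name__ == "__main__":
    for name, err in load_op_libraries().items():
        print(f"{name}: {'loaded' if err is None else f'unavailable ({err})'}")
```

Collecting the failures instead of raising lets a caller report which op library is missing before attempting to run a .pte model that depends on it.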
