Commit 06fca85

1 parent 52ecd62 commit 06fca85

1 file changed

+3
-1
lines changed

examples/models/llama/export_llama_lib.py

Lines changed: 3 additions & 1 deletion
@@ -606,7 +606,9 @@ def _prepare_for_llama_export(args) -> LLMEdgeManager:
         )
     )
 
-    # We want to do compute the actual ops in the precision of the dtype_override.
+    # We want to do compute the actual ops in the precision of the dtype_override,
+    # since the precision of the quantized linear will initially be the dtype of the
+    # checkpoint, not the dtype_override.
     def _set_precision_to_fp32(module):
         """
         Recursively iterate through the module and set the precision attribute
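
The expanded comment explains why a recursive pass is needed: quantized linear layers start out carrying the checkpoint's dtype as their precision, so each one has to be visited and updated. The body of `_set_precision_to_fp32` is not shown in this hunk, so the following is only a rough sketch of that kind of traversal, not the actual implementation. `ToyModule`, `ToyQuantizedLinear`, and `set_precision` are hypothetical stand-ins written in plain Python; they are not ExecuTorch or PyTorch classes.

```python
from dataclasses import dataclass, field
from typing import List


# Hypothetical stand-ins for illustration only; not ExecuTorch or PyTorch types.
@dataclass
class ToyModule:
    name: str
    children: List["ToyModule"] = field(default_factory=list)


@dataclass
class ToyQuantizedLinear(ToyModule):
    # Assumed to start at the checkpoint dtype, as the commit comment describes.
    precision: str = "bf16"


def set_precision(module: ToyModule, precision: str) -> int:
    """Recursively set `precision` on every module that has that attribute.

    Returns the number of modules that were updated.
    """
    updated = 0
    if hasattr(module, "precision"):
        module.precision = precision
        updated += 1
    for child in module.children:
        updated += set_precision(child, precision)
    return updated


# A small nested tree: one quantized linear at the top level, one inside a block.
model = ToyModule(
    "root",
    [
        ToyQuantizedLinear("proj_in"),
        ToyModule("block", [ToyQuantizedLinear("proj_out")]),
    ],
)
count = set_precision(model, "fp32")
```

Modules without a `precision` attribute are left untouched, so the pass can be run over a whole model without knowing in advance where the quantized layers sit.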
