Skip to content

Commit 3a43b1e

Browse files
committed
default attn_implementaion to eager to avoid issues
Signed-off-by: Zhiyu Cheng <[email protected]>
1 parent 2fd67cc commit 3a43b1e

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

examples/llm_ptq/hf_ptq.py

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -748,10 +748,10 @@ def output_decode(generated_ids, input_shape):
748748
parser.add_argument(
749749
"--attn_implementation",
750750
help=(
751-
"Specify the attention implementation to use."
751+
"Specify the attention implementation to use. "
752752
"This arg will be passed to the HF model loading if specified."
753753
),
754-
default=None,
754+
default="eager",
755755
type=str,
756756
)
757757

0 commit comments

Comments
 (0)