You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@@ -195,6 +209,38 @@ List of model specific and optional passes:
195
209
- InsertCastForOpsWithInt64InputPass
196
210
- Functionality:
197
211
- For LLMs such as LLama, some opeartors like aten.embedding have int64 input. In order to lower these operators to TOSA, this pass will insert a casting node that converts the input from int64 to int32.
198
-
- Example usage: backends/arm/test/models/test_llama.py
199
212
- Supported Ops:
200
213
- aten.embedding.default, aten.slice_copy.Tensor
214
+
- Example usage:
215
+
- backends/arm/test/models/test_llama.py
216
+
217
+
- ConvertInt64ConstOpsToInt32Pass
218
+
- Functionalities:
219
+
- Rewrites constant-producing ops that output int64 to instead output int32, when values are within int32 bounds.
0 commit comments