forked from quic/efficient-transformers
subfunction_120b_npi.yaml
FP32NodeInstanceNames:
- onnx::Shape_139893
- onnx::Shape_140187
- onnx::Shape_144086
- onnx::Shape_144410
- onnx::Shape_883
- onnx::Shape_1215
- hidden_states.267
- hidden_states.271
- hidden_states.275
- hidden_states.279
- hidden_states.3
- hidden_states.7
- /model/norm/CustomRMSNorm_output_0
- /model/layers.0/QEffGptOssDecoderLayer_output_127
- /model/layers.1/QEffGptOssDecoderLayer.2_output_2
- /model/layers.2/QEffGptOssDecoderLayer.1_output_2
- /model/layers.3/QEffGptOssDecoderLayer.2_output_2
- /model/layers.4/QEffGptOssDecoderLayer.1_output_2
- /model/layers.5/QEffGptOssDecoderLayer.2_output_2
- /model/layers.6/QEffGptOssDecoderLayer.1_output_2
- /model/layers.7/QEffGptOssDecoderLayer.2_output_2
- /model/layers.8/QEffGptOssDecoderLayer.1_output_2
- /model/layers.9/QEffGptOssDecoderLayer.2_output_2
- /model/layers.10/QEffGptOssDecoderLayer.1_output_2
- /model/layers.11/QEffGptOssDecoderLayer.2_output_2
- /model/layers.12/QEffGptOssDecoderLayer.1_output_2
- /model/layers.13/QEffGptOssDecoderLayer.2_output_2
- /model/layers.14/QEffGptOssDecoderLayer.1_output_2
- /model/layers.15/QEffGptOssDecoderLayer.2_output_2
- /model/layers.16/QEffGptOssDecoderLayer.1_output_2
- /model/layers.17/QEffGptOssDecoderLayer.2_output_2
- /model/layers.18/QEffGptOssDecoderLayer.1_output_2
- /model/layers.19/QEffGptOssDecoderLayer.2_output_2
- /model/layers.20/QEffGptOssDecoderLayer.1_output_2
- /model/layers.21/QEffGptOssDecoderLayer.2_output_2
- /model/layers.22/QEffGptOssDecoderLayer.1_output_2
- /model/layers.23/QEffGptOssDecoderLayer.2_output_2
- /model/layers.24/QEffGptOssDecoderLayer.1_output_2
- /model/layers.25/QEffGptOssDecoderLayer.2_output_2
- /model/layers.26/QEffGptOssDecoderLayer.1_output_2
- /model/layers.27/QEffGptOssDecoderLayer.2_output_2
- /model/layers.28/QEffGptOssDecoderLayer.1_output_2
- /model/layers.29/QEffGptOssDecoderLayer.2_output_2
- /model/layers.30/QEffGptOssDecoderLayer.1_output_2
- /model/layers.31/QEffGptOssDecoderLayer.2_output_2
- /model/layers.32/QEffGptOssDecoderLayer.1_output_2
- /model/layers.33/QEffGptOssDecoderLayer.2_output_2
- /model/layers.34/QEffGptOssDecoderLayer.1_output_2
- /model/layers.35/QEffGptOssDecoderLayer.2_output_2