@@ -17,29 +17,29 @@ Here is a basic example for exporting Qwen 3, although please refer to the Llama
1717Export 0.6b to XNNPack, quantized with 8da4w:
1818```
1919python -m extension.llm.export.export_llm \
20- --config examples/models/qwen3/config/qwen3_xnnpack_q8da4w.yaml
20+ --config examples/models/qwen3/config/qwen3_xnnpack_q8da4w.yaml \
2121 +base.model_class="qwen3_0_6b" \
2222 +base.params="examples/models/qwen3/config/0_6b_config.json" \
23- +export.output_name="qwen3_0_6b.pte" \
23+ +export.output_name="qwen3_0_6b.pte"
2424
2525```
2626
2727Export 1.7b to XNNPack, quantized with 8da4w:
2828```
2929python -m extension.llm.export.export_llm \
30- --config examples/models/qwen3/config/qwen3_xnnpack_q8da4w.yaml
30+ --config examples/models/qwen3/config/qwen3_xnnpack_q8da4w.yaml \
3131 +base.model_class="qwen3_1_7b" \
3232 +base.params="examples/models/qwen3/config/1_7b_config.json" \
33- +export.output_name="qwen3_1_7b.pte" \
33+ +export.output_name="qwen3_1_7b.pte"
3434```
3535
3636Export 4b to XNNPack, quantized with 8da4w:
3737```
3838python -m extension.llm.export.export_llm \
39- --config examples/models/qwen3/config/qwen3_xnnpack_q8da4w.yaml
39+ --config examples/models/qwen3/config/qwen3_xnnpack_q8da4w.yaml \
4040 +base.model_class="qwen3_4b" \
4141 +base.params="examples/models/qwen3/config/4b_config.json" \
42- +export.output_name="qwen3_4b.pte" \
42+ +export.output_name="qwen3_4b.pte"
4343```
4444
4545### Example run
0 commit comments