|
| 1 | +# CI Configuration Files for LLM Export |
| 2 | + |
| 3 | +This directory contains YAML configuration files used by CI tests for exporting LLM models with the new `extension.llm.export.export_llm` command. |
| 4 | + |
| 5 | +## Usage |
| 6 | + |
| 7 | +These config files can be used with the export command like this: |
| 8 | + |
| 9 | +```bash |
| 10 | +python -m extension.llm.export.export_llm --config path/to/config.yaml |
| 11 | +``` |
| 12 | + |
| 13 | +Or you can override specific parameters: |
| 14 | + |
| 15 | +```bash |
| 16 | +python -m extension.llm.export.export_llm --config ci_stories110m_xnnpack_quantized.yaml base.checkpoint=my_checkpoint.pt |
| 17 | +``` |
| 18 | + |
| 19 | +## Configuration Files |
| 20 | + |
| 21 | +### CI Test Configurations |
| 22 | +- `ci_stories110m_xnnpack_quantized.yaml` - Stories110M with XNNPACK quantization (used in test_llama.sh) |
| 23 | +- `ci_stories110m_mps.yaml` - Stories110M with MPS backend |
| 24 | +- `ci_stories110m_coreml.yaml` - Stories110M with CoreML backend |
| 25 | +- `ci_stories110m_qnn.yaml` - Stories110M with QNN backend |
| 26 | + |
| 27 | +### Performance Test Configurations |
| 28 | +- `llama3_spinquant.yaml` - Llama3 with SpinQuant (used in apple-perf.yml, android-perf.yml) |
| 29 | +- `llama3_qlora.yaml` - Llama3 with QLoRA (QAT + LoRA) |
| 30 | +- `llama3_coreml_ane.yaml` - Llama3 with CoreML ANE |
| 31 | +- `xnnpack_8da4w_basic.yaml` - Basic XNNPACK 8da4w quantization |
| 32 | +- `qwen3_xnnpack_8da4w.yaml` - Qwen3 with XNNPACK 8da4w quantization |
| 33 | + |
| 34 | +### Specialized Configurations |
| 35 | +- `stories110m_torchao_lowbit.yaml` - Stories110M with TorchAO lowbit quantization |
| 36 | +- `xnnpack_custom_quantized.yaml` - XNNPACK with custom ops and quantization |
| 37 | + |
| 38 | +## Background |
| 39 | + |
| 40 | +These configuration files were created as part of migrating CI tests from the old `examples.models.llama.export_llama` command to the new `extension.llm.export.export_llm` command with hydra configuration support. |
| 41 | + |
| 42 | +The config files help reduce duplication in CI scripts and make it easier to maintain consistent export settings across different test scenarios. |
0 commit comments