[etLLM][Config, Part1] Convert Args to DictConfig #9450

iseeyuan · 2025-03-20T14:25:09Z

Summary

1. Add structure to the complicated args.
2. Convert args to a DictConfig, to decouple the code from the CLI args.
3. Pass only the needed sub-configs to functions (instead of the full args).
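
As a rough illustration of the three points above, the sketch below groups a flat argparse namespace into nested sub-configs and hands a function only the group it needs. It uses plain dicts so that it runs without extra dependencies, whereas the PR itself uses OmegaConf's DictConfig. The `kv_cache` group and its three keys appear in the diff; the parser subset, the `export` group name, and the `describe_kv_cache` helper are hypothetical.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    # A small, illustrative subset of flags; not the real export_llama parser.
    parser = argparse.ArgumentParser()
    parser.add_argument("--output-dir", default=".")
    parser.add_argument("-kv", "--use_kv_cache", action="store_true")
    parser.add_argument("--quantize_kv_cache", action="store_true")
    parser.add_argument("--use_sdpa_with_kv_cache", action="store_true")
    return parser


def args_to_config(args: argparse.Namespace) -> dict:
    # Group related flags so callers no longer depend on the flat CLI namespace.
    return {
        "export": {"output_dir": args.output_dir},  # hypothetical group name
        "kv_cache": {
            "use_kv_cache": args.use_kv_cache,
            "quantize_kv_cache": args.quantize_kv_cache,
            "use_sdpa_with_kv_cache": args.use_sdpa_with_kv_cache,
        },
    }


def describe_kv_cache(kv_cache: dict) -> str:
    # Hypothetical consumer: it receives only the kv_cache sub-config, not all args.
    if not kv_cache["use_kv_cache"]:
        return "kv cache disabled"
    kind = "quantized" if kv_cache["quantize_kv_cache"] else "full-precision"
    return f"{kind} kv cache"


args = build_parser().parse_args(
    ["-kv", "--use_sdpa_with_kv_cache", "--output-dir", "tmp"]
)
config = args_to_config(args)
print(describe_kv_cache(config["kv_cache"]))
```

Because `describe_kv_cache` takes only the sub-config, it can be called from code that never builds a CLI parser at all.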

Test plan

python3 -m examples.models.llama.export_llama -c stories110M.pt -p params.json -d fp32 -n tinyllama_xnnpack+custom_fp32_main.pte -kv -X --xnnpack-extended-ops -qmode 8da4w -G 128 --use_sdpa_with_kv_cache --output-dir tmp -E 8,64

pytorch-bot · 2025-03-20T14:25:14Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/9450

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 4 New Failures

As of commit bccc2e3 with merge base 76ae537:

NEW FAILURES - The following jobs have failed:

- Lint / lintrunner / linux-job (gh)
  >>> Lint for examples/models/llama/source_transformation/quantize.py:
- pull / test-llama-runner-qnn-linux (fp32, qnn_16a16w, qnn) / linux-job (gh)
  RuntimeError: Command docker exec -t b433c13e52a45f9cee3bfc0d36acd5b3b5842f90b94bfd56c9f8c75d4ad876c8 /exec failed with exit code 1
- pull / test-phi-3-mini-runner-linux / linux-job (gh)
  RuntimeError: Command docker exec -t 9dd1b62f6f179fa55b77549d3240cb1ea05bfb1783e9c75ad1a665d47bc8aeaf /exec failed with exit code 1
- pull / unittest-arm / linux-job (gh)
  backends/arm/test/ops/test_log.py::TestLog::test_log_tosa_BI_3_randn_pos

This comment was automatically generated by Dr. CI and updates every 15 minutes.

iseeyuan · 2025-03-20T14:27:01Z

@pytorchbot label "topic: not user facing"

facebook-github-bot · 2025-03-20T16:33:42Z

@iseeyuan has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot · 2025-03-21T02:31:43Z

@iseeyuan has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot · 2025-03-21T14:35:40Z

@iseeyuan has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

larryliu0820 · 2025-03-21T18:08:27Z

examples/models/llama/eval_llama_lib.py

+    config.eval = OmegaConf.create()
+    config.eval.tasks = args.tasks
+    config.eval.limit = args.limit
+    config.eval.num_fewshot = args.num_fewshot
+    config.eval.pte = args.pte
+    config.eval.tokenizer_bin = args.tokenizer_bin
+    config.eval.output_eager_checkpoint_file = args.output_eager_checkpoint_file


Sorry, is there a definition of the config?

iseeyuan · 2025-03-21

This PR is just the first step: it converts args to a config to unblock internal work, where the configs can be used standalone, without CLI args or a YAML file. More context in #9449.
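
One way to address the question about a config definition, offered as a sketch and not as what this PR does: a dataclass can serve as an explicit schema for the `eval` group, and it can be built directly in code with no CLI arguments or YAML file. The field names come from the diff above; the class name `EvalConfig`, the types, the defaults, and the sample values are assumptions.

```python
from dataclasses import asdict, dataclass, field
from typing import List, Optional


@dataclass
class EvalConfig:
    # Field names mirror config.eval.* in the diff; types and defaults are guesses.
    tasks: List[str] = field(default_factory=list)
    limit: Optional[int] = None
    num_fewshot: Optional[int] = None
    pte: Optional[str] = None
    tokenizer_bin: Optional[str] = None
    output_eager_checkpoint_file: Optional[str] = None


# Built standalone: no argparse namespace and no YAML file involved.
eval_config = EvalConfig(tasks=["wikitext"], limit=10)
print(asdict(eval_config))
```

OmegaConf can also build a typed config from a dataclass like this one via `OmegaConf.structured`, which would keep the DictConfig interface while giving the fields a single definition.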

facebook-github-bot · 2025-03-23T00:05:41Z

@iseeyuan has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

jackzhxng · 2025-03-24T13:15:25Z

examples/models/llama/export_llama_lib.py

+        "output_dir": args.output_dir,
+        "checkpoint": args.checkpoint,
+        "checkpoint_dir": args.checkpoint_dir,
+        "tokenizer_path": args.tokenizer_path,


This is only needed during export for quant calibration, so it should go in args_dict["calibration"]. Or, since these args are also used in different runners, I'd prefer to have args_dict["tokenizer"]; we can combine it with tokenizer_config_path, which the eager runner uses.

jackzhxng · 2025-03-24T13:19:52Z

examples/models/llama/export_llama_lib.py

+    args_dict["kv_cache"] = {
+        "use_kv_cache": args.use_kv_cache,
+        "quantize_kv_cache": args.quantize_kv_cache,
+        "use_sdpa_with_kv_cache": args.use_sdpa_with_kv_cache,


This arg is a bit poorly named now that the custom sdpa op is decoupled from the kv cache; it should move to args_dict["misc"]. We should rename this arg @kimishpatel
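
The two regrouping suggestions in jackzhxng's review comments can be sketched as follows. The group names `tokenizer` and `misc` are the ones proposed in the comments; the stand-in `args` object and its sample values are hypothetical, and a plain dict is used in place of the PR's actual `args_dict` construction.

```python
from types import SimpleNamespace

# Stand-in for the parsed CLI args; the values are made up for illustration.
args = SimpleNamespace(
    tokenizer_path="tokenizer.model",
    tokenizer_config_path="tokenizer_config.json",
    use_kv_cache=True,
    quantize_kv_cache=False,
    use_sdpa_with_kv_cache=True,
)

args_dict = {
    # Tokenizer settings shared by export-time calibration and the runners.
    "tokenizer": {
        "tokenizer_path": args.tokenizer_path,
        "tokenizer_config_path": args.tokenizer_config_path,
    },
    # Only flags that are really about the kv cache stay in this group.
    "kv_cache": {
        "use_kv_cache": args.use_kv_cache,
        "quantize_kv_cache": args.quantize_kv_cache,
    },
    # The custom SDPA op is decoupled from the kv cache, so its flag moves out.
    "misc": {"use_sdpa_with_kv_cache": args.use_sdpa_with_kv_cache},
}
print(sorted(args_dict))
```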

Summary:
1. Add structure to the complicated args
2. Convert args to DictConfig, to decouple the cli args
3. Pass needed sub configs to functions (instead of args)

Test Plan:
```
python3 -m examples.models.llama.export_llama -c stories110M.pt -p params.json -d fp32 -n tinyllama_xnnpack+custom_fp32_main.pte -kv -X --xnnpack-extended-ops -qmode 8da4w -G 128 --use_sdpa_with_kv_cache --output-dir tmp -E 8,64
```

Reviewed By: larryliu0820

Differential Revision: D71557301

Pulled By: jackzhxng

Summary:
Pull Request resolved: #9717

1. Add structure to the complicated args
2. Convert args to DictConfig, to decouple the cli args
3. Pass needed sub configs to functions (instead of args)

Pull Request resolved: #9450

Test Plan:
```
python3 -m examples.models.llama.export_llama -c stories110M.pt -p params.json -d fp32 -n tinyllama_xnnpack+custom_fp32_main.pte -kv -X --xnnpack-extended-ops -qmode 8da4w -G 128 --use_sdpa_with_kv_cache --output-dir tmp -E 8,64
```

Reviewed By: larryliu0820

Differential Revision: D71557301

Pulled By: jackzhxng

github-actions · 2025-08-31T00:51:23Z

Looks like this PR hasn't been updated in a while so we're going to go ahead and mark this as Stale.
Feel free to remove the Stale label if you feel this was a mistake.
If you are unable to remove the Stale label please contact a maintainer in order to do so.
If you want the bot to never mark this PR stale again, add the no-stale label.
Stale pull requests will automatically be closed after 30 days of inactivity.

github-actions · 2025-10-30T00:52:02Z

Looks like this PR hasn't been updated in a while so we're going to go ahead and mark this as Stale.
Feel free to remove the Stale label if you feel this was a mistake.
If you are unable to remove the Stale label please contact a maintainer in order to do so.
If you want the bot to never mark this PR stale again, add the no-stale label.
Stale pull requests will automatically be closed after 30 days of inactivity.

iseeyuan requested review from jackzhxng and lucylq as code owners March 20, 2025 14:25

iseeyuan linked an issue Mar 20, 2025 that may be closed by this pull request

[etLLM] New config system to export_llama #9449

Closed

pytorch-bot bot added the topic: not user facing label Mar 20, 2025

facebook-github-bot added the CLA Signed label Mar 20, 2025

iseeyuan mentioned this pull request Mar 20, 2025

Add config system to export_llama #9367

Closed

iseeyuan force-pushed the dictconfig branch from d06c82f to 6a5f434 Compare March 20, 2025 16:32

iseeyuan force-pushed the dictconfig branch 2 times, most recently from 6df85c1 to 311e925 Compare March 21, 2025 02:31

iseeyuan force-pushed the dictconfig branch from 311e925 to d12d7d1 Compare March 21, 2025 14:35

larryliu0820 reviewed Mar 21, 2025

larryliu0820 approved these changes Mar 21, 2025

iseeyuan force-pushed the dictconfig branch from d12d7d1 to 5975726 Compare March 22, 2025 22:43

[etLLM][Config, Part1] Convert Args to DictConfig

bccc2e3

iseeyuan force-pushed the dictconfig branch from 5975726 to bccc2e3 Compare March 22, 2025 23:59

jackzhxng reviewed Mar 24, 2025

jackzhxng removed a link to an issue Jun 5, 2025

[etLLM] New config system to export_llama #9449

Closed

github-actions bot added the stale label Aug 31, 2025

jackzhxng closed this Oct 30, 2025
