Phi3 runner uses TextLLMRunner #12477

larryliu0820 · 2025-07-15T04:04:17Z

Stack from ghstack (oldest at bottom):

-> Phi3 runner uses TextLLMRunner #12477

As titled, start to use TextLLMRunner to replace all text only decoder
only CPU runners in the codebase.

As titled, start to use `TextLLMRunner` to replace all text only decoder only CPU runners in the codebase. [ghstack-poisoned]

pytorch-bot · 2025-07-15T04:04:20Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/12477

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 3 New Failures, 50 Pending

As of commit 0583101 with merge base aa44c06 ():

NEW FAILURES - The following jobs have failed:

Lint / lintrunner / linux-job (gh)
pull / test-eval_llama-mmlu-linux / linux-job (gh)
RuntimeError: Command docker exec -t 9a43bafae1d7ac90551d5287ab4478357b738800b226f5c1fb81e61b8d5e663a /exec failed with exit code 1
trunk / test-coreml-delegate / macos-job (gh)
The process '/opt/homebrew/bin/git' failed with exit code 128

This comment was automatically generated by Dr. CI and updates every 15 minutes.

As titled, start to use `TextLLMRunner` to replace all text only decoder only CPU runners in the codebase. ghstack-source-id: 0340257 Pull Request resolved: #12477

As titled, start to use `TextLLMRunner` to replace all text only decoder only CPU runners in the codebase. [ghstack-poisoned]

As titled, start to use `TextLLMRunner` to replace all text only decoder only CPU runners in the codebase. ghstack-source-id: 0361280 Pull Request resolved: #12477

Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom): * #12477 * __->__ #12476 In Huggingface causal LM forward convention, `cache_position` should have the same length as `input_ids`. The previous logic will allocate `cache_position` based on method metadata which by default is equal to the maximum length of this tensor (normally max context length). Now changing the logic to align the size of `cache_position` to `input_ids`.

As titled, start to use `TextLLMRunner` to replace all text only decoder only CPU runners in the codebase. [ghstack-poisoned]

As titled, start to use `TextLLMRunner` to replace all text only decoder only CPU runners in the codebase. ghstack-source-id: fa07a7a Pull Request resolved: #12477

As titled, start to use `TextLLMRunner` to replace all text only decoder only CPU runners in the codebase. [ghstack-poisoned]

As titled, start to use `TextLLMRunner` to replace all text only decoder only CPU runners in the codebase. ghstack-source-id: a433148 Pull Request resolved: #12477

As titled, start to use `TextLLMRunner` to replace all text only decoder only CPU runners in the codebase. [ghstack-poisoned]

As titled, start to use `TextLLMRunner` to replace all text only decoder only CPU runners in the codebase. ghstack-source-id: af70a63 Pull Request resolved: #12477

@larryliu0820

This PR was created by the merge bot to help merge the original PR into the main branch. ghstack PR number: #12477 by @larryliu0820 ^ Please use this as the source of truth for the PR details, comments, and reviews ghstack PR base: https://github.com/pytorch/executorch/tree/gh/larryliu0820/69/base ghstack PR head: https://github.com/pytorch/executorch/tree/gh/larryliu0820/69/head Merge bot PR base: https://github.com/pytorch/executorch/tree/main Merge bot PR head: https://github.com/pytorch/executorch/tree/gh/larryliu0820/69/orig @diff-train-skip-merge Co-authored-by: Mengwei Liu <[email protected]>

@larryliu0820

This PR was created by the merge bot to help merge the original PR into the main branch. ghstack PR number: #12477 by @larryliu0820 ^ Please use this as the source of truth for the PR details, comments, and reviews ghstack PR base: https://github.com/pytorch/executorch/tree/gh/larryliu0820/69/base ghstack PR head: https://github.com/pytorch/executorch/tree/gh/larryliu0820/69/head Merge bot PR base: https://github.com/pytorch/executorch/tree/main Merge bot PR head: https://github.com/pytorch/executorch/tree/gh/larryliu0820/69/orig @diff-train-skip-merge Co-authored-by: Mengwei Liu <[email protected]>

Phi3 runner uses TextLLMRunner

0ada71e

As titled, start to use `TextLLMRunner` to replace all text only decoder only CPU runners in the codebase. [ghstack-poisoned]

larryliu0820 requested review from jackzhxng, kirklandsign and lucylq as code owners July 15, 2025 04:04

larryliu0820 mentioned this pull request Jul 15, 2025

Fix cache_positions tensor size in TextLLMRunner #12476

Merged

larryliu0820 added a commit that referenced this pull request Jul 15, 2025

Phi3 runner uses TextLLMRunner

0f428fa

As titled, start to use `TextLLMRunner` to replace all text only decoder only CPU runners in the codebase. ghstack-source-id: 0340257 Pull Request resolved: #12477

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jul 15, 2025

lucylq approved these changes Jul 15, 2025

View reviewed changes

larryliu0820 added the release notes: llm Changes to llm utilities label Jul 15, 2025

Update on "Phi3 runner uses TextLLMRunner"

4c18e71

As titled, start to use `TextLLMRunner` to replace all text only decoder only CPU runners in the codebase. [ghstack-poisoned]

larryliu0820 added a commit that referenced this pull request Jul 15, 2025

Phi3 runner uses TextLLMRunner

c1bb662

As titled, start to use `TextLLMRunner` to replace all text only decoder only CPU runners in the codebase. ghstack-source-id: 0361280 Pull Request resolved: #12477

Update on "Phi3 runner uses TextLLMRunner"

1a02bec

As titled, start to use `TextLLMRunner` to replace all text only decoder only CPU runners in the codebase. [ghstack-poisoned]

larryliu0820 added a commit that referenced this pull request Jul 15, 2025

Phi3 runner uses TextLLMRunner

c3ed1ae

As titled, start to use `TextLLMRunner` to replace all text only decoder only CPU runners in the codebase. ghstack-source-id: fa07a7a Pull Request resolved: #12477

Update on "Phi3 runner uses TextLLMRunner"

69ea927

As titled, start to use `TextLLMRunner` to replace all text only decoder only CPU runners in the codebase. [ghstack-poisoned]

larryliu0820 added a commit that referenced this pull request Jul 15, 2025

Phi3 runner uses TextLLMRunner

391d6ab

As titled, start to use `TextLLMRunner` to replace all text only decoder only CPU runners in the codebase. ghstack-source-id: a433148 Pull Request resolved: #12477

Update on "Phi3 runner uses TextLLMRunner"

0583101

As titled, start to use `TextLLMRunner` to replace all text only decoder only CPU runners in the codebase. [ghstack-poisoned]

larryliu0820 added a commit that referenced this pull request Jul 15, 2025

Phi3 runner uses TextLLMRunner

a6b4b7f

As titled, start to use `TextLLMRunner` to replace all text only decoder only CPU runners in the codebase. ghstack-source-id: af70a63 Pull Request resolved: #12477

larryliu0820 merged commit 82b4ada into gh/larryliu0820/69/base Jul 15, 2025
192 of 196 checks passed

larryliu0820 deleted the gh/larryliu0820/69/head branch July 15, 2025 08:01

larryliu0820 temporarily deployed to cherry-pick-bot July 15, 2025 08:01 — with GitHub Actions Inactive

pytorchbot mentioned this pull request Jul 15, 2025

Phi3 runner uses TextLLMRunner #12482

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Phi3 runner uses TextLLMRunner #12477

Phi3 runner uses TextLLMRunner #12477

Uh oh!

larryliu0820 commented Jul 15, 2025 •

edited

Loading

Uh oh!

pytorch-bot bot commented Jul 15, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Phi3 runner uses TextLLMRunner #12477

Phi3 runner uses TextLLMRunner #12477

Uh oh!

Conversation

larryliu0820 commented Jul 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Jul 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/12477

❌ 3 New Failures, 50 Pending

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

larryliu0820 commented Jul 15, 2025 •

edited

Loading

pytorch-bot bot commented Jul 15, 2025 •

edited

Loading