danielvegamyhre
Contributor

@danielvegamyhre danielvegamyhre commented Oct 17, 2025

Stacked PRs:


[mxfp8 moe training] integrate triton quant/dequant kernels into mxfp8 all to all

Test plan

  • pytest test/prototype/moe_training/mxfp8/test_mxfp8_a2a.py -k ToMXFP8AllToAllVDequantTest -s

Benchmarks

Perf is roughly flat versus the bf16 baseline (all times in ms):

input_shape         num_splits    fwd_bf16_ms    fwd_mxfp8_ms    bwd_bf16_ms    bwd_mxfp8_ms
----------------  ------------  -------------  --------------  -------------  --------------
(16, 8192, 5120)             4        2.38517         2.59848         7.5735          7.7693
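
To put the "flat" claim in numbers, the relative overhead of mxfp8 over bf16 can be computed directly from the row above. This is only a quick sanity-check sketch: the timings are copied from the table, and the helper name `relative_overhead` is illustrative and not part of torchao.

```python
# Timings copied from the benchmark table above (milliseconds),
# for input_shape=(16, 8192, 5120), num_splits=4.
timings_ms = {
    "fwd": {"bf16": 2.38517, "mxfp8": 2.59848},
    "bwd": {"bf16": 7.5735, "mxfp8": 7.7693},
}


def relative_overhead(baseline_ms: float, candidate_ms: float) -> float:
    """Return the candidate's extra time as a fraction of the baseline.

    Hypothetical helper for illustration only.
    """
    return candidate_ms / baseline_ms - 1.0


overheads = {
    direction: relative_overhead(t["bf16"], t["mxfp8"])
    for direction, t in timings_ms.items()
}

for direction, overhead in overheads.items():
    print(f"{direction}: mxfp8 is {overhead:+.1%} vs bf16")
# fwd: mxfp8 is +8.9% vs bf16
# bwd: mxfp8 is +2.6% vs bf16
```

For this single shape, mxfp8 is slightly slower than bf16 in both directions, with the larger gap in the forward pass.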


pytorch-bot bot commented Oct 17, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3197

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure, 1 Unrelated Failure

As of commit 710192d with merge base b644211:

NEW FAILURE - The following job has failed:

BROKEN TRUNK - The following job failed but was also failing on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

danielvegamyhre added a commit that referenced this pull request Oct 17, 2025
…8 all to all

stack-info: PR: #3197, branch: danielvegamyhre/stack/79
@danielvegamyhre danielvegamyhre force-pushed the danielvegamyhre/stack/79 branch from e8acfbb to 2885c91 Compare October 17, 2025 01:09
@meta-cla meta-cla bot added the CLA Signed label Oct 17, 2025
@danielvegamyhre danielvegamyhre added the mx, topic: not user facing, and moe labels Oct 17, 2025
@danielvegamyhre danielvegamyhre changed the base branch from danielvegamyhre/stack/78 to main October 17, 2025 01:36
danielvegamyhre added a commit that referenced this pull request Oct 17, 2025
…8 all to all

stack-info: PR: #3197, branch: danielvegamyhre/stack/79
@danielvegamyhre danielvegamyhre force-pushed the danielvegamyhre/stack/79 branch from 2885c91 to 5918398 Compare October 17, 2025 01:37
@danielvegamyhre danielvegamyhre changed the base branch from main to danielvegamyhre/stack/78 October 17, 2025 01:37
@danielvegamyhre danielvegamyhre changed the base branch from danielvegamyhre/stack/78 to main October 17, 2025 16:12
@danielvegamyhre danielvegamyhre changed the base branch from main to danielvegamyhre/stack/78 October 17, 2025 16:12
…8 all to all

stack-info: PR: #3197, branch: danielvegamyhre/stack/79
@danielvegamyhre danielvegamyhre changed the base branch from danielvegamyhre/stack/78 to main October 17, 2025 16:55
@danielvegamyhre danielvegamyhre force-pushed the danielvegamyhre/stack/79 branch from 5918398 to 710192d Compare October 17, 2025 16:55
@danielvegamyhre danielvegamyhre changed the base branch from main to danielvegamyhre/stack/78 October 17, 2025 16:55
@danielvegamyhre danielvegamyhre changed the base branch from danielvegamyhre/stack/78 to main October 17, 2025 17:23
@danielvegamyhre danielvegamyhre changed the base branch from main to danielvegamyhre/stack/78 October 17, 2025 17:23
@danielvegamyhre danielvegamyhre changed the base branch from danielvegamyhre/stack/78 to main October 17, 2025 21:04
@danielvegamyhre danielvegamyhre changed the base branch from main to danielvegamyhre/stack/78 October 17, 2025 21:05
@danielvegamyhre danielvegamyhre changed the base branch from danielvegamyhre/stack/78 to main October 17, 2025 22:02
@danielvegamyhre danielvegamyhre changed the base branch from main to danielvegamyhre/stack/78 October 17, 2025 22:02