Skip to content

Conversation

kwen2501
Copy link

@kwen2501 kwen2501 commented Jun 12, 2024

Stack from ghstack (oldest at bottom):

Status:

  • Switched to DTensor based TP in regular tensor path
  • Result is correct, but there is a perf gap (seems to perform extra colls in the beginning, investigating)
  • TODO: switch to DTensor for quantized path too

kwen2501 added a commit that referenced this pull request Jun 12, 2024
ghstack-source-id: b55b264
Pull Request resolved: #180
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jun 12, 2024
@kwen2501 kwen2501 changed the title Use DTensor-based tensor parallel [WIP] Use DTensor-based tensor parallel Jun 12, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants