Skip to content

Conversation

@SS-JIA
Copy link
Contributor

@SS-JIA SS-JIA commented Apr 15, 2025

Stack from ghstack (oldest at bottom):

Context

As title. Update the default compute shader for weight-only quantized int4 linear to use a tiled algorithm, which should boost performance for gemm cases, i.e. where mat1 is a matrix.

Changes

  • Changed q_4w_linear name to q_4w_linear_tiled name
  • Update the compute shader to use tiled algorithm

Using a value of 3 for TILE_ROWS; I expect to add variants which switch between different output tile configurations.

Differential Revision: D73044649

…d linear

## Context

As title. Update the default compute shader for weight-only quantized int4 linear to use a tiled algorithm, which should boost performance for `gemm` cases, i.e. where `mat1` is a matrix.

## Changes

* Changed `q_4w_linear` name to `q_4w_linear_tiled` name
* Update the compute shader to use tiled algorithm

Using a value of 3 for `TILE_ROWS`; I expect to add variants which switch between different output tile configurations.

Differential Revision: [D73044649](https://our.internmc.facebook.com/intern/diff/D73044649/)

[ghstack-poisoned]
@pytorch-bot
Copy link

pytorch-bot bot commented Apr 15, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/10205

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure

As of commit ef07365 with merge base 6d1caca (image):

NEW FAILURE - The following job has failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

SS-JIA added a commit that referenced this pull request Apr 15, 2025
…d linear

## Context

As title. Update the default compute shader for weight-only quantized int4 linear to use a tiled algorithm, which should boost performance for `gemm` cases, i.e. where `mat1` is a matrix.

## Changes

* Changed `q_4w_linear` name to `q_4w_linear_tiled` name
* Update the compute shader to use tiled algorithm

Using a value of 3 for `TILE_ROWS`; I expect to add variants which switch between different output tile configurations.

Differential Revision: [D73044649](https://our.internmc.facebook.com/intern/diff/D73044649/)

ghstack-source-id: 278225005
Pull Request resolved: #10205
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Apr 15, 2025
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D73044649

@facebook-github-bot facebook-github-bot merged commit b9c8c82 into gh/SS-JIA/213/base Apr 16, 2025
82 of 87 checks passed
@facebook-github-bot facebook-github-bot deleted the gh/SS-JIA/213/head branch April 16, 2025 18:51
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. fb-exported topic: not user facing

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants