Skip to content

Commit 31f6801

Browse files
author
ssjia
committed
Update on "[ET-VK] Implemement linear_dq8ta_q4gsw"
Title says it all! Build upon the support for quantized linear introduced in the previous diffs to enable dynamically quantized linear. Also included in this diff is a cleanup of the glslh files used across quantized linear implementations. Differential Revision: [D81931060](https://our.internmc.facebook.com/intern/diff/D81931060/) [ghstack-poisoned]
1 parent 45d9b43 commit 31f6801

File tree

2 files changed

+2
-0
lines changed

2 files changed

+2
-0
lines changed

.github/workflows/pull.yml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -939,6 +939,7 @@ jobs:
939939
# Run e2e testing for selected operators. More operators will be tested via this
940940
# route in the future.
941941
python -m unittest backends/vulkan/test/test_vulkan_delegate.py -k "*pt2e*"
942+
python -m unittest backends/vulkan/test/test_vulkan_delegate.py -k "*torchao*"
942943
943944
nxp-build-test:
944945
name: nxp-build-test

backends/vulkan/test/test_vulkan_delegate.py

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2651,6 +2651,7 @@ def forward(self, x):
26512651
rtol=1e-1,
26522652
)
26532653

2654+
@unittest.skip("Cannot run on swiftshader due to no 8-bit int support")
26542655
def test_vulkan_backend_torchao_8da4w_quantized_linear(self):
26552656
"""
26562657
Test TorchAO 8da4w quantization (int8 dynamic activation + int4 weight) with Vulkan backend.

0 commit comments

Comments
 (0)