Update on "[ET-VK][int4] patch 4-bit linear op for ensuring w-packed in/out"

Nathanael See · Nathanael See · commit 2d32f6f8f762 · 2025-02-05T14:19:35.000-08:00
If the partitioner is using channels-packed setting for activations, then the checks will throw. Remove the checks and conditionally re-pack the input/output tensors if they are not width-packed. Differential Revision: [D68813946](https://our.internmc.facebook.com/intern/diff/D68813946/) [ghstack-poisoned]
diff --git a/backends/vulkan/runtime/graph/ops/impl/QuantizedLinear.cpp b/backends/vulkan/runtime/graph/ops/impl/QuantizedLinear.cpp
@@ -352,8 +352,8 @@ void add_q_4w_linear_node(
       local_wg_size,
       // Inputs and Outputs
       {{out_W_packed, vkapi::MemoryAccessType::WRITE},
-       {{mat1_W_packed, mat2, scales_and_zeros}, 
-       vkapi::MemoryAccessType::READ}},
+       {{mat1_W_packed, mat2, scales_and_zeros},
+        vkapi::MemoryAccessType::READ}},
       // Shader params buffers
       ubos,
       // Specialization Constants