This issue is splitted from https://github.com/intel/intel-xpu-backend-for-triton/issues/2607 LNL has DPAS support but still fails `test_dot3d`, we should look into this.