Skip to content

Conversation

@liubo-intel
Copy link
Collaborator

cpu: x64: enable f16 dst for s8/u8 inputs in matmul

cpu: enable f16 dst for s8/u8 inputs in ref ip

cpu: enable f16 dst for s8/u8 inputs in mm-based ip

benchdnn: inputs: matmul: swap src/wei types to target optimized impls

Description

Please include a summary of the change. Please also include relevant motivation and context. See contribution guidelines for more details. If the change fixes an issue not documented in the project's Github issue tracker, please document all steps necessary to reproduce it.

Fixes # (github issue)

Checklist

General

  • Do all unit and benchdnn tests (make test and make test_benchdnn_*) pass locally for each commit?
  • Have you formatted the code using clang-format?

Performance improvements

  • Have you submitted performance data that demonstrates performance improvements?

New features

  • Have you published an RFC for the new feature?
  • Was the RFC approved?
  • Have you added relevant tests?

Bug fixes

  • Have you included information on how to reproduce the issue (either in a github issue or in this PR)?
  • Have you added relevant regression tests?

RFC PR

  • Does RFC document follow the template?
  • Have you added a link to the rendered document?

@liubo-intel liubo-intel force-pushed the liubo/cherry-pick-upstream-for-fc-s8s8f16-support branch from f947b16 to 8db5d5a Compare September 8, 2025 11:41
@maxnick maxnick merged commit 8db5d5a into v3.8_for_ie_master Sep 12, 2025
6 of 12 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants