New SDOT Kernels in KleidiAI #11931

@mcr229

🚀 The feature, motivation and pitch

KleidiAI introduced the following new SDOT kernels:

  • matmul_clamp_f32_qai8dxp4x4_qsi4c32p4x4_16x4_neon_dotprod
  • matmul_clamp_f32_qai8dxp1x4_qsi4c32p4x4_1x4_neon_dotprod

which significantly broaden optimized-kernel coverage across Arm CPUs. We will bring these into XNNPACK.
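The two kernel names differ only in their packing and tile fields, which is easier to see once the names are split apart. The following is a minimal sketch, not an API from KleidiAI or XNNPACK: `parse_kernel_name` and `KernelName` are hypothetical helpers, and the field labels (`lhs_format`, `rhs_pack`, `tile`, and so on) reflect an assumed reading of the naming pattern rather than a documented specification.

```python
import re
from typing import NamedTuple, Tuple


class KernelName(NamedTuple):
    """Hypothetical container; field meanings are assumptions, not a spec."""
    op: str
    out_type: str
    lhs_format: str
    lhs_pack: Tuple[int, int]
    rhs_format: str
    rhs_pack: Tuple[int, int]
    tile: Tuple[int, int]
    isa: str
    feature: str


# A packed operand field is assumed to look like "<format ending in p><A>x<B>",
# e.g. "qai8dxp4x4" -> format "qai8dxp", packing (4, 4).
_PACKED = re.compile(r"^(?P<fmt>.+?p)(?P<a>\d+)x(?P<b>\d+)$")


def _dims(text: str) -> Tuple[int, int]:
    """Turn "16x4" into (16, 4)."""
    a, b = text.split("x")
    return int(a), int(b)


def parse_kernel_name(name: str) -> KernelName:
    """Split a kernel name of the shape used in this issue into fields."""
    parts = name.split("_")
    if len(parts) != 8:
        raise ValueError(f"unexpected kernel name shape: {name!r}")
    op = "_".join(parts[:2])  # "matmul_clamp"
    out_type, lhs, rhs, tile, isa, feature = parts[2:]
    lhs_m, rhs_m = _PACKED.match(lhs), _PACKED.match(rhs)
    if lhs_m is None or rhs_m is None:
        raise ValueError(f"unexpected operand field in: {name!r}")
    return KernelName(
        op=op,
        out_type=out_type,
        lhs_format=lhs_m["fmt"],
        lhs_pack=(int(lhs_m["a"]), int(lhs_m["b"])),
        rhs_format=rhs_m["fmt"],
        rhs_pack=(int(rhs_m["a"]), int(rhs_m["b"])),
        tile=_dims(tile),
        isa=isa,
        feature=feature,
    )


if __name__ == "__main__":
    for kernel in (
        "matmul_clamp_f32_qai8dxp4x4_qsi4c32p4x4_16x4_neon_dotprod",
        "matmul_clamp_f32_qai8dxp1x4_qsi4c32p4x4_1x4_neon_dotprod",
    ):
        print(parse_kernel_name(kernel))
```

Run on the two names above, the parsed fields show that both kernels share the same right-hand-side format and packing and the same `neon_dotprod` suffix, while the left-hand-side packing and the tile differ.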

Alternatives

No response

Additional context

No response

RFC (Optional)

No response


Status: Done
