
Conversation

@chunghow-qti

Description

This change adds batch-multiplier support to the QNN HTP backend, allowing the runtime batch size to differ from the compiled batch size as long as the runtime batch size is an integer multiple of the compiled batch size.
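For intuition, here is a minimal sketch of that divisibility rule; the function name and error handling are illustrative stand-ins, not the EP's actual code:

```cpp
#include <cstdint>
#include <stdexcept>

// Hypothetical helper: derive the batch multiplier from the two batch sizes.
uint32_t ComputeBatchMultiplier(uint32_t runtime_batch, uint32_t compiled_batch) {
  if (compiled_batch == 0 || runtime_batch == 0 ||
      runtime_batch % compiled_batch != 0) {
    throw std::invalid_argument(
        "runtime batch size must be a positive multiple of the compiled batch size");
  }
  // e.g. compiled batch 2, runtime batch 8 -> the multiplier is 4
  return runtime_batch / compiled_batch;
}
```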

Included in this change:

  • Skip input/output validation in the inference session to accommodate varying batch sizes when the batch multiplier option is used.
  • Verify that disable_cpu_ep_fallback is set and the HTP backend is selected when the batch multiplier option is used, since the batch multiplier is only supported when the entire graph runs on QNN.
  • Ensure that the HTP backend is in use inside QnnModel::ExecuteGraph.
  • Modify the batch dimension of qnn_inputs and qnn_outputs when the batch multiplier is triggered, as sketched below.
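As a rough sketch of that last point, the batch dimension of each graph input and output descriptor could be scaled by the multiplier before execution. `TensorDesc` here is a hypothetical stand-in for the EP's actual QNN tensor wrappers:

```cpp
#include <cstdint>
#include <vector>

// Hypothetical stand-in for the EP's QNN tensor wrappers.
struct TensorDesc {
  std::vector<uint32_t> dims;  // dims[0] is assumed to be the batch dimension
};

// Scale the batch dimension of every input and output so that a graph
// compiled with batch size B executes on batch size B * batch_multiplier.
void ApplyBatchMultiplier(std::vector<TensorDesc>& qnn_inputs,
                          std::vector<TensorDesc>& qnn_outputs,
                          uint32_t batch_multiplier) {
  for (auto& t : qnn_inputs)  t.dims[0] *= batch_multiplier;
  for (auto& t : qnn_outputs) t.dims[0] *= batch_multiplier;
}
```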

Motivation and Context

A Contributor commented on the following diff context:

// log evaluation start to trace logging provider
env.GetTelemetryProvider().LogEvaluationStart(session_id_);

#ifdef USE_QNN

Considering that we're moving towards plugin EPs, we should avoid EP-specific code in the core onnxruntime library. Otherwise, we would need a special build of onnxruntime.dll that works with the plugin QNN EP.
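For illustration, one EP-agnostic shape this could take is a generic capability query instead of a compile-time guard; every name below is hypothetical, not an existing ONNX Runtime API:

```cpp
#include <memory>
#include <vector>

// Hypothetical interface: any EP (built-in or plugin) can report that it
// supports runtime batch sizes that differ from the compiled batch size.
struct ExecutionProviderIface {
  virtual ~ExecutionProviderIface() = default;
  virtual bool AllowsRuntimeBatchResize() const { return false; }
};

// Core session code would then query the capability generically, with no
// EP-specific #ifdef in onnxruntime itself.
bool ShouldSkipBatchValidation(
    const std::vector<std::unique_ptr<ExecutionProviderIface>>& providers) {
  for (const auto& ep : providers) {
    if (ep->AllowsRuntimeBatchResize()) return true;
  }
  return false;
}
```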
