Metal backend: Enable Float16 #15947
base: main
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/15947
Note: Links to docs will display an error until the docs builds have been completed. This comment was automatically generated by Dr. CI and updates every 15 minutes.
Pull request overview
This PR enables Float16 (Half precision) support in the Metal backend, complementing the existing BFloat16 support. The changes add Float16 as a supported data type across the Metal backend infrastructure and provide conversion utilities for transforming Float32 tensors to Float16.
Key Changes:
- Added a convert_to_float16 utility function mirroring the existing convert_to_bfloat16 pattern (a minimal sketch of this conversion pattern follows the list)
- Enabled the Float16 dtype code (5) in the Metal backend type system and validation
- Extended Metal operations (matrix multiplication, convolution, attention) to handle Float16 data
- Updated CI/CD workflows to test both float16 and bfloat16 dtypes
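As a rough illustration of the conversion pattern described in the first bullet (not the actual util.h implementation, which operates on ExecuTorch tensors), a float32 buffer can be narrowed element-wise to half precision. _Float16 is a Clang/GCC extension used here only to keep the sketch self-contained:

```cpp
#include <cstddef>
#include <vector>

// Illustrative only: narrow a float32 buffer to IEEE-754 half precision.
// The real convert_to_float16 in extension/llm/runner/util.h works on
// ExecuTorch tensors; _Float16 stands in for the runtime's Half type.
std::vector<_Float16> convert_to_float16(const std::vector<float>& src) {
  std::vector<_Float16> dst(src.size());
  for (size_t i = 0; i < src.size(); ++i) {
    dst[i] = static_cast<_Float16>(src[i]);  // narrowing float -> half conversion
  }
  return dst;
}
```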
Reviewed changes
Copilot reviewed 11 out of 11 changed files in this pull request and generated 2 comments.
Summary per file:
| File | Description |
|---|---|
| extension/llm/runner/util.h | Added convert_to_float16 helper function for Float→Half tensor conversion |
| extension/asr/runner/runner.cpp | Added Float16 conversion support in ASR audio feature preprocessing |
| backends/apple/metal/runtime/shims/utils.h | Uncommented FLOAT16 enum value to enable the Float16 dtype |
| backends/apple/metal/runtime/shims/utils.cpp | Added Float16 to supported-dtype validation |
| backends/apple/metal/runtime/shims/et_metal_ops.mm | Added Float16 handling in mm, convolution, and attention operations |
| backends/aoti/utils.h | Added dtype code 5 mapping to ScalarType::Half |
| backends/aoti/common_shims.h | Added aoti_torch_dtype_float16 function declaration |
| backends/aoti/common_shims.cpp | Implemented aoti_torch_dtype_float16 returning dtype code 5 |
| .github/workflows/metal.yml | Added dtype matrix parameter (float16/bfloat16) for testing |
| .github/workflows/cuda.yml | Added bfloat16 dtype parameter to script calls for consistency |
| .ci/scripts/export_model_artifact.sh | Added dtype parameter and validation to export script |
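For context on the three AOTI rows above, a minimal sketch of how the pieces fit together, assuming c10-style scalar type codes; only the function name aoti_torch_dtype_float16 and the code 5 come from this PR, the rest is illustrative:

```cpp
#include <cstdint>

// Illustrative c10-style scalar type codes; only Half = 5 is stated in the PR.
enum class ScalarType : int32_t { Half = 5, Float = 6, BFloat16 = 15 };

// Shape of the shim described above: common_shims.h declares it,
// common_shims.cpp returns the numeric code that utils.h maps to Half.
inline int32_t aoti_torch_dtype_float16() {
  return static_cast<int32_t>(ScalarType::Half);  // dtype code 5
}
```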
.github/workflows/cuda.yml (outdated)
      ref: ${{ github.event_name == 'pull_request' && github.event.pull_request.head.sha || github.sha }}
      script: |
-       source .ci/scripts/test_model_e2e.sh cuda "${{ matrix.model.repo }}/${{ matrix.model.name }}" "${{ matrix.quant }}" "${RUNNER_ARTIFACT_DIR}"
+       source .ci/scripts/test_model_e2e.sh cuda bfloat16 "${{ matrix.model.repo }}/${{ matrix.model.name }}" "${{ matrix.quant }}" "${RUNNER_ARTIFACT_DIR}"
Copilot AI (Nov 21, 2025):
The script test_model_e2e.sh is being called with a new signature that includes dtype as the second parameter: test_model_e2e.sh cuda bfloat16 "${{ matrix.model.repo }}/${{ matrix.model.name }}" "${{ matrix.quant }}". However, based on the current implementation of test_model_e2e.sh (lines 60-62), it expects the old signature: <device> <hf_model> <quant_name> [model_dir]. This mismatch will cause the script to fail as it will interpret bfloat16 as the HF model name and the actual model name as the quant parameter. The test_model_e2e.sh script needs to be updated to accept and handle the dtype parameter in the same way export_model_artifact.sh was updated.
.github/workflows/metal.yml (outdated)
      echo "::endgroup::"
-     ${CONDA_RUN} bash .ci/scripts/test_model_e2e.sh metal "${{ matrix.model.repo }}/${{ matrix.model.name }}" "${{ matrix.quant }}" "${RUNNER_ARTIFACT_DIR}"
+     ${CONDA_RUN} bash .ci/scripts/test_model_e2e.sh metal "${{ matrix.dtype }}" "${{ matrix.model.repo }}/${{ matrix.model.name }}" "${{ matrix.quant }}" "${RUNNER_ARTIFACT_DIR}"
Copilot AI (Nov 21, 2025):
The script test_model_e2e.sh is being called with a new signature that includes dtype as the second parameter: test_model_e2e.sh metal "${{ matrix.dtype }}" "${{ matrix.model.repo }}/${{ matrix.model.name }}" "${{ matrix.quant }}". However, the current implementation of test_model_e2e.sh (lines 60-62) expects the old signature: <device> <hf_model> <quant_name> [model_dir]. This mismatch will cause the script to fail as it will interpret dtype as the HF model name and the actual model name as the quant parameter. The test_model_e2e.sh script needs to be updated to accept and handle the dtype parameter consistently with how export_model_artifact.sh was updated.
backends/apple/metal/runtime/shims/et_metal_ops.mm

      ET_LOG(Debug, "aoti_torch_mps_mm_out: self_tensor scalar_type=%d, SupportedDTypes::FLOAT32=%d, SupportedDTypes::FLOAT16=%d, SupportedDTypes::BFLOAT16=%d",
          dtype, static_cast<int32_t>(SupportedDTypes::FLOAT32), static_cast<int32_t>(SupportedDTypes::FLOAT16), static_cast<int32_t>(SupportedDTypes::BFLOAT16));

      if (dtype == static_cast<int32_t>(SupportedDTypes::FLOAT32)) {
I'm not very familiar with ET coding practice, but considering you have to modify this in two places, why not add a to_mps_dtype(SupportedDtypes) inline function and call it both here and a few hundred lines below?
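A minimal sketch of the helper being suggested, assuming the dtype codes follow c10 ScalarType numbering and using a placeholder enum in place of the real MPSDataType constants:

```cpp
#include <cstdint>
#include <stdexcept>

// Placeholder stand-ins; the real code would use MPSDataType and the
// backend's SupportedDTypes enum from shims/utils.h.
enum class SupportedDTypes : int32_t { FLOAT16 = 5, FLOAT32 = 6, BFLOAT16 = 15 };
enum class MpsDType { Float16, Float32, BFloat16 };

inline MpsDType to_mps_dtype(int32_t dtype) {
  switch (static_cast<SupportedDTypes>(dtype)) {
    case SupportedDTypes::FLOAT32:
      return MpsDType::Float32;
    case SupportedDTypes::FLOAT16:
      return MpsDType::Float16;
    case SupportedDTypes::BFLOAT16:
      return MpsDType::BFloat16;
    default:
      throw std::runtime_error("unsupported dtype code");
  }
}
```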
    } // namespace llm
    } // namespace extension
    } // namespace executorch
Nit: if ET is a C++17 compatible project, why not use nested namespaces?
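For reference, the C++17 nested-namespace form would collapse the three closing braces above into one:

```cpp
// C++17 nested namespace definition, equivalent to the three
// separately opened and closed namespaces in the diff above.
namespace executorch::extension::llm {
// ... declarations ...
} // namespace executorch::extension::llm
```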
Enables Float16 in the Metal backend.