Conversation

@larryliu0820 (Collaborator)

This pull request improves device and dtype handling across the ExecuTorch export pipeline, ensuring that models, tensors, and modules are consistently placed on the correct device and use the appropriate data type. It also adds a test that exports a Whisper model with bfloat16 precision and checks the resulting file size.

Device and dtype consistency improvements:

  • Updated initialization and export logic in optimum/exporters/executorch/integrations.py to use model.device and model.dtype for all relevant tensors and modules, replacing hardcoded "cpu" and torch.float32 values. This ensures exported models and caches are created on the correct device with the correct data type (see the first sketch after this list).
  • Modified load_seq2seq_speech_model in optimum/exporters/executorch/tasks/asr.py to accept device and dtype as keyword arguments and pass them through to the underlying model-loading and export logic (see the second sketch after this list).
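
A minimal sketch of the device/dtype propagation pattern described above, assuming a Hugging Face model (which exposes .device and .dtype); the wrapper class and method names here are hypothetical, not the actual classes in integrations.py:

```python
import torch


class Seq2SeqExportWrapper(torch.nn.Module):
    """Hypothetical export wrapper illustrating device/dtype propagation."""

    def __init__(self, model):
        super().__init__()
        self.model = model
        # Derive placement from the loaded model instead of hardcoding
        # "cpu" and torch.float32, so e.g. bfloat16 models export as-is.
        self.device = model.device
        self.dtype = model.dtype

    def example_inputs(self, batch_size: int = 1, seq_len: int = 128):
        # Example tensors (and any static caches) are created on the
        # model's device with the model's dtype before export runs.
        return torch.zeros(
            batch_size,
            seq_len,
            self.model.config.hidden_size,
            device=self.device,
            dtype=self.dtype,
        )
```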
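
And a hedged sketch of the revised task-loading signature in asr.py; the keyword names follow the description above, while the body (from_pretrained plus .to()) is illustrative only:

```python
import torch
from transformers import AutoModelForSpeechSeq2Seq


def load_seq2seq_speech_model(model_name_or_path: str, **kwargs):
    # device and dtype now arrive as keyword arguments with CPU/float32
    # defaults, and are forwarded into model loading and export.
    device = kwargs.get("device", "cpu")
    dtype = kwargs.get("dtype", torch.float32)
    model = AutoModelForSpeechSeq2Seq.from_pretrained(
        model_name_or_path, torch_dtype=dtype
    ).to(device)
    return model.eval()
```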

Testing enhancements:

  • Added a new slow test test_whisper_large_v3_turbo_export_bfloat16 in tests/models/test_modeling_whisper.py that exports the Whisper large-v3-turbo model with bfloat16 precision, verifies the output file exists, and checks that its size is approximately 1.2 GB with a 10% tolerance (see the sketch below).
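
The size assertion can be sketched as below; the path parameter, expected size, and tolerance mirror the description above, not the exact test code:

```python
import math
import os


def check_exported_model_size(pte_path: str) -> None:
    # The exported .pte file must exist...
    assert os.path.exists(pte_path)
    # ...and weigh roughly 1.2 GB (bfloat16 weights), within 10%.
    expected_bytes = 1.2 * 1024**3
    actual_bytes = os.path.getsize(pte_path)
    assert math.isclose(actual_bytes, expected_bytes, rel_tol=0.10)
```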
