
[Whisper][NPU] Default optimum export of whisper model does not create KV cache decoder #1728

@soumendukrg

Description


The latest optimum-cli command to download and convert the Whisper model to OpenVINO IR, as given in the README, does not create the decoder model with KV cache.

optimum-cli export openvino --trust-remote-code --model openai/whisper-base whisper-base

Since the NPU inference pipeline needs the KV cache decoder model, running the sample fails with an error when loading the model.

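For context, this is roughly how the sample loads the exported model; a minimal sketch assuming the openvino-genai Whisper sample flow, a 16 kHz mono WAV file, and illustrative paths:

    import librosa
    import openvino_genai

    # Load audio at 16 kHz, which is what the Whisper pipeline expects
    raw_speech, _ = librosa.load("sample.wav", sr=16000)

    # Loading the exported folder on NPU is the step that fails when the
    # KV cache (with-past) decoder model is missing from the export
    pipe = openvino_genai.WhisperPipeline("whisper-base", "NPU")

    result = pipe.generate(raw_speech.tolist())
    print(result)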

The following change to the export command solves this issue, as it also generates the KV cache decoder:

optimum-cli export openvino --model openai/whisper-base whisper-base --task automatic-speech-recognition-with-past --disable-stateful

NOTE
Exporting without the --disable-stateful flag was also tried, but it did not produce the KV cache decoder.
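A quick way to confirm the corrected export is to check that the output folder contains a separate decoder-with-past IR alongside the encoder and decoder. The file names below assume optimum-intel's default naming for non-stateful seq2seq exports:

    ls whisper-base
    # expected (among other files):
    #   openvino_encoder_model.xml / .bin
    #   openvino_decoder_model.xml / .bin
    #   openvino_decoder_with_past_model.xml / .bin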

As mentioned in #1726, --trust-remote-code is also not needed.

Environment:

  • transformers 4.46.2
  • openvino 2025.0.0
  • openvino-genai 2025.0.0.0
  • openvino-telemetry 2024.1.0
  • openvino-tokenizers 2025.0.0.0
  • optimum 1.24.0
  • optimum-intel 1.23.0.dev0+dd4fe68
