Skip to content

Commit e56d47a

Browse files
authored
Fix model selection for non-English (#4362)
* Fix model selection for language * Fix model selection for non-English
1 parent cf1a3e5 commit e56d47a

File tree

5 files changed

+13
-7
lines changed

5 files changed

+13
-7
lines changed

whisper/CHANGELOG.md

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,11 @@
11
# Changelog
22

3+
## 3.1.0
4+
5+
- Fix model selection for language
6+
- Prefer Parakeet only for English (detection fails for other languages)
7+
- Add missing `onnx_asr` dependency
8+
39
## 3.0.1
410

511
- Add support for `sherpa-onnx` and Nvidia's parakeet model

whisper/Dockerfile

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -22,6 +22,7 @@ RUN \
2222
"wyoming[zeroconf]==1.8.0" \
2323
"wyoming-faster-whisper[sherpa] @ https://github.com/rhasspy/wyoming-faster-whisper/archive/refs/tags/v${WYOMING_WHISPER_VERSION}.tar.gz" \
2424
'transformers==4.52.4' \
25+
'onnx-asr[cpu,hub]==0.7.0' \
2526
\
2627
&& pip3 install --no-cache-dir \
2728
--index-url 'https://download.pytorch.org/whl/cpu' \

whisper/build.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -3,4 +3,4 @@ build_from:
33
amd64: ghcr.io/home-assistant/amd64-base-debian:bookworm
44
aarch64: ghcr.io/home-assistant/aarch64-base-debian:bookworm
55
args:
6-
WYOMING_WHISPER_VERSION: 3.0.1
6+
WYOMING_WHISPER_VERSION: 3.1.0

whisper/config.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
---
2-
version: 3.0.1
2+
version: 3.1.0
33
slug: whisper
44
name: Whisper
55
description: Speech-to-text with Whisper

whisper/translations/en.yaml

Lines changed: 4 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -3,18 +3,17 @@ configuration:
33
beam_size:
44
name: Beam size
55
description: >-
6-
Number of candidates to consider simultaneously during transcription.
7-
Increasing the beam size will increase accuracy at the cost of
6+
Number of candidates to consider simultaneously during transcription (0 =
7+
auto). Increasing the beam size will increase accuracy at the cost of
88
performance.
99
language:
1010
name: Language
1111
description: >-
12-
Language that you will speak to the app. If you select "auto",
13-
the model will run much slower but will auto-detect the spoken language.
12+
Language to pre-load model for during start-up.
1413
model:
1514
name: Model
1615
description: |
17-
Whisper model that will be used for transcription.
16+
Whisper model that will be used for transcription (faster-whisper only).
1817
1918
The default model is `tiny-int8`, a compressed version of the smallest
2019
Whisper model which is able to run on a Raspberry Pi 4. Compressed models

0 commit comments

Comments
 (0)