Skip to content

Commit 4426c0d

Browse files
authored
Merge pull request #91 from intel/update-branch
feat: update Dockerfile and requirements for speech-to-text microservice to use OpenVINO 2024.6.0; add README for setup instructions (#279)
2 parents 2bf4055 + df90b86 commit 4426c0d

File tree

3 files changed

+60
-5
lines changed

3 files changed

+60
-5
lines changed

usecases/ai/microservices/speech-to-text/Dockerfile

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
# Copyright (C) 2024 Intel Corporation
22
# SPDX-License-Identifier: Apache-2.0
33

4-
FROM openvino/ubuntu22_dev:2024.5.0
4+
FROM openvino/ubuntu22_dev:2024.6.0
55

66
USER root
77

Lines changed: 55 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,55 @@
1+
# Automatic Speech Recognition
2+
3+
## Requirements
4+
5+
### Validated Hardware Requirements
6+
- **CPU:** 13th generation Intel Core processors or newer
7+
- **GPU:** Intel® Arc™ graphics
8+
- **RAM:** 32GB (may vary based on model size)
9+
- **Disk:** 128GB (may vary based on model size)
10+
11+
### Supported Inference Device
12+
* CPU
13+
* GPU
14+
* NPU
15+
16+
## Quick Start
17+
### 1. Install Operating System
18+
Install the latest [Ubuntu 22.04 LTS Desktop](https://releases.ubuntu.com/jammy/). Refer to the [Ubuntu Desktop installation tutorial](https://ubuntu.com/tutorials/install-ubuntu-desktop#1-overview) if needed.
19+
20+
### 2. Install GPU Driver (Optional)
21+
If you plan to use a GPU for inference, install the appropriate GPU driver:
22+
- **Intel® Arc™ A-Series Graphics:** [Installation Guide](https://github.com/intel/edge-developer-kit-reference-scripts/tree/main/gpu/arc/dg2)
23+
- **Intel® Data Center GPU Flex Series:** [Installation Guide](https://github.com/intel/edge-developer-kit-reference-scripts/tree/main/gpu/flex/ats)
24+
25+
### 3. Set Up Docker
26+
Follow the instructions [here](https://docs.docker.com/engine/install/) to install Docker and Docker Compose.
27+
28+
### 4. Build the Automatic Speech Recognition Docker Image
29+
```bash
30+
docker build -t automatic-speech-recognition .
31+
```
32+
33+
### 5. Run the Automatic Speech Recognition container
34+
* **CPU**
35+
```bash
36+
docker run -it --rm \
37+
-p 5996:5996 \
38+
-e DEFAULT_MODEL_ID=openai/whisper-tiny \
39+
-e STT_DEVICE=CPU \
40+
-v ./data:/usr/src/app/data \
41+
automatic-speech-recognition
42+
```
43+
44+
* **GPU**
45+
```bash
46+
export RENDER_GROUP_ID=$(getent group render | cut -d: -f3)
47+
docker run -it --rm \
48+
--group-add $RENDER_GROUP_ID \
49+
--device /dev/dri:/dev/dri \
50+
-p 5996:5996 \
51+
-e DEFAULT_MODEL_ID=openai/whisper-tiny \
52+
-e STT_DEVICE=GPU \
53+
-v ./data:/usr/src/app/data \
54+
automatic-speech-recognition
55+
```
Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1,9 +1,9 @@
11
fastapi[all]==0.115.6
22
uvicorn==0.32.1
3-
43
soundfile==0.12.1
54
pydub==0.25.1
65

7-
openvino==2024.5.0
8-
openvino_genai==2024.5.0
9-
optimum-intel[openvino,nncf]==1.21.0
6+
--extra-index-url https://download.pytorch.org/whl/cpu
7+
optimum-intel[openvino,nncf]==1.21.0
8+
openvino==2024.6.0
9+
openvino_genai==2024.6.0

0 commit comments

Comments
 (0)