|
| 1 | +# Automatic Speech Recognition |
| 2 | + |
| 3 | +## Requirements |
| 4 | + |
| 5 | +### Validated Hardware Requirements |
| 6 | +- **CPU:** 13th generation Intel Core processors or newer |
| 7 | +- **GPU:** Intel® Arc™ graphics |
| 8 | +- **RAM:** 32GB (may vary based on model size) |
| 9 | +- **Disk:** 128GB (may vary based on model size) |
| 10 | + |
| 11 | +### Supported Inference Device |
| 12 | +* CPU |
| 13 | +* GPU |
| 14 | +* NPU |
| 15 | + |
| 16 | +## Quick Start |
| 17 | +### 1. Install Operating System |
| 18 | +Install the latest [Ubuntu 22.04 LTS Desktop](https://releases.ubuntu.com/jammy/). Refer to the [Ubuntu Desktop installation tutorial](https://ubuntu.com/tutorials/install-ubuntu-desktop#1-overview) if needed. |
| 19 | + |
| 20 | +### 2. Install GPU Driver (Optional) |
| 21 | +If you plan to use a GPU for inference, install the appropriate GPU driver: |
| 22 | +- **Intel® Arc™ A-Series Graphics:** [Installation Guide](https://github.com/intel/edge-developer-kit-reference-scripts/tree/main/gpu/arc/dg2) |
| 23 | +- **Intel® Data Center GPU Flex Series:** [Installation Guide](https://github.com/intel/edge-developer-kit-reference-scripts/tree/main/gpu/flex/ats) |
| 24 | + |
| 25 | +### 3. Set Up Docker |
| 26 | +Follow the instructions [here](https://docs.docker.com/engine/install/) to install Docker and Docker Compose. |
| 27 | + |
| 28 | +### 4. Build the Automatic Speech Recognition Docker Image |
| 29 | +```bash |
| 30 | +docker build -t automatic-speech-recognition . |
| 31 | +``` |
| 32 | + |
| 33 | +### 5. Run the Automatic Speech Recognition container |
| 34 | +* **CPU** |
| 35 | +```bash |
| 36 | +docker run -it --rm \ |
| 37 | + -p 5996:5996 \ |
| 38 | + -e DEFAULT_MODEL_ID=openai/whisper-tiny \ |
| 39 | + -e STT_DEVICE=CPU \ |
| 40 | + -v ./data:/usr/src/app/data \ |
| 41 | + automatic-speech-recognition |
| 42 | +``` |
| 43 | + |
| 44 | +* **GPU** |
| 45 | +```bash |
| 46 | +export RENDER_GROUP_ID=$(getent group render | cut -d: -f3) |
| 47 | +docker run -it --rm \ |
| 48 | + --group-add $RENDER_GROUP_ID \ |
| 49 | + --device /dev/dri:/dev/dri \ |
| 50 | + -p 5996:5996 \ |
| 51 | + -e DEFAULT_MODEL_ID=openai/whisper-tiny \ |
| 52 | + -e STT_DEVICE=GPU \ |
| 53 | + -v ./data:/usr/src/app/data \ |
| 54 | + automatic-speech-recognition |
| 55 | +``` |
0 commit comments