@@ -16,7 +16,7 @@ You can choose one way from medium and hard to install paddlespeech.
### 2. Prepare config File
The configuration file can be found in `conf/tts_online_application.yaml`.
-
- `protocol` indicates the network protocol used by the streaming TTS service. Currently, both http and websocket are supported.
+
- `protocol` indicates the network protocol used by the streaming TTS service. Currently, both **http and websocket** are supported.
- `engine_list` indicates the speech engines that the service will start, each in the format `<speech task>_<engine type>`.
    - This demo mainly introduces the streaming speech synthesis service, so the speech task should be set to `tts`.
    - The engine type supports two forms: **online** and **online-onnx**. `online` indicates an engine that uses Python for dynamic graph inference; `online-onnx` indicates an engine that uses onnxruntime for inference. Of the two, `online-onnx` has the faster inference speed.
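
The `<speech task>_<engine type>` format described above can be sketched as a small parser. This is an illustration only: `parse_engine_entry`, `EngineSpec`, and `KNOWN_ENGINE_TYPES` are hypothetical names, not part of PaddleSpeech, and the set of accepted engine types is assumed from the two forms listed in the text.

```python
from typing import NamedTuple

# Engine types named in the text above. Treating this as the complete set
# is an assumption made for this sketch.
KNOWN_ENGINE_TYPES = {"online", "online-onnx"}


class EngineSpec(NamedTuple):
    task: str
    engine_type: str


def parse_engine_entry(entry: str) -> EngineSpec:
    """Split an engine_list entry of the form '<speech task>_<engine type>'.

    Hypothetical helper: it splits at the first underscore, so the engine
    type may itself contain hyphens (for example 'online-onnx').
    """
    task, sep, engine_type = entry.partition("_")
    if not sep or not task or not engine_type:
        raise ValueError(f"expected '<speech task>_<engine type>', got {entry!r}")
    if engine_type not in KNOWN_ENGINE_TYPES:
        raise ValueError(f"unknown engine type {engine_type!r}")
    return EngineSpec(task, engine_type)
```

Under these assumptions, `parse_engine_entry("tts_online-onnx")` yields the task `tts` and the engine type `online-onnx`, while an entry without an underscore raises `ValueError`.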
@@ -31,12 +31,12 @@ The configuration file can be found in `conf/tts_online_application.yaml`.
@@ -122,7 +120,6 @@ The configuration file can be found in `conf/tts_online_application.yaml`.
- `sample_rate`: Sampling rate, choices: [0, 8000, 16000]. Default: 0, which uses the model's own sampling rate.
- `output`: Output wave file path. Default: None, which means the audio is not saved locally.
- `play`: Whether to play the audio while it is being synthesized. Default: False, which means no playback. **Playing audio requires the pyaudio library**.
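
As a rough illustration of the constraints on these three options, the sketch below checks them before a request would be sent. `check_audio_options` is a hypothetical helper, not a PaddleSpeech function; it only encodes the choices and defaults listed above and uses the standard library to test whether `pyaudio` is importable.

```python
import importlib.util
from typing import Optional

# Choices taken from the option list above.
ALLOWED_SAMPLE_RATES = (0, 8000, 16000)


def check_audio_options(sample_rate: int = 0,
                        output: Optional[str] = None,
                        play: bool = False) -> dict:
    """Validate sample_rate, output, and play (hypothetical helper).

    Defaults mirror the documented ones: 0, None, and False.
    """
    if sample_rate not in ALLOWED_SAMPLE_RATES:
        raise ValueError(
            f"sample_rate must be one of {ALLOWED_SAMPLE_RATES}, got {sample_rate}"
        )
    # Playback is documented as depending on pyaudio, so fail early if the
    # package cannot be found in the current environment.
    if play and importlib.util.find_spec("pyaudio") is None:
        raise RuntimeError("play=True requires the pyaudio library")
    return {"sample_rate": sample_rate, "output": output, "play": play}
```

Calling `check_audio_options(sample_rate=16000, output="out.wav")` returns the options unchanged, whereas an unsupported rate such as 44100 raises `ValueError`.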
-
Output:
```bash
@@ -165,8 +162,144 @@ The configuration file can be found in `conf/tts_online_application.yaml`.
- `protocol`: Service protocol, choices: [http, websocket], default: http.
+
- `input` (required): Input text to synthesize.
+
- `spk_id`: Speaker id for multi-speaker text to speech. Default: 0
+
- `speed`: Audio speed; the value should be between 0 and 3. Default: 1.0
+
- `volume`: Audio volume; the value should be between 0 and 3. Default: 1.0
+
- `sample_rate`: Sampling rate, choices: [0, 8000, 16000]. Default: 0, which uses the model's own sampling rate.
+
- `output`: Output wave file path. Default: None, which means the audio is not saved locally.
+
- `play`: Whether to play the audio while it is being synthesized. Default: False, which means no playback. **Playing audio requires the pyaudio library**.