You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: demos/streaming_tts_server/README.md
+41-56Lines changed: 41 additions & 56 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -9,7 +9,6 @@ For service interface definition, please check:
9
9
-[PaddleSpeech Server RESTful API](https://github.com/PaddlePaddle/PaddleSpeech/wiki/PaddleSpeech-Server-RESTful-API)
10
10
-[PaddleSpeech Streaming Server WebSocket API](https://github.com/PaddlePaddle/PaddleSpeech/wiki/PaddleSpeech-Server-WebSocket-API)
11
11
12
-
13
12
## Usage
14
13
### 1. Installation
15
14
see [installation](https://github.com/PaddlePaddle/PaddleSpeech/blob/develop/docs/source/install.md).
@@ -34,11 +33,10 @@ The configuration file can be found in `conf/tts_online_application.yaml`.
34
33
- Both hifigan and mb_melgan support streaming voc inference.
35
34
- When the voc model is mb_melgan, when voc_pad=14, the synthetic audio for streaming inference is consistent with the non-streaming synthetic audio; the minimum voc_pad can be set to 7, and the synthetic audio has no abnormal hearing. If the voc_pad is less than 7, the synthetic audio sounds abnormal.
36
35
- When the voc model is hifigan, when voc_pad=19, the streaming inference synthetic audio is consistent with the non-streaming synthetic audio; when voc_pad=14, the synthetic audio has no abnormal hearing.
36
+
- Pad calculation method of streaming vocoder in PaddleSpeech: [AIStudio tutorial](https://aistudio.baidu.com/aistudio/projectdetail/4151335)
-**Note:** If the service can be started normally in the container, but the client access IP is unreachable, you can try to replace the `host` address in the configuration file with the local IP address.
39
39
40
-
41
-
42
40
### 3. Streaming speech synthesis server and client using http protocol
43
41
#### 3.1 Server Usage
44
42
- Command Line (Recommended)
@@ -58,7 +56,7 @@ The configuration file can be found in `conf/tts_online_application.yaml`.
INFO: Uvicorn running on http://0.0.0.0:8092 (Press CTRL+C to quit)
100
98
[2022-04-24 21:00:17] [INFO] [server.py:211] Uvicorn running on http://0.0.0.0:8092 (Press CTRL+C to quit)
101
-
102
-
103
99
```
104
100
105
101
#### 3.2 Streaming TTS client Usage
@@ -130,7 +126,7 @@ The configuration file can be found in `conf/tts_online_application.yaml`.
130
126
- Currently, only the single-speaker model is supported in the code, so `spk_id` does not take effect. Streaming TTS does not support changing sample rate, variable speed and volume.
0 commit comments