Skip to content

Commit 1ac9f94

Browse files
committed
Add speech and fixes
1 parent 03a33ca commit 1ac9f94

File tree

2 files changed

+47
-2
lines changed

2 files changed

+47
-2
lines changed

MyApp/_pages/ai-server/comfy-extension.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -49,13 +49,13 @@ The `.env` file is used to configure the ComfyUI extension during the initial se
4949

5050
The keys available in the `.env` file are:
5151

52-
- **DEFAULT_MODELS**: Comma-separated list of models to load on startup. This will be used to automatically download the models and their related dependencies. The full list of options can be found on your AI Server at `/lib/data/ai-models.json`.
52+
- **DEFAULT_MODELS**: Comma-separated list of models to load on startup. This will be used to automatically download the models and their related dependencies. The full list of options can be found on your AI Server at `/lib/data/media-models.json`.
5353
- **API_KEY**: This is the API key that will be used by your AI Server to authenticate with the ComfyUI. If not provided, there will be no authentication required to access your ComfyUI instance.
5454
- **HF_TOKEN**: This is the Hugging Face token that will be used to authenticate with the Hugging Face API when trying to download models. If not provided, models requiring Hugging Face authentication like those with user agreements will not be downloaded.
5555
- **CIVITAI_TOKEN**: This is the Civitai API key that will be used to authenticate with the Civitai API when trying to download models. If not provided, models requiring Civitai authentication like those with user agreements will not be downloaded.
5656

5757
::: info
58-
Models requiring authentication to download are also flagged in the `/lib/data/ai-models.json` file.
58+
Models requiring authentication to download are also flagged in the `/lib/data/media-models.json` file.
5959
:::
6060

6161
### Accessing the ComfyUI Extension
Lines changed: 45 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,45 @@
1+
---
2+
title: Speech Endpoints
3+
description: Processing speech with AI Server
4+
---
5+
6+
AI Server provides endpoints for speech-related tasks, including Speech-to-Text and Text-to-Speech conversions. These endpoints utilize AI models to process audio and text data.
7+
8+
The following tasks are available for speech processing:
9+
10+
- **Speech to Text**: Convert audio input to text output.
11+
- **Text to Speech**: Convert text input to audio output.
12+
13+
## Using Speech Endpoints
14+
15+
These endpoints are used in a similar way to other AI Server endpoints. You can provide a RefId and Tag to help categorize the request, and for Queue requests, you can provide a ReplyTo URL to send a POST request to when the request is complete.
16+
17+
### Speech to Text {#speech-to-text}
18+
19+
The Speech to Text endpoint converts audio input into text. It provides two types of output:
20+
21+
1. Text with timestamps: JSON format with `start` and `end` timestamps for each segment.
22+
2. Plain text: The full transcription without timestamps.
23+
24+
These outputs are returned in the `TextOutputs` array, where the JSON will need to be parsed to extract the text and timestamps.
25+
26+
::include ai-server/cs/speech-to-text-1.cs.md::
27+
28+
### Queue Speech to Text {#queue-speech-to-text}
29+
30+
For longer audio files or when you want to process the request asynchronously, you can use the Queue Speech to Text endpoint.
31+
32+
::include ai-server/cs/queue-speech-to-text-1.cs.md::
33+
34+
### Text to Speech {#text-to-speech}
35+
36+
The Text to Speech endpoint converts text input into audio output.
37+
38+
::include ai-server/cs/text-to-speech-1.cs.md::
39+
40+
### Queue Text to Speech {#queue-text-to-speech}
41+
42+
For generating longer audio files or when you want to process the request asynchronously, you can use the Queue Text to Speech endpoint.
43+
44+
::include ai-server/cs/queue-text-to-speech-1.cs.md::
45+

0 commit comments

Comments
 (0)