You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@@ -10,8 +10,18 @@ The solution combines two powerful OCI services:
10
10
11
11
This automated approach significantly reduces the time and effort required to create multilingual subtitles, making content more accessible to a global audience.
12
12
13
-
PUT IMAGE HERE
14
-
join oracle profile
13
+
## Features
14
+
15
+
-**Flexible Input Sources**: Accept both local audio files (MP3, WAV, etc.) and files already stored in OCI Object Storage
16
+
-**Multiple Output Options**: Store generated SRT files locally, in Object Storage, or both
17
+
-**Complete Workflow**: Single command to transcribe audio and translate to multiple languages
18
+
-**Standalone Scripts**: Individual scripts for transcription-only or translation-only workflows
19
+
-**Translation Methods**:
20
+
- Synchronous translation for smaller files (subtitle-by-subtitle)
21
+
- Batch translation for larger files (up to 20MB)
22
+
-**Language Support**: 30+ supported languages for translation
23
+
-**Configurable**: Comprehensive YAML configuration with sensible defaults
3. Copy the example configuration and update with your settings:
71
+
```bash
72
+
cp config_example.yaml config.yaml
73
+
# Edit config.yaml with your OCI details
71
74
```
72
75
73
76
## 2. Usage
74
77
75
-
> Before running the script, make sure your input `.mp3` file has already been uploaded to the OCI Object Storage **input bucket** defined in your `config.yaml`.
76
-
> The script does **not** accept local files it looks for the file in the cloud bucket only.
78
+
The solution provides three main ways to use it:
77
79
78
-
This solution works in two steps:
80
+
### Option 1: Complete Workflow (Recommended)
79
81
80
-
1. First, we generate SRT from audio:
82
+
Use the main workflow script to transcribe audio and translate in one command:
language_code: "en-US"# Default transcription language
139
+
```
140
+
141
+
### Output Configuration
142
+
```yaml
143
+
output:
144
+
storage_type: "both"# "local", "object_storage", or "both"
145
+
local_directory: "./output"
146
+
object_storage_prefix: "translations"
147
+
```
148
+
149
+
### Translation Configuration
150
+
```yaml
151
+
translation:
152
+
target_languages:
153
+
- "es"# Spanish
154
+
- "fr"# French
155
+
- "de"# German
156
+
method: "batch"# "batch" or "sync"
157
+
```
158
+
159
+
## 4. Supported Languages
160
+
161
+
### Speech-to-Text (Transcription)
162
+
163
+
The following language codes are supported for audio transcription:
164
+
165
+
| Language | Code |
166
+
|----------|------|
167
+
| US English | en-US |
168
+
| British English | en-GB |
169
+
| Australian English | en-AU |
170
+
| Indian English | en-IN |
171
+
| Spanish (Spain) | es-ES |
172
+
| Brazilian Portuguese | pt-BR |
173
+
| Hindi (India) | hi-IN |
174
+
| French (France) | fr-FR |
175
+
| German (Germany) | de-DE |
176
+
| Italian (Italy) | it-IT |
91
177
92
-
## Annex: Supported Languages
178
+
### Translation
93
179
94
180
The solution supports translation to the following languages:
95
181
@@ -129,6 +215,69 @@ The solution supports translation to the following languages:
129
215
130
216
For an updated list of supported languages, refer to [the OCI Documentation](https://docs.oracle.com/en-us/iaas/language/using/translate.htm#supported-langs).
0 commit comments