You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
-[To Set Up the Project Locally](#to-set-up-the-project-locally)
88
+
-[Notes](#notes)
89
89
-[Usage](#usage)
90
90
-[Transcribe From](#transcribe-from)
91
91
-[Save Transcription](#save-transcription)
@@ -304,49 +304,45 @@ You can also choose the theme you like best. It can be dark, light, or the one c
304
304
305
305
## Getting Started
306
306
307
-
### Notes
308
-
- You cannot generate a single executable file for this project with PyInstaller due to the dependency with the CustomTkinter package (reason [here](https://github.com/TomSchimansky/CustomTkinter/wiki/Packaging)).
309
-
- For **Mac computers with Apple silicon**: An error occurs when trying to install the `pyaudio` package. [Here](https://stackoverflow.com/questions/73268630/error-could-not-build-wheels-for-pyaudio-which-is-required-to-install-pyprojec) is a StackOverflow post explaining how to solve this issue.
310
-
- You need to install [FFmpeg](https://ffmpeg.org) to execute the program. Otherwise, it won't be able to process the audio files.
311
-
312
-
To check if you have it installed on your system, run `ffmpeg -version`. It should return something similar to this:
313
-
```
314
-
ffmpeg version 5.1.2-essentials_build-www.gyan.dev Copyright (c) 2000-2022 the FFmpeg developers
315
-
built with gcc 12.1.0 (Rev2, Built by MSYS2 project)
If the output is an error, it is because your system cannot find the `ffmpeg` system variable, which is probably because you don't have it installed on your system. To install `ffmpeg`, open a command prompt and run one of the following commands, depending on your operating system:
327
-
```
328
-
# on Ubuntu or Debian
329
-
sudo apt update && sudo apt install ffmpeg
330
-
331
-
# on Arch Linux
332
-
sudo pacman -S ffmpeg
333
-
334
-
# on MacOS using Homebrew (https://brew.sh/)
335
-
brew install ffmpeg
336
-
337
-
# on Windows using Chocolatey (https://chocolatey.org/)
338
-
choco install ffmpeg
339
-
340
-
# on Windows using Scoop (https://scoop.sh/)
341
-
scoop install ffmpeg
342
-
```
343
-
344
-
### To Run the Program
345
-
1. Go to [releases](https://github.com/HenestrosaDev/audiotext/releases).
346
-
2. Download the latest release.
307
+
### Installation
308
+
309
+
1. Install [FFmpeg](https://ffmpeg.org) to execute the program. Otherwise, it won't be able to process the audio files.
310
+
311
+
To check if you have it installed on your system, run `ffmpeg -version`. It should return something similar to this:
312
+
```
313
+
ffmpeg version 5.1.2-essentials_build-www.gyan.dev Copyright (c) 2000-2022 the FFmpeg developers
314
+
built with gcc 12.1.0 (Rev2, Built by MSYS2 project)
If the output is an error, it is because your system cannot find the `ffmpeg` system variable, which is probably because you don't have it installed on your system. To install `ffmpeg`, open a command prompt and run one of the following commands, depending on your operating system:
326
+
```
327
+
# on Ubuntu or Debian
328
+
sudo apt update && sudo apt install ffmpeg
329
+
330
+
# on Arch Linux
331
+
sudo pacman -S ffmpeg
332
+
333
+
# on MacOS using Homebrew (https://brew.sh/)
334
+
brew install ffmpeg
335
+
336
+
# on Windows using Chocolatey (https://chocolatey.org/)
337
+
choco install ffmpeg
338
+
339
+
# on Windows using Scoop (https://scoop.sh/)
340
+
scoop install ffmpeg
341
+
```
342
+
343
+
2. Go to [releases](https://github.com/HenestrosaDev/audiotext/releases) and download the latest.
347
344
3. Decompress the downloaded file.
348
-
4. Open the `audiotext` folder.
349
-
5. Double-click the `Audiotext` executable file (`.exe` for Windows and `.app` for macOS).
345
+
4. Open the `audiotext` folder and double-click the `Audiotext` executable file (`.exe` for Windows and `.app` for macOS).
350
346
351
347
### To Set Up the Project Locally
352
348
1. Clone the repository by running `git clone https://github.com/HenestrosaDev/audiotext.git`.
@@ -365,6 +361,10 @@ You can also choose the theme you like best. It can be dark, light, or the one c
365
361
366
362
>**IMPORTANT NOTE**: I had to comment out the lines `pprint(response_text, indent=4)` in the `recognize_google` function from the `__init__.py` file of the `SpeechRecognition` package to avoid opening a command line along with the GUI. Otherwise, the program would not be able to use the Google API transcription method because `pprint` throws an error if it cannot print to the CLI, preventing the code from generating the transcription. The same applies to the lines using the `logger` package in the `moviepy/audio/io/ffmpeg_audiowriter` file from the `moviepy` package. There is also a change in the line 169. `logger=logger` has been changed to `logger=None` to avoid more errors related to opening the console.
367
363
364
+
### Notes
365
+
- You cannot generate a single executable file for this project with PyInstaller due to the dependency with the CustomTkinter package (reason [here](https://github.com/TomSchimansky/CustomTkinter/wiki/Packaging)).
366
+
- For **Mac computers with Apple silicon**: An error occurs when trying to install the `pyaudio` package. [Here](https://stackoverflow.com/questions/73268630/error-could-not-build-wheels-for-pyaudio-which-is-required-to-install-pyprojec) is a StackOverflow post explaining how to solve this issue.
367
+
368
368
<palign="right">(<ahref="#top">back to top</a>)</p>
369
369
370
370
<!-- USAGE -->
@@ -394,76 +394,76 @@ You can transcribe from three audio sources:
394
394
395
395
-**File** (see image above): Click on the file explorer icon to select the file you want to transcribe. You can also manually enter the path to the file into the input field. You can transcribe audio from both audio and video files. Note that the file explorer has the `All supported files` option selected by default. To select only audio files or video files, click the combo box in the lower right corner of the file explorer to change the file type, as marked in red in the following image:
396
396
397
-

398
-
399
-

400
-
401
-
<details>
402
-
<summary>Supported audio file formats</summary>
403
-
404
-
- `.mp3`
405
-
- `.mpeg`
406
-
- `.wav`
407
-
- `.wma`
408
-
- `.aac`
409
-
- `.flac`
410
-
- `.ogg`
411
-
- `.oga`
412
-
- `.opus`
413
-
</details>
414
-
415
-
<details>
416
-
<summary>Supported video file formats</summary>
417
-
418
-
- `.mp4`
419
-
- `.m4a`
420
-
- `.m4v`
421
-
- `.f4v`
422
-
- `.f4a`
423
-
- `.m4b`
424
-
- `.m4r`
425
-
- `.f4b`
426
-
- `.mov`
427
-
- `.avi`
428
-
- `.webm`
429
-
- `.flv`
430
-
- `.mkv`
431
-
- `.3gp`
432
-
- `.3gp2`
433
-
- `.3g2`
434
-
- `.3gpp`
435
-
- `.3gpp2`
436
-
- `.ogv`
437
-
- `.ogx`
438
-
- `.wmv`
439
-
- `.asf`
440
-
</details>
397
+

398
+
399
+

400
+
401
+
<details>
402
+
<summary>Supported audio file formats</summary>
403
+
404
+
-`.mp3`
405
+
-`.mpeg`
406
+
-`.wav`
407
+
-`.wma`
408
+
-`.aac`
409
+
-`.flac`
410
+
-`.ogg`
411
+
-`.oga`
412
+
-`.opus`
413
+
</details>
414
+
415
+
<details>
416
+
<summary>Supported video file formats</summary>
417
+
418
+
-`.mp4`
419
+
-`.m4a`
420
+
-`.m4v`
421
+
-`.f4v`
422
+
-`.f4a`
423
+
-`.m4b`
424
+
-`.m4r`
425
+
-`.f4b`
426
+
-`.mov`
427
+
-`.avi`
428
+
-`.webm`
429
+
-`.flv`
430
+
-`.mkv`
431
+
-`.3gp`
432
+
-`.3gp2`
433
+
-`.3g2`
434
+
-`.3gpp`
435
+
-`.3gpp2`
436
+
-`.ogv`
437
+
-`.ogx`
438
+
-`.wmv`
439
+
-`.asf`
440
+
</details>
441
441
442
442
-**Microphone**: To start recording, simply click the `Start recording` button to begin the process. The text of the button will change to `Stop recording` and its color will change to red. Click it to stop recording and generate the transcription.
443
443
444
-
Note that your operating system must recognize an input source, otherwise an error will appear in the text box indicating that no input source was detected.
444
+
Note that your operating system must recognize an input source, otherwise an error will appear in the text box indicating that no input source was detected.
0 commit comments