Commit fb77429

Merge pull request #23 from IAHispano/docs-overhaul
docs: Complete overhaul of Astro Starlight documentation
2 parents 4cf411e + bfc21c2

File tree

18 files changed

+850
-1060
lines changed

Lines changed: 31 additions & 19 deletions
@@ -1,34 +1,46 @@
 ---
-title: Audio Analyzer
-description: Audio Analyzer is a tool designed to obtain detailed information about audio files.
+title: "Audio Analyzer"
+description: "Learn how to use the Audio Analyzer to get detailed information about your audio files."
 ---
 
-![Audio Analyzer Interface](/images/audio-analyzer.png)
+import { Aside, Steps } from '@astrojs/starlight/components';
 
-## On what kind of occasion can audio analyzer be useful?
+The **Audio Analyzer** is a powerful tool that provides detailed information about your audio files, including sample rate, frequency distribution, and more. This information is crucial for training high-quality voice models.
 
-If you want to perform a training session correctly, it is advisable to know the frequency (Sample Rate) of the audio that is being used. Currently applio is compatible and has pretraineds in `32k, 40k and 48k`, these values refer to the hertz rate at which the pretraineds are created to use (32000hz, 40000hz, 48000hz). This clearly means that you will have to use audio in the mentioned frequencies to have an adequate result, especially when you have clean and quality audio.
-- You can observe the audio frequency in a reliable software such as audacity, fl studio, [Spek](https://github.com/alexkay/spek/releases/download/v0.8.5/spek-0.8.5-beta.zip) etc, But if you need to have more precise details about it, use the tool.
+![The Audio Analyzer interface in Applio, showing the audio upload section.](/images/audio-analyzer.png)
 
-## Use Audio Analyzer Tool
+## Why is the Sample Rate Important?
 
-### Upload your Audio
-To proceed to use the Analyzer tool, go to the extra section, upload your audio, and click "get information about audio".
+Applio's pre-trained models are available in three sample rates: **32k**, **40k**, and **48k** (corresponding to 32,000 Hz, 40,000 Hz, and 48,000 Hz). For the best training results, the sample rate of your dataset should match the sample rate of the pre-trained model you are using.
 
-### Check the information given
+While you can check the sample rate in audio editors like Audacity, the Audio Analyzer provides a more detailed analysis of your audio's frequency content.
 
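The sample-rate check the new text recommends can also be scripted outside Applio. As a minimal sketch (not part of Applio; the function name is illustrative, and Python's standard-library `wave` module handles WAV files only):

```python
import wave

def get_sample_rate(path: str) -> int:
    """Return the sample rate (Hz) of a WAV file."""
    with wave.open(path, "rb") as wav:
        return wav.getframerate()
```

A dataset recorded at 48,000 Hz should report 48000, matching the 48k pre-trained model.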
-When you get your audio you will see several information about it
+## How to Use the Audio Analyzer
 
-![Audio Analyzer Result](/images/audio-analyzer-result.png)
+<Steps>
+1. Navigate to the **Extras** tab in Applio.
+2. Upload your audio file using the **Upload Audio** box.
+3. Click the **Get Information About Audio** button.
+</Steps>
 
-## How these graphs can help
+Once the analysis is complete, you will see a detailed breakdown of your audio file, including a spectrogram and several spectral feature graphs.
 
-These three graphs provide valuable information about the audio you are about to use, allowing you to fine-tune your settings prior to training for optimal results, like the Spectrogram and Spectral Features, these provide crucial information. The spectrogram displays the full set of frequencies present in the audio, allowing you to identify unwanted sounds, such as background noise or unwanted frequencies.
+![The results of an audio analysis in Applio, showing the spectrogram and spectral feature graphs.](/images/audio-analyzer-result.png)
 
-This also applies to the Spectral Features, with the three data thresholds that are provided, a lot of information is shared about the audio, which helps to further examine its characteristics in low, mid and high frequencies, for example;
+## Understanding the Graphs
 
-- **Spectral Centroid:** This graph shows both low and high frequencies within the audio, _represented as mentioned_, in the graph the higher frequencies will be upwards, while the lower frequencies will be downwards.
-- **Spectral Bandwidth:** This can somewhat represent the "variety of things (in this case taking context of the general content of the audio)", normally this would not take on much importance except for convert something other than a voice.
-- **Spectral Rolloff:** Basically, the rolloff takes all the audio context of the above-mentioned graphs under a specific volume threshold (in this case of the audio).
+The graphs provided by the Audio Analyzer can help you identify issues with your audio and fine-tune your training settings for optimal results.
 
-Finally, you can also get the frequency using the audio analyzer, both for the spectrogram section as the Spectral Features, in the spectrogram you can observe the values and duplicate them, as with the Spectral Features, the numbers shown around the graph will help determine the frequency.
+### Spectrogram
+
+The spectrogram is a visual representation of the frequencies in your audio over time. It can help you identify unwanted noise, such as background hiss or electrical hum, which you can then remove using an audio editor.
+
+### Spectral Features
+
+The spectral feature graphs provide a more detailed look at the frequency content of your audio.
+
+- **Spectral Centroid:** This graph represents the "center of mass" of the spectrum. A higher spectral centroid indicates that the audio has more high-frequency content, while a lower spectral centroid indicates more low-frequency content. This can help you understand the overall brightness or darkness of the audio.
+- **Spectral Bandwidth:** This graph shows the range of frequencies in the audio. A wider bandwidth indicates a more complex sound with a wider range of frequencies, while a narrower bandwidth indicates a simpler sound.
+- **Spectral Rolloff:** This graph shows the frequency below which a certain percentage of the total spectral energy lies. It's another way to measure the "skewness" of the spectral distribution and can be useful for distinguishing between different types of sounds.
+
+By understanding these graphs, you can make more informed decisions about your audio processing and training settings, leading to better voice models.
Lines changed: 23 additions & 21 deletions
@@ -1,42 +1,44 @@
 ---
-title: Embedders
-description: Learn about embedders and how to use them in voice conversion
+title: "Understanding Embedders"
+description: "Learn what embedders are and how to use them effectively in your voice conversion projects."
 ---
 
 import { Aside, Steps } from '@astrojs/starlight/components';
 
-## What are embedders?
+## What is an Embedder?
 
-Embedders are neural network models that convert raw audio input into high-dimensional vector representations. These representations capture essential acoustic and linguistic features of the audio, making them crucial for various audio processing tasks, including voice conversion.
+An **embedder** is a crucial component in the voice conversion process. It's a neural network that analyzes an audio file and converts it into a set of numerical representations, called "embeddings." These embeddings capture the essential acoustic and linguistic features of the audio, such as the speaker's tone, pitch, and accent.
 
-## How to use embedders?
+Think of an embedder as a translator that turns complex audio waves into a simplified language that the voice conversion model can understand and work with.
 
-Embedders are used in two main stages of the voice conversion process:
+## How to Use Embedders in Applio
 
-- **Training:** Select the embedder in the extraction settings.
-- **Inference:** Choose the same embedder in the advanced settings.
+You'll interact with embedders at two key stages of the voice conversion process:
 
-<Aside type="caution">
-It is critical to use the same embedder for both training and inference. The embedder used to train the pretrained model must be consistent throughout the entire process.
+- **Training:** When you're training a new voice model, you'll need to select an embedder in the **Extraction Settings**.
+- **Inference:** When you're using a trained model to convert a voice, you must select the *same* embedder in the **Advanced Settings**.
+
+<Aside type="danger" title="Critical Information">
+It is absolutely essential to use the same embedder for both training and inference. Using different embedders will result in poor-quality output or errors.
 </Aside>
 
-## Where to find embedders?
+## Where to Find Embedders
 
-You can find a variety of embedders on [Hugging Face](https://huggingface.co/models?pipeline_tag=feature-extraction&sort=trending&search=Hubert). To narrow down your search:
+You can find a wide variety of pre-trained embedders on [Hugging Face](https://huggingface.co/models?pipeline_tag=feature-extraction&sort=trending&search=Hubert). Here's how to find them:
 
 <Steps>
-1. Visit the Hugging Face model hub.
-2. Apply the "Feature Extraction" filter.
-3. Search for specific embedder types (e.g., "HuBERT", "Contentvec").
-4. Sort by trending or other relevant metrics to find popular and well-maintained models.
+1. Go to the Hugging Face model hub.
+2. In the sidebar, filter by **Task > Feature Extraction**.
+3. Use the search bar to find specific embedder types, such as "HuBERT" or "ContentVec".
</Steps>
 
 <Aside type="note">
-When choosing an embedder, consider factors such as model size, supported languages, and community adoption.
+When choosing an embedder, consider factors like the model's size, the languages it was trained on, and its popularity within the community.
 </Aside>
 
-## Best practices
+## Best Practices for Using Embedders
 
-- Experiment with different embedders to find the best fit for your specific voice conversion task.
-- Keep track of which embedder you use for each model to ensure consistency.
-- Stay updated with the latest developments in audio embedders, as new models may offer improved performance.
+- **Consistency is Key:** Always use the same embedder for a given model, from training all the way through to inference.
+- **Keep Track:** If you're working with multiple models, keep a record of which embedder you used for each one.
+- **Experiment:** Don't be afraid to experiment with different embedders to see which one works best for your specific use case. Some embedders may be better suited for singing, while others excel at speech.
+- **Stay Updated:** The field of audio processing is constantly evolving. Keep an eye out for new and improved embedders that may offer better performance.
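The "audio in, vectors out" pipeline the embedder page describes can be illustrated with a toy sketch. This is emphatically not how HuBERT or ContentVec work internally (those are learned neural networks producing high-dimensional vectors); the framing and the per-frame feature vector here are invented for illustration only:

```python
import math

def frame_signal(samples, frame_size=400, hop=160):
    """Split audio into overlapping frames (25 ms windows, 10 ms hop at 16 kHz)."""
    return [samples[i:i + frame_size]
            for i in range(0, len(samples) - frame_size + 1, hop)]

def toy_embed(frame):
    """Map one frame to a tiny feature vector: [RMS energy, zero-crossing rate, peak]."""
    rms = math.sqrt(sum(s * s for s in frame) / len(frame))
    zcr = sum(1 for a, b in zip(frame, frame[1:]) if a * b < 0) / len(frame)
    peak = max(abs(s) for s in frame)
    return [rms, zcr, peak]

def embed_audio(samples):
    """A toy 'embedder': audio in, one small feature vector per frame out."""
    return [toy_embed(f) for f in frame_signal(samples)]
```

A real embedder follows the same shape of pipeline but replaces the hand-picked features with hundreds of learned dimensions per frame, which is why swapping embedders between training and inference produces mismatched representations.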
Lines changed: 92 additions & 0 deletions
@@ -0,0 +1,92 @@
+---
+title: "Google Colab Guide"
+description: "Learn how to use Applio in the cloud with Google Colab."
+---
+
+import { Aside, Steps } from '@astrojs/starlight/components';
+
+Google Colab provides a convenient way to use Applio without needing a powerful local computer. However, it's important to be aware of the risks and limitations.
+
+<Aside type="danger" title="Important Notice">
+Launching graphical user interfaces (UIs) like Applio on Google Colab is against their Terms of Service. Doing so may result in limitations being placed on your Google account. If you understand and accept this risk, you may proceed.
+
+As a safer alternative, we recommend using the official [Applio No UI Colab Notebook](https://colab.research.google.com/github/iahispano/applio/blob/main/assets/Applio_NoUI.ipynb), which is designed to be used without a graphical interface.
+</Aside>
+
+## Getting Started with the Applio UI Colab
+
+If you choose to proceed with the UI version, here's how to get it running.
+
+<Steps>
+1. **Open the Colab Notebook:** Launch the [Applio UI Colab Notebook](https://colab.research.google.com/github/iahispano/applio/blob/main/assets/Applio.ipynb).
+2. **Install Applio:** Run the first cell, labeled "Install Applio," by clicking the play button. This will install Applio and all its dependencies.
+3. **Launch the Interface:** Run the second cell. This will launch the Applio interface and provide you with a URL to access it. We recommend using the `localtunnel` sharing method for a more stable connection.
+4. **Access the UI:** Open the provided URL. You will be prompted for a password, which is the IP address displayed in the Colab cell output.
+</Steps>
+
+![A screenshot showing the two main cells to run in the Applio Colab notebook.](/images/colab.png)
+
+## Training on Colab
+
+Training models on Colab requires a bit of extra setup to ensure you don't lose your progress.
+
+### Syncing with Google Drive
+
+We highly recommend syncing your Colab instance with Google Drive. This will save your trained models to a folder called `ApplioBackup` in your Google Drive and allow you to resume training from a previously saved model.
+
+To do this, run the **Sync with Google Drive** cell in the Colab notebook.
+
+![A screenshot of the "Sync with Google Drive" cell in the Applio Colab notebook.](/images/extra-colab.png)
+
+### Resuming Training
+
+To resume training a model that you've previously saved to Google Drive:
+
+<Steps>
+1. Run all the initial cells, including **Install Applio** and **Sync with Google Drive**.
+2. In the Applio UI, go to the **Train** tab.
+3. Enter the name of your model.
+4. Select the same sample rate you used previously.
+5. Load your custom pretrained model if you used one.
+6. Increase the number of epochs and click **Train** to continue training.
+</Steps>
+
+## Managing Models on Colab
+
+### Exporting Your Final Model
+
+Once your model is fully trained, you can export it to your Google Drive.
+
+<Steps>
+1. Go to the **Train** tab and click the **Export Model** sub-tab.
+2. Click the **Refresh** button.
+3. Select the `.pth` and `.index` files for your model.
+4. Click the **Upload** button. Your model will be saved to a folder named `ApplioExported` in your Google Drive.
+</Steps>
+
+## Keeping Colab Active
+
+Google Colab will automatically disconnect idle notebooks. To prevent this from happening during a long training session, you can run a small script in your browser's developer console.
+
+<Steps>
+1. Press `Ctrl + Shift + i` to open the developer tools.
+2. Go to the **Console** tab.
+3. Type `allow pasting` and press Enter.
+4. Paste the following code into the console and press Enter:
+```js
+function ClickConnect() {
+  var iconElement = document.getElementById("toggle-header-button");
+  if (iconElement) {
+    var clickEvent = new MouseEvent("click", {
+      bubbles: true,
+      cancelable: true,
+      view: window,
+    });
+    iconElement.dispatchEvent(clickEvent);
+  }
+}
+setInterval(ClickConnect, 60000);
+```
+</Steps>
+
+This script will simulate a click every minute, keeping your Colab session active.

0 commit comments
