A list of useful data augmentation resources, including less common techniques, libraries, links to GitHub repos, papers, and more.
Updated Aug 14, 2024
Pitch-shift audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.
⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks.
A ready-to-use PyTorch dataloader for audio classification, speech classification, speaker recognition, and similar tasks, with on-GPU augmentations
Audio data loading and augmentations in JAX
This repository contains the code and methodology used for the BirdCLEF 2024 Kaggle competition, where I ranked 55th out of 974 participants and earned a bronze medal. The goal of the competition was to build a model that accurately classifies bird sounds.
SoundScaper is an audio augmented reality mobile application that allows users to author, save, and reload virtual, spatially interactive, three-dimensional binaural soundscapes within physical, real-world spaces.
Converting text to audio and applying audio augmentation