PytorchForAudio

Code for the "PyTorch for Audio + Music Processing" series on The Sound of AI YouTube channel.

This repository is a comprehensive collection of resources and code for understanding and implementing deep learning models for audio tasks using PyTorch and Torchaudio. It serves as a practical guide, moving from basic neural network implementations to building a complete sound classification system (CNN) trained on the UrbanSound8K dataset.

Note on Versioning

While this v2 release is fully functional and optimized for current environments, it may differ from the original version shown in the course. The codebase has been updated to reflect modern best practices and improved dependency management. Consequently, the original course version has been deprecated; however, it remains available in the legacy branch for those wishing to follow the video content exactly.

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
01 Course overview		01 Course overview
02 Training a feed forward network		02 Training a feed forward network
03 Making predictions		03 Making predictions
04 Creating a custom dataset		04 Creating a custom dataset
05 Extracting Mel spectrograms		05 Extracting Mel spectrograms
06 Padding audio files		06 Padding audio files
07 Preprocessing data on GPU		07 Preprocessing data on GPU
08 Implementing a CNN network		08 Implementing a CNN network
09 Training urban sound classifier		09 Training urban sound classifier
10 Predictions with sound classifier		10 Predictions with sound classifier
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
CONTRIBUTORS.md		CONTRIBUTORS.md
Instructions_UrbanSound8K.md		Instructions_UrbanSound8K.md
LICENSE		LICENSE
README.md		README.md
dataset_downloader.py		dataset_downloader.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PytorchForAudio

Note on Versioning

Table of Contents

Dataset Setup (UrbanSound8K)

Course Structure

Introduction & Basics

Audio Data Processing

Sound Classification Project (UrbanSound8K)

How to Run the Scripts

1. Prepare the Environment (Recommended)

2. Navigate to the Lesson Folder

3. Execute the Script

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

PytorchForAudio

Note on Versioning

Table of Contents

Dataset Setup (UrbanSound8K)

Course Structure

Introduction & Basics

Audio Data Processing

Sound Classification Project (UrbanSound8K)

How to Run the Scripts

1. Prepare the Environment (Recommended)

2. Navigate to the Lesson Folder

3. Execute the Script

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages