Skip to content

musikalkemist/AudioSignalProcessingForML

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

53 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

AudioSignalProcessingForML

Code and slides of my YouTube series called "Audio Signal Proessing for Machine Learning"

This repository is a comprehensive collection of resources, code, and explanations for understanding and implementing audio signal processing techniques, with a focus on applications in machine learning. It serves as a learning guide, starting from the fundamentals of sound and waveforms and progressing to advanced feature extraction methods.

Course Structure

Foundational Concepts

  1. Overview: Video | Slides
  2. Sound and waveforms: Video | Slides
  3. Intensity, loudness, and timbre: Video | Slides | Notebook
  4. Understanding audio signals: Video | Slides

Feature Extraction Theory

  1. Types of audio features for ML: Video | Slides
  2. How to extract audio features: Video | Slides
  3. Time-domain audio features: Video | Slides

Time-Domain Implementation

  1. Implementing the amplitude envelope: Video | Notebook
  2. RMS energy and zero-crossing rate: Video | Notebook

Frequency-Domain Concepts

  1. Fourier Transform: The Intuition: Video | Slides
  2. Complex numbers for audio signal processing: Video | Slides
  3. Defining the Fourier transform using complex numbers: Video | Slides | Notebook
  4. Discrete Fourier Transform: Video | Slides

Frequency-Domain Implementation

  1. Extracting the Discrete Fourier Transform: Video | Notebook
  2. Short-Time Fourier Transform explained easily: Video | Slides
  3. Extracting Spectrograms from Audio with Python: Video | Notebook
  4. Mel Spectrogram Explained Easily: Video | Slides
  5. Extracting Mel Spectrograms with Python: Video | Notebook
  6. MFCCs Explained Easily: Video | Slides
  7. Extracting MFCCs with Python: Video | Notebook
  8. Frequency-Domain Audio Features: Video | Slides
  9. Implementing Band Energy Ratio from Scratch with Python: Video | Notebook
  10. Spectral centroid and bandwidth: Video | Notebook

Audio examples

  • audio_resources/: A collection of .wav files used for the examples in the notebooks.

About

Code and slides of my YouTube series called "Audio Signal Proessing for Machine Learning"

Resources

License

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published