Name	Name	Last commit message	Last commit date
parent directory ..
.devcontainer	.devcontainer
Dockerfile	Dockerfile
README.md	README.md
model.keras	model.keras
predict.py	predict.py
requirements.txt	requirements.txt
run.sh	run.sh
train.py	train.py

Name

Last commit message

Last commit date

Dockerfile

Audio Classification Model

This repository contains a project to classify audio files as either real or faked using MFCC (Mel-frequency cepstral coefficients) features and a neural network model built with TensorFlow and Keras.

📂 Dataset

The dataset should be organized into two directories:

dataset/real: Contains real audio samples in .wav format.
dataset/fake: Contains faked audio samples in .wav format.

🛠️ Setup

Prerequisites

You can install the required packages using pip in env:

pip install numpy tensorflow keras librosa scikit-learn

Project Structure

├── dataset
│   ├── real
│   │   └── *.wav
│   └── fake
│       └── *.wav
├── model.py
├── model.h5
└── README.md

🚀 Getting Started

1. Data Preparation

Ensure your dataset is correctly placed in the dataset/real and dataset/fake directories. Each .wav file will be processed to extract MFCC features.

2. Training the Model

Run the model.py script to train the model:

python model.py

This script performs the following steps:

Extract MFCC Features: From the audio files in the dataset.
Create Dataset: Combine the features and labels for real and faked audio.
Train the Model: Train a neural network model on the extracted features.
Save the Model: The trained model is saved as model.h5.

3. Analyzing New Audio Files

After training the model, you can use it to classify new audio files:

When prompted, enter the path of the .wav file you want to analyze. The model will predict whether the audio is real or faked.

📊 Evaluation

The model's performance is evaluated using accuracy and a confusion matrix, which are printed during training:

Accuracy: 0.95

🔧 Functions Overview

extract_mfcc_features(audio_path, n_mfcc=13, n_fft=2048, hop_length=512): Extracts MFCC features from the given audio file.
create_dataset(directory, label): Creates a dataset by extracting MFCC features from all .wav files in the given directory.
build_model(input_shape): Builds a neural network model with the specified input shape.
train_model(X, y): Trains the neural network model on the provided features and labels.
analyze_audio(input_audio_path): Analyzes a given audio file to predict whether it is real or faked.

📝 Notes

Ensure the audio files are in .wav format and 22500hH.
Adjust the parameters such as n_mfcc, n_fft, and hop_length if necessary to better suit your data.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

Audio Classification Model

📂 Dataset

🛠️ Setup

Prerequisites

Project Structure

🚀 Getting Started

1. Data Preparation

2. Training the Model

3. Analyzing New Audio Files

📊 Evaluation

🔧 Functions Overview

📝 Notes

FilesExpand file tree

tensorflow

Directory actions

More options

Directory actions

More options

Latest commit

History

tensorflow

Folders and files

parent directory

README.md

Audio Classification Model

📂 Dataset

🛠️ Setup

Prerequisites

Project Structure

🚀 Getting Started

1. Data Preparation

2. Training the Model

3. Analyzing New Audio Files

📊 Evaluation

🔧 Functions Overview

📝 Notes