Hearoo 🎧 — The Smart Audio Classifier & Visualizer

Welcome to Hearoo, your interactive, ResNet-powered audio classifier and visualizer! This project combines state-of-the-art deep learning with an intuitive web interface to make exploring sound exciting and insightful.

🚀 Overview

Hearoo is a CNN-based audio classifier and visualizer built on ResNet, trained using Modal.com on the ESC-50 dataset, which contains 50 diverse environmental sound categories like dog barks, rain, clapping, thunderstorm, and more.

Model Accuracy: 81.25%
Training Dataset: ESC-50
Loss: 1.0082
Validation Loss: 1.3925

Hearoo can:

Classify your audio files (WAV format) into 50 sound categories.
Visualize deep feature maps from ResNet layers to see how the network understands sound.
Explore the waveform and spectrogram of your audio in a beautiful, interactive UI.

🎯 Features

Upload or use sample audio files from the ESC-50 dataset.
Interactive ResNet feature maps display how each convolutional layer reacts to sounds.
Top-3 predictions with confidence scores for easy interpretation.
Waveform and spectrogram visualization to understand audio patterns.
Credits warning system to remind users about Modal.com usage limits (customizable).

🔊 Sample Audios

Try Hearoo instantly with these sample audio files:

Can Opening
Chirping Birds
Clapping
Knocking
Thunderstorm

You can also explore more audio samples from the ESC-50 dataset.

⚡ How It Works

The frontend is built in Next.js 15 for an interactive, reactive experience.
The backend runs on Modal.com, where ResNet is trained and serves predictions via a secure endpoint.
When a user uploads or selects a sample, the frontend sends audio to Modal, which returns:
- Top-3 predictions
- Feature maps of ResNet layers
- Input spectrogram
- Audio waveform

All results are displayed in real-time, making audio exploration seamless and visually appealing.

🛠 Tech Stack

Frontend: Next.js 15, React, Tailwind CSS
Backend / Model Hosting: Modal.com
Deep Learning: PyTorch, Torchaudio
Audio Dataset: ESC-50 (50 environmental sound classes)

🔐 Future Updates

Future updates may include authentication so only authorized users can analyze audio.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
Graphs		Graphs
frontend		frontend
.gitignore		.gitignore
Modal.txt		Modal.txt
README.md		README.md
chirpingbirds.wav		chirpingbirds.wav
main.py		main.py
model.py		model.py
requirements.txt		requirements.txt
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Hearoo 🎧 — The Smart Audio Classifier & Visualizer

🚀 Overview

🎯 Features

🔊 Sample Audios

⚡ How It Works

🛠 Tech Stack

🔐 Future Updates

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Hearoo 🎧 — The Smart Audio Classifier & Visualizer

🚀 Overview

🎯 Features

🔊 Sample Audios

⚡ How It Works

🛠 Tech Stack

🔐 Future Updates

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages