Skip to content

Commit 03efa38

Browse files
authored
Update README.md
1 parent 8908840 commit 03efa38

File tree

1 file changed

+33
-5
lines changed

1 file changed

+33
-5
lines changed

README.md

Lines changed: 33 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -21,13 +21,42 @@ Audio dataset given has very high frame rate, on an average every file had 80,00
2121
![alt text](https://raw.githubusercontent.com/vishalshar/Audio-Classification-using-CNN-MLP/master/img/audio_preprocessing-1.png)
2222

2323

24+
25+
26+
### Using ANN
27+
28+
During initial experiments ANN was not performing good and later after several experiments a Multi Layer Perceptron (MLP) model was build based on intuition of CNN. Before we feed audio data in network it was max pooled in 3 different layers and output of pooled layers was given input to the fully connected layers as shows in below figure. To merge features extracted from different pooling layers output of fully connected layer was merged.
29+
30+
31+
#### Core Idea:
32+
Sample Bee Audio and expected feature extraction using pooling layers and merging fully connected layers
33+
34+
![alt text](https://raw.githubusercontent.com/vishalshar/Audio-Classification-using-CNN-MLP/master/img/audio_graph-1.png)
35+
36+
37+
#### Performance:
38+
39+
Training was done for 500 epochs using Adaptive Moment Estimation (adam) as optimizer with learning rate of 0.0005. Figure 9 displays accuracy during training.
40+
41+
| | Training | Testing |
42+
|-----| ------------- |:-------------:|
43+
|Accuracy | 91.11% | 88.25% |
44+
45+
![alt text](https://raw.githubusercontent.com/vishalshar/Audio-Classification-using-CNN-MLP/master/img/ANN_Net_2-1.png)
46+
47+
#### Accuracy
48+
49+
![alt text](https://raw.githubusercontent.com/vishalshar/Audio-Classification-using-CNN-MLP/master/img/bee_ann_audio.png)
50+
51+
52+
2453
### Using CNN
2554

2655
A network using Convolution layers was used to build classifier, network architecture is shown in Fig 6. The number of filters for both convolution was 64 and filter_size was 10 and 3 for respective layers followed by 3 fully connected layers, details about activation function used is in code. Max pooling was used after each convolution layer. During training over fitting was observed, to handle that dropout of 50% (keep) was used after first two fully connected layers and also ‘L2’ regularization was added to both layers. Input length was fixed as 22,000 with 1 channel. During training it was also observed, without downsampling
2756
data model was not able to generalize well between bee and noise data. Adding downsampling technique helped the
2857
model in generalization.
2958

30-
## Performance:
59+
#### Performance:
3160

3261
Training was done for 500 epochs using Adaptive Moment Estimation (adam) as optimizer with learning rate of 0.0001.
3362

@@ -37,9 +66,8 @@ Training was done for 500 epochs using Adaptive Moment Estimation (adam) as opti
3766
|Accuracy | 99.88% | 99.45% |
3867

3968

40-
41-
![alt text](https://raw.githubusercontent.com/vishalshar/Audio-Classification-using-CNN-MLP/master/img/ANN_Net_2-1.png)
4269
![alt text](https://raw.githubusercontent.com/vishalshar/Audio-Classification-using-CNN-MLP/master/img/CNN_Net-1.png)
43-
![alt text](https://raw.githubusercontent.com/vishalshar/Audio-Classification-using-CNN-MLP/master/img/audio_graph-1.png)
44-
![alt text](https://raw.githubusercontent.com/vishalshar/Audio-Classification-using-CNN-MLP/master/img/bee_ann_audio.png)
70+
71+
#### Accuracy
72+
4573
![alt text](https://raw.githubusercontent.com/vishalshar/Audio-Classification-using-CNN-MLP/master/img/bee_cnn_audio.png)

0 commit comments

Comments
 (0)