You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: report/report.md
+45-18Lines changed: 45 additions & 18 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -978,6 +978,15 @@ based on selecting and manually labeling content from the Freesound[@Freesound]
978
978
1302 different recordings were annotated, for a total of 18.5 hours of labeled audio.
979
979
A curated subset with 8732 audio clips of maximum 4 seconds is known as *Urbansound8k*.
980
980
981
+
\begin{table}[h]
982
+
\centering
983
+
\scalebox{0.8}{
984
+
\input{pyincludes/urbansound8k-classes.tex}
985
+
}
986
+
\caption{Classes found in the Urbansound8k dataset}
987
+
\label{table:urbansound8k-classes}
988
+
\end{table}
989
+
981
990
YorNoise[@medhat2017masked] is a collection of vehicle noise.
982
991
It has a total of 1527 samples, in two classes: road traffic (cars, trucks, buses) and rail (trains).
983
992
The dataset follows the same design as Urbansound8k,
@@ -1256,18 +1265,15 @@ They claim a 16x improvement in power efficiency over a ARM Cortex M7 chip[@GAP8
1256
1265
1257
1266
## Dataset
1258
1267
1259
-
The dataset used for the experiements is Urbansound8K, described in chapter \ref{chapter:datasets}.
1260
-
The 10 classes in the dataset are listed in Table \ref{table:urbansound8k-classes},
1261
-
and Figure \ref{figure:urbansound8k-examples} shows example audio spectrograms.
1262
-
1263
-
\begin{table}
1264
-
\centering
1265
-
\input{pyincludes/urbansound8k-classes.tex}
1266
-
\caption{Classes found in the Urbansound8k dataset}
1267
-
\label{table:urbansound8k-classes}
1268
-
\end{table}
1268
+
The dataset used for the experiments is Urbansound8K, described in chapter \ref{chapter:datasets}.
1269
+
Figure \ref{figure:urbansound8k-examples} shows example audio spectrograms for each of the 10 classes.
1269
1270
1270
-

\caption[Testing model on device]{Model being tested on device. Sound is played back via headphones and classified on the microcontroller. Predictions are sent to computer and visualized on screen in real-time. }
1572
+
\label{figure:demo}
1573
+
\end{figure}
1552
1574
1553
-
`FIXME: add a picture of demo setup`
1575
+
The on-device demonstration used the SENSING1 application example as base,
1576
+
and modifications were made to send the predictions out over USB.
1577
+
This example code only supports mel-spectrogram preprocessing with 16 kHz sample-rate, 30 filters
1578
+
and 1024 samples FFT window with 512 hops, using max-normalization for the analysis windows.
1579
+
A Strided-DS-5x5 model was trained on fold 1-8 to match these feature settings.
1580
+
The model scored 72% on the associated validation-set, fold 9.
0 commit comments