existing models that were trained on regular singing voices to detect grunting.
The same experiments can be done the other way around as well.
-\paragraph{Decorrelation }
+\paragraph{Decorrelation: }
Adding another layer to the \gls{MLP} can be seen as applying an extra
normalization step to the input data. It could be that the last step in
converting the waveforms to \gls{MFCC} can be performed by the neural network.
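A sketch of why this is plausible, assuming the standard \gls{MFCC} pipeline in
which the last step is a type-II discrete cosine transform (DCT) of the log mel
filterbank energies. The symbols $s_n$, $c_k$, $N$, $K$ and $D$ are introduced
here for illustration only. With $N$ filterbank energies $s_0, \dots, s_{N-1}$,
the $k$-th coefficient is
\begin{equation}
  c_k = \sum_{n=0}^{N-1} s_n \cos\left[\frac{\pi}{N}\left(n + \frac{1}{2}\right) k\right],
  \qquad k = 0, \dots, K-1 .
\end{equation}
This is a matrix product $\mathbf{c} = D\,\mathbf{s}$ with
$D_{kn} = \cos\left[\frac{\pi}{N}\left(n + \frac{1}{2}\right) k\right]$, so a
fully connected layer with weight matrix $D$, zero bias and a linear activation
would reproduce it exactly, whereas a layer with a nonlinear activation could
only approximate it.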
hardly any difference between the performance of a model with 8 or 13 nodes.
Moreover, contrary to expectations, the window size does not seem to have much
effect on the performance.
+