From: Mart Lubbers Date: Thu, 2 Mar 2017 20:27:22 +0000 (+0100) Subject: Add glossary and start making excerpts from the literature found X-Git-Url: https://git.martlubbers.net/?a=commitdiff_plain;h=a0777d0d23ab65be1bfadef5322ba956a51ebb94;p=asr1617.git Add glossary and start making excerpts from the literature found --- diff --git a/asr.tex b/asr.tex index e4c126d..bfbcf86 100644 --- a/asr.tex +++ b/asr.tex @@ -1,4 +1,14 @@ %&asr +\usepackage[nonumberlist,acronyms]{glossaries} +\makeglossaries% +\newacronym{HMM}{HMM}{Hidden Markov Model} +\newacronym{HTK}{HTK}{\acrlong{HMM} Toolkit} +\newacronym{FA}{FA}{Forced alignment} +\newacronym{MFC}{MFC}{Mel-frequency cepstrum} +\newacronym{MFCC}{MFCC}{\acrlong{MFC} coefficient} +%\newglossaryentry{mTask}{name=mTask, +% description={is an abstraction for \glspl{Task} living on \acrshort{IoT} devices}} + \begin{document} %Titlepage \maketitleru[ @@ -7,31 +17,37 @@ authorstext={Author:}] \listoftodos[Todo] -t\cite{muller_multimodal_2012} - -t\cite{pedone_phoneme-level_2011} - -t\cite{fujihara_automatic_2006} - -t\cite{mesaros_adaptation_2009} +\tableofcontents -t\cite{mesaros_automatic_2010} +%Glossaries +\glsaddall{} +\printglossaries% -t\cite{dzhambazov_automatic_2016} +Berenzweig and Ellis use acoustic classifiers from speech recognition as a +detector for singing lines. They achive 80\% accuracy for forty 15 second +exerpts. They mention people that wrote signal features that discriminate +between speech and music. Neural net +\glspl{HMM}.\cite{berenzweig_locating_2001}. -t\cite{mesaros_automatic_2008} - -t\cite{berenzweig_locating_2001} +In 2014 Dzhambazov et al.\ applied state of the art segmentation methods to +polyphonic turkish music, this might be interesting to use for heavy metal. +They mention Fujihara(2011) to have a similar \gls{FA} system. This method uses +phone level segmentation, first 12 \gls{MFCC}s. They first do vocal/non-vocal +detection, then melody extraction, then alignment. They compare results with +Mesaros \& Virtanen, 2008. t\cite{dzhambazov_automatic_2014} - -t\cite{fujihara_three_2008} - -t\cite{yang_machine_2012} - +t\cite{dzhambazov_automatic_2016} +t\cite{fujihara_automatic_2006} t\cite{fujihara_lyricsynchronizer:_2011} - +t\cite{fujihara_three_2008} t\cite{mauch_integrating_2012} +t\cite{mesaros_adaptation_2009} +t\cite{mesaros_automatic_2008} +t\cite{mesaros_automatic_2010} +t\cite{muller_multimodal_2012} +t\cite{pedone_phoneme-level_2011} +t\cite{yang_machine_2012} \bibliographystyle{ieeetr} \bibliography{asr}