Three dimensions of pitched instrument onset detection
MetadataShow full item record
CitationHolzapfel, A., Stylianou, Y., Gedik, A. C., and Bozkurt, B. (2010). Three dimensions of pitched instrument onset detection. IEEE Transactions on Audio, Speech and Language Processing, 18(6), 1517-1527. doi:10.1109/TASL.2009.2036298
In this paper, we suggest a novel group delay based method for the onset detection of pitched instruments. It is proposed to approach the problem of onset detection by examining three dimensions separately: phase (i.e., group delay), magnitude and pitch. The evaluation of the suggested onset detectors for phase, pitch and magnitude is performed using a new publicly available and fully onset annotated database of monophonic recordings which is balanced in terms of included instruments and onset samples per instrument, while it contains different performance styles. Results show that the accuracy of onset detection depends on the type of instruments as well as on the style of performance. Combining the information contained in the three dimensions by means of a fusion at decision level leads to an improvement of onset detection by about 8% in terms of F-measure, compared to the best single dimension. © 2010 IEEE.