Abstract
Recognition of harmonic characteristics from polyphonic music, in particular intervals, can be very hard if the different instruments with their specific characteristics (overtones, formants, noisy components) are playing together at the same time. In our study we examined the impact of chroma features and spectrum on classification of single tone pitchs and music intervals played either by the same or different instruments. After the analysis of the audio recordings which produced the most errors we implemented two optimization approaches based on energy envelope and overtone distribution. The methods were compared during the experiment study. The results show that especially the integration of instrument-specific knowledge can significantly improve the overall performance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bartsch, M. A., & Wakefield, G. H. (2005). Audio thumbnailing of popular music using chroma-based representations. IEEE Transactions on Multimedia, 7(1), 96–104.
Eronen, A. (2009). Signal processing methods for audio classification and music content analysis. PhD thesis, Tampere University of Technology.
Fujishima, T. (1999). Realtime chord recognition of musical sound: A system using common lisp music. In Proceedings of the international computer music conference (ICMC), Beijing (pp. 464–467).
Gómez, E. (2006). Tonal description of music audio signals. PhD thesis, Universitat Pompeu Fabra, Department of Technology.
Jourdain, R. (1998). Music, the brain and ecstasy: How music captures our imagination. New York: Harper Perennial.
Lartillot, O., & Toiviainen, P. (2007). MIR in Matlab (II): A toolbox for musical feature extraction from audio. In Proceedings of the 8th international conference on music information retrieval (ISMIR) (pp. 127–130).
Mauch, M. (2010). Automatic chord transcription from audio using computational models of musical context. PhD thesis, Queen Mary University of London.
McGill University Master Samples. http://www.music.mcgill.ca/resources/mums/html/.
Müller, M., & Ewert, S. (2010). Towards timbre-invariant audio features for harmony-based music. IEEE Transactions on Audio, Speech, and Language Processing, 18(3), 649–662.
Park, T. H. (2010). Introduction to digital signal processing: Computer musically speaking (1st Ed.). Singapore/Hackensack: World Scientific Publishing Co. Pte. Ltd.
Temperley, D. (2007). Music and probability. Cambridge, MA: MIT.
Theimer, W., Vatolkin, I., & Eronen, A. (2008). Definitions of audio features for music content description. Technical report TR08-2-001, University of Dortmund.
Vatolkin, I., Theimer, W., & Botteck, M. (2010). Amuse (advanced mUSic explorer) – a multitool framework for music data analysis. In Proceedings of the 11th international society for music information retrieval conference (ISMIR), Utrecht (pp. 33–38).
Witten, I. H., & Frank, E. (2005). Data mining: Practical machine learning tools and techniques. Amsterdam/Boston: Morgan Kaufmann.
Acknowledgements
We thank the Klaus Tschira Foundation for the financial support.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer International Publishing Switzerland
About this paper
Cite this paper
Mattern, V., Vatolkin, I., Rudolph, G. (2013). A Case Study About the Effort to Classify Music Intervals by Chroma and Spectrum Analysis. In: Lausen, B., Van den Poel, D., Ultsch, A. (eds) Algorithms from and for Nature and Life. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Cham. https://doi.org/10.1007/978-3-319-00035-0_53
Download citation
DOI: https://doi.org/10.1007/978-3-319-00035-0_53
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-00034-3
Online ISBN: 978-3-319-00035-0
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)