ABSTRACT
We present Essentia 2.0, an open-source C++ library for audio analysis and audio-based music information retrieval released under the Affero GPL license. It contains an extensive collection of reusable algorithms which implement audio input/output functionality, standard digital signal processing blocks, statistical characterization of data, and a large set of spectral, temporal, tonal and high-level music descriptors. The library is also wrapped in Python and includes a number of predefined executable extractors for the available music descriptors, which facilitates its use for fast prototyping and allows setting up research experiments very rapidly. Furthermore, it includes a Vamp plugin to be used with Sonic Visualiser for visualization purposes. The library is cross-platform and currently supports Linux, Mac OS X, and Windows systems. Essentia is designed with a focus on the robustness of the provided music descriptors and is optimized in terms of the computational cost of the algorithms. The provided functionality, specifically the music descriptors included in-the-box and signal processing algorithms, is easily expandable and allows for both research experiments and development of large-scale industrial applications.
- D. Bogdanov, M. Haro, F. Fuhrmann, A. Xambó, E. Gómez, and P. Herrera. Semantic audio content-based music recommendation and visualization based on user preference examples. Inf. Process. & Management, 49(1):13--33, 2013. Google ScholarDigital Library
- D. Bogdanov, J. Serrà, N. Wack, P. Herrera, and X. Serra. Unifying low-level and high-level music similarity measures. IEEE Trans. on Multimedia, 13(4):687--701, 2011. Google ScholarDigital Library
- D. Bogdanov, N. Wack, E. Gómez, S. Gulati, P. Herrera, O. Mayor, G. Roma, J. Salamon, J. Zapata, and X. Serra. ESSENTIA: an audio analysis library for music information retrieval. In Int. Soc. for Music Inf. Retrieval Conf. (ISMIR'13), 2013.Google Scholar
- C. Cannam, C. Landone, and M. Sandler. Sonic visualiser: An open source application for viewing, analysing, and annotating music audio files. In ACM Int. Conf. on Multimedia (MM'05), page 1467--1468, 2010. Google ScholarDigital Library
- F. Eyben, M. Wöllmer, and B. Schuller. Opensmile: the munich versatile and fast open-source audio feature extractor. In ACM Int. Conf. on Multimedia (MM'10), page 1459--1462, 2010. Google ScholarDigital Library
- F. Fuhrmann, P. Herrera, and X. Serra. Detecting solo phrases in music using spectral and pitch-related descriptors. Journal of New Music Research, 38(4):343--356, 2009.Google ScholarCross Ref
- C. F. Julià and S. Jordà. SongExplorer: a tabletop application for exploring large collections of songs. In Int. Soc. for Music Inf. Retrieval Conf. (ISMIR'09), 2009.Google Scholar
- S. Koelsch, S. Skouras, T. Fritz, P. Herrera, C. Bonhage, M. Kuessner, and A. M. Jacobs. Neural correlates of music-evoked fear and joy: The roles of auditory cortex and superficial amygdala. Neuroimage. In press.Google Scholar
- K. R. Page, B. Fields, D. De Roure, T. Crawford, and J. S. Downie. Reuse, remix, repeat: the workflows of MIR. In Int. Soc. for Music Inf. Retrieval Conf. (ISMIR'12), 2012.Google Scholar
- G. Roma, J. Janer, S. Kersten, M. Schirosa, P. Herrera, and X. Serra. Ecological acoustics perspective for content-based retrieval of environmental sounds. EURASIP Journal on Audio, Speech, and Music Process., 2010. Google ScholarDigital Library
- J. Serrà, E. Gómez, P. Herrera, and X. Serra. Chroma binary similarity and local alignment applied to cover song identification. IEEE Trans. on Audio, Speech, and Language Process., 16(6):1138--1151, 2008. Google ScholarDigital Library
- M. Sordo. Semantic Annotation of Music Collections: A Computational Approach. PhD thesis, UPF, Barcelona, Spain, 2012.Google Scholar
- N. Wack, C. Laurier, O. Meyers, R. Marxer, D. Bogdanov, J. Serra, E. Gomez, and P. Herrera. Music classification using high-level models. In Music Inf. Retrieval Evaluation Exchange (MIREX'10), 2010.Google Scholar
Index Terms
- ESSENTIA: an open-source library for sound and music analysis
Recommendations
madmom: A New Python Audio and Music Signal Processing Library
MM '16: Proceedings of the 24th ACM international conference on MultimediaIn this paper, we present madmom, an open-source audio processing and music information retrieval (MIR) library written in Python. madmom features a concise, NumPy-compatible, object oriented design with simple calling conventions and sensible default ...
Sonic visualiser: an open source application for viewing, analysing, and annotating music audio files
MM '10: Proceedings of the 18th ACM international conference on MultimediaSonic Visualiser is a friendly and flexible end-user desktop application for analysis, visualisation, and annotation of music audio files. Its stated goal is to be "the first program you reach for when want to study a musical recording rather than ...
PopMash: an automatic musical-mashup system using computation of musical and lyrical agreement for transitions
AbstractMusical-mashup is a popular form of music re-creation, aiming at combining multiple pieces of music to create new music artworks. Presently, it is also a challenge in the field of music information study. In this work, an effective framework for ...
Comments