research-article

ESSENTIA: an open-source library for sound and music analysis

Authors:
Dmitry Bogdanov

Universitat Pompeu Fabra, Barcelona, Spain

Universitat Pompeu Fabra, Barcelona, Spain
View Profile

,
Nicolas Wack

Universitat Pompeu Fabra, Barcelona, Spain

Universitat Pompeu Fabra, Barcelona, Spain
View Profile

,
Emilia Gómez

Universitat Pompeu Fabra, Barcelona, Spain

Universitat Pompeu Fabra, Barcelona, Spain
View Profile

,
Sankalp Gulati

Universitat Pompeu Fabra, Barcelona, Spain

Universitat Pompeu Fabra, Barcelona, Spain
View Profile

,
Perfecto Herrera

Universitat Pompeu Fabra, Barcelona, Spain

Universitat Pompeu Fabra, Barcelona, Spain
View Profile

,
Oscar Mayor

Universitat Pompeu Fabra, Barcelona, Spain

Universitat Pompeu Fabra, Barcelona, Spain
View Profile

,
Gerard Roma

Universitat Pompeu Fabra, Barcelona, Spain

Universitat Pompeu Fabra, Barcelona, Spain
View Profile

,
Justin Salamon

Universitat Pompeu Fabra, Barcelona, Spain

Universitat Pompeu Fabra, Barcelona, Spain
View Profile

,
José Zapata

Universitat Pompeu Fabra, Barcelona, Spain

Universitat Pompeu Fabra, Barcelona, Spain
View Profile

,
Xavier Serra

Universitat Pompeu Fabra, Barcelona, Spain

Universitat Pompeu Fabra, Barcelona, Spain
View Profile

MM '13: Proceedings of the 21st ACM international conference on MultimediaOctober 2013Pages 855–858https://doi.org/10.1145/2502081.2502229

Published:21 October 2013Publication History

MM '13: Proceedings of the 21st ACM international conference on Multimedia

Pages 855–858

ABSTRACT

We present Essentia 2.0, an open-source C++ library for audio analysis and audio-based music information retrieval released under the Affero GPL license. It contains an extensive collection of reusable algorithms which implement audio input/output functionality, standard digital signal processing blocks, statistical characterization of data, and a large set of spectral, temporal, tonal and high-level music descriptors. The library is also wrapped in Python and includes a number of predefined executable extractors for the available music descriptors, which facilitates its use for fast prototyping and allows setting up research experiments very rapidly. Furthermore, it includes a Vamp plugin to be used with Sonic Visualiser for visualization purposes. The library is cross-platform and currently supports Linux, Mac OS X, and Windows systems. Essentia is designed with a focus on the robustness of the provided music descriptors and is optimized in terms of the computational cost of the algorithms. The provided functionality, specifically the music descriptors included in-the-box and signal processing algorithms, is easily expandable and allows for both research experiments and development of large-scale industrial applications.

References

D. Bogdanov, M. Haro, F. Fuhrmann, A. Xambó, E. Gómez, and P. Herrera. Semantic audio content-based music recommendation and visualization based on user preference examples. Inf. Process. & Management, 49(1):13--33, 2013. Google ScholarDigital Library
D. Bogdanov, J. Serrà, N. Wack, P. Herrera, and X. Serra. Unifying low-level and high-level music similarity measures. IEEE Trans. on Multimedia, 13(4):687--701, 2011. Google ScholarDigital Library
D. Bogdanov, N. Wack, E. Gómez, S. Gulati, P. Herrera, O. Mayor, G. Roma, J. Salamon, J. Zapata, and X. Serra. ESSENTIA: an audio analysis library for music information retrieval. In Int. Soc. for Music Inf. Retrieval Conf. (ISMIR'13), 2013.Google Scholar
C. Cannam, C. Landone, and M. Sandler. Sonic visualiser: An open source application for viewing, analysing, and annotating music audio files. In ACM Int. Conf. on Multimedia (MM'05), page 1467--1468, 2010. Google ScholarDigital Library
F. Eyben, M. Wöllmer, and B. Schuller. Opensmile: the munich versatile and fast open-source audio feature extractor. In ACM Int. Conf. on Multimedia (MM'10), page 1459--1462, 2010. Google ScholarDigital Library
F. Fuhrmann, P. Herrera, and X. Serra. Detecting solo phrases in music using spectral and pitch-related descriptors. Journal of New Music Research, 38(4):343--356, 2009.Google ScholarCross Ref
C. F. Julià and S. Jordà. SongExplorer: a tabletop application for exploring large collections of songs. In Int. Soc. for Music Inf. Retrieval Conf. (ISMIR'09), 2009.Google Scholar
S. Koelsch, S. Skouras, T. Fritz, P. Herrera, C. Bonhage, M. Kuessner, and A. M. Jacobs. Neural correlates of music-evoked fear and joy: The roles of auditory cortex and superficial amygdala. Neuroimage. In press.Google Scholar
K. R. Page, B. Fields, D. De Roure, T. Crawford, and J. S. Downie. Reuse, remix, repeat: the workflows of MIR. In Int. Soc. for Music Inf. Retrieval Conf. (ISMIR'12), 2012.Google Scholar
G. Roma, J. Janer, S. Kersten, M. Schirosa, P. Herrera, and X. Serra. Ecological acoustics perspective for content-based retrieval of environmental sounds. EURASIP Journal on Audio, Speech, and Music Process., 2010. Google ScholarDigital Library
J. Serrà, E. Gómez, P. Herrera, and X. Serra. Chroma binary similarity and local alignment applied to cover song identification. IEEE Trans. on Audio, Speech, and Language Process., 16(6):1138--1151, 2008. Google ScholarDigital Library
M. Sordo. Semantic Annotation of Music Collections: A Computational Approach. PhD thesis, UPF, Barcelona, Spain, 2012.Google Scholar
N. Wack, C. Laurier, O. Meyers, R. Marxer, D. Bogdanov, J. Serra, E. Gomez, and P. Herrera. Music classification using high-level models. In Music Inf. Retrieval Evaluation Exchange (MIREX'10), 2010.Google Scholar

Index Terms

ESSENTIA: an open-source library for sound and music analysis
1. Hardware
  1. Communication hardware, interfaces and storage
    1. Signal processing systems
  2. Robustness
    1. Hardware reliability
      1. Signal integrity and noise analysis

Recommendations

madmom: A New Python Audio and Music Signal Processing Library
MM '16: Proceedings of the 24th ACM international conference on Multimedia

In this paper, we present madmom, an open-source audio processing and music information retrieval (MIR) library written in Python. madmom features a concise, NumPy-compatible, object oriented design with simple calling conventions and sensible default ...
Read More
Sonic visualiser: an open source application for viewing, analysing, and annotating music audio files
MM '10: Proceedings of the 18th ACM international conference on Multimedia

Sonic Visualiser is a friendly and flexible end-user desktop application for analysis, visualisation, and annotation of music audio files. Its stated goal is to be "the first program you reach for when want to study a musical recording rather than ...
Read More
PopMash: an automatic musical-mashup system using computation of musical and lyrical agreement for transitions
Abstract
Musical-mashup is a popular form of music re-creation, aiming at combining multiple pieces of music to create new music artworks. Presently, it is also a challenge in the field of music information study. In this work, an effective framework for ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
MM '13: Proceedings of the 21st ACM international conference on Multimedia
October 2013
1166 pages
ISBN:9781450324045
DOI:10.1145/2502081
General Chairs:
Alejandro (Alex) Jaimes
Yahoo!, Spain
,
Nicu Sebe
University of Trento, Italy
,
Nozha Boujemaa
INRIA, France
,
Program Chairs:
Daniel Gatica-Perez
IDIAP & EPFL, Switzerland
,
David A. Shamma
Yahoo!, USA
,
Marcel Worring
University of Amsterdam, The Netherlands
,
Roger Zimmermann
National University of Singapore, Singapore
Copyright © 2013 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 21 October 2013
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
audio analysis
music information retrieval
open source
signal processing
sound and music computing
Qualifiers
- research-article
Conference

Acceptance Rates
MM '13 Paper Acceptance Rate47of235submissions,20%Overall Acceptance Rate995of4,171submissions,24%
More
Upcoming Conference
MM '24

Sponsor:

sigmm

MM '24: The 32nd ACM International Conference on Multimedia

October 28 - November 1, 2024

Melbourne , VIC , Australia
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 108
  Total Citations
  View Citations
- 772
  Total Downloads
- Downloads (Last 12 months)51
- Downloads (Last 6 weeks)8
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

ESSENTIA: an open-source library for sound and music analysis

MM '13: Proceedings of the 21st ACM international conference on Multimedia

ABSTRACT

References

Cited By

Index Terms

Recommendations

madmom: A New Python Audio and Music Signal Processing Library

Sonic visualiser: an open source application for viewing, analysing, and annotating music audio files

PopMash: an automatic musical-mashup system using computation of musical and lyrical agreement for transitions