Skip to main content
Erschienen in: Neural Computing and Applications 8/2020

06.03.2019 | Original Article

Probabilistic approaches for music similarity using restricted Boltzmann machines

Erschienen in: Neural Computing and Applications | Ausgabe 8/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In music informatics, there has been increasing attention to relative similarity as it plays a central role in music retrieval, recommendation, and musicology. Most approaches for relative similarity are based on distance metric learning, in which similarity relationship is modelled by a parameterised distance function. Normally, these parameters can be learned by solving a constrained optimisation problem using kernel-based methods. In this paper, we study the use of restricted Boltzmann machines (RBMs) in similarity modelling. We take advantage of RBM as a probabilistic neural network to assign a true hypothesis “x is more similar to y than to z” with a higher probability. Such model can be trained by maximising the true hypotheses while, at the same time, minimising the false hypotheses using a stochastic method. Alternatively, we show that learning similarity relations can be done deterministically by minimising the free energy function of a bipolar RBM or using a classification approach. In the experiments, we evaluate our proposed approaches on music scripts extracted from MagnaTagATune dataset. The results show that an energy-based optimisation approach with bipolar RBM can achieve better performance than other methods, including support vector machine and machine learning rank which are the state-of-the-art for this dataset.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Bar-Hillel A, Hertz T, Shental N, Weinshall D (2003) Learning distance functions using equivalence relations. In: ICML, pp 11–18 Bar-Hillel A, Hertz T, Shental N, Weinshall D (2003) Learning distance functions using equivalence relations. In: ICML, pp 11–18
2.
Zurück zum Zitat Bilenko M, Mooney RJ (2003) Adaptive duplicate detection using learnable string similarity measures. In: Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining, KDD ’03. ACM, New York, NY, USA, pp 39–48. https://doi.org/10.1145/956750.956759 Bilenko M, Mooney RJ (2003) Adaptive duplicate detection using learnable string similarity measures. In: Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining, KDD ’03. ACM, New York, NY, USA, pp 39–48. https://​doi.​org/​10.​1145/​956750.​956759
3.
Zurück zum Zitat Bromley J, Guyon I, LeCun Y, Säckinger E, Shah R (1993) Signature verification using a “siamese” time delay neural network. In: Proceedings of the 6th international conference on neural information processing systems, NIPS’93, pp 737–744 Bromley J, Guyon I, LeCun Y, Säckinger E, Shah R (1993) Signature verification using a “siamese” time delay neural network. In: Proceedings of the 6th international conference on neural information processing systems, NIPS’93, pp 737–744
4.
Zurück zum Zitat Carreira-Perpinan MA, Hinton GE (2005) On contrastive divergence learning. In: Proceedings of the tenth international workshop on artificial intelligence and statistics, pp 33–40 Carreira-Perpinan MA, Hinton GE (2005) On contrastive divergence learning. In: Proceedings of the tenth international workshop on artificial intelligence and statistics, pp 33–40
6.
Zurück zum Zitat Cherla S, Tran SN, d’Avila Garcez AS, Weyde T (2017) Generalising the discriminative restricted Boltzmann machines. In: Artificial neural networks and machine learning—ICANN 2017—26th international conference on artificial neural networks, pp 111–119 Cherla S, Tran SN, d’Avila Garcez AS, Weyde T (2017) Generalising the discriminative restricted Boltzmann machines. In: Artificial neural networks and machine learning—ICANN 2017—26th international conference on artificial neural networks, pp 111–119
8.
Zurück zum Zitat Frome A, Singer Y, Sha F, Malik J (2007) Learning globally-consistent local distance functions for shape-based image retrieval and classification. In: ICCV, pp 1–8 Frome A, Singer Y, Sha F, Malik J (2007) Learning globally-consistent local distance functions for shape-based image retrieval and classification. In: ICCV, pp 1–8
10.
Zurück zum Zitat Hoffer E, Ailon N (2015) Deep metric learning using triplet network. In: Feragen A, Pelillo M, Loog M (eds) SIMBAD, lecture notes in computer science, vol 9370. Springer, pp 84–92 Hoffer E, Ailon N (2015) Deep metric learning using triplet network. In: Feragen A, Pelillo M, Loog M (eds) SIMBAD, lecture notes in computer science, vol 9370. Springer, pp 84–92
11.
Zurück zum Zitat Hu N, Dannenberg RB, Lewis AL (2002) A probabilistic model of melodic similarity. In: International computer music conference (ICMC). 2002. The International Computer Music Association, Goteborg, Sweden Hu N, Dannenberg RB, Lewis AL (2002) A probabilistic model of melodic similarity. In: International computer music conference (ICMC). 2002. The International Computer Music Association, Goteborg, Sweden
12.
Zurück zum Zitat Huang A (2008) Similarity measures for text document clustering, pp 49–56 Huang A (2008) Similarity measures for text document clustering, pp 49–56
13.
Zurück zum Zitat Jain P, Kulis B, Dhillon IS, Grauman K (2008) Online metric learning and fast similarity search. In: Koller D, Schuurmans D, Bengio Y, Bottou L (eds) NIPS. Curran Associates, Inc., pp 761–768 Jain P, Kulis B, Dhillon IS, Grauman K (2008) Online metric learning and fast similarity search. In: Koller D, Schuurmans D, Bengio Y, Bottou L (eds) NIPS. Curran Associates, Inc., pp 761–768
14.
15.
Zurück zum Zitat McFee B, Lanckriet GRG (2010) Metric learning to rank. In: ICML, pp 775–782 McFee B, Lanckriet GRG (2010) Metric learning to rank. In: ICML, pp 775–782
16.
Zurück zum Zitat Moghaddam B, Nastar C, Pentland A (1996) A bayesian similarity measure for direct image matching. In: Proceedings of the 13th international conference on pattern recognition—vol 2, ICPR ’96. IEEE Computer Society, Washington, DC, USA, p 350. https://doi.org/10.1109/ICPR.1996.546848 Moghaddam B, Nastar C, Pentland A (1996) A bayesian similarity measure for direct image matching. In: Proceedings of the 13th international conference on pattern recognition—vol 2, ICPR ’96. IEEE Computer Society, Washington, DC, USA, p 350. https://​doi.​org/​10.​1109/​ICPR.​1996.​546848
18.
Zurück zum Zitat Salakhutdinov R, Mnih A, Hinton G (2007) Restricted Boltzmann machines for collaborative filtering. In: Machine learning, proceedings of the twenty-fourth international conference (ICML 2004). ACM, AAAI Press, pp 791–798 Salakhutdinov R, Mnih A, Hinton G (2007) Restricted Boltzmann machines for collaborative filtering. In: Machine learning, proceedings of the twenty-fourth international conference (ICML 2004). ACM, AAAI Press, pp 791–798
19.
Zurück zum Zitat Schultz M, Joachims T (2004) Learning a distance metric from relative comparisons. In: Thrun S, Saul L, Schölkopf B (eds) Advances in neural information processing systems, vol 16. MIT Press, Cambridge Schultz M, Joachims T (2004) Learning a distance metric from relative comparisons. In: Thrun S, Saul L, Schölkopf B (eds) Advances in neural information processing systems, vol 16. MIT Press, Cambridge
21.
Zurück zum Zitat Smolensky P (1986) Information processing in dynamical systems: foundations of harmony theory. In: Rumelhart DE, McClelland JL (eds) Parallel distributed processing. Foundations, vol 1. MIT Press, Cambridge, pp 194–281 Smolensky P (1986) Information processing in dynamical systems: foundations of harmony theory. In: Rumelhart DE, McClelland JL (eds) Parallel distributed processing. Foundations, vol 1. MIT Press, Cambridge, pp 194–281
22.
Zurück zum Zitat Stober S, Nurnberger A (2011) An experimental comparison of similarity adaptation approaches. In: Proceedings of the AMR-11 Stober S, Nurnberger A (2011) An experimental comparison of similarity adaptation approaches. In: Proceedings of the AMR-11
23.
Zurück zum Zitat Tran SN, Wolff D, Weyde T, d’Avila Garcez AS (2014) Feature preprocessing with restricted Boltzmann machines for music similarity learning. In: AES international conference on semantic audio 2014, London, UK, 27–29 Jan 2014 Tran SN, Wolff D, Weyde T, d’Avila Garcez AS (2014) Feature preprocessing with restricted Boltzmann machines for music similarity learning. In: AES international conference on semantic audio 2014, London, UK, 27–29 Jan 2014
25.
Zurück zum Zitat Wolff D, Stober S, Nurnberger A, Weyde T (2012) A systematic comparison of music similarity adaptation approaches. In: 13th international conference on music information retrieval (ISMIR’12) Wolff D, Stober S, Nurnberger A, Weyde T (2012) A systematic comparison of music similarity adaptation approaches. In: 13th international conference on music information retrieval (ISMIR’12)
27.
Zurück zum Zitat Wolff D, Weyde T (2014) Learning music similarity from relative user ratings. Inf Retr 17(2):109–136CrossRef Wolff D, Weyde T (2014) Learning music similarity from relative user ratings. Inf Retr 17(2):109–136CrossRef
Metadaten
Titel
Probabilistic approaches for music similarity using restricted Boltzmann machines
Publikationsdatum
06.03.2019
Erschienen in
Neural Computing and Applications / Ausgabe 8/2020
Print ISSN: 0941-0643
Elektronische ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-019-04106-y

Weitere Artikel der Ausgabe 8/2020

Neural Computing and Applications 8/2020 Zur Ausgabe