Skip to main content

2018 | OriginalPaper | Buchkapitel

13. Application of Tolerance Near Sets to Audio Signal Classification

verfasst von : Ashmeet Singh, Sheela Ramanna

Erschienen in: Advances in Feature Selection for Data and Pattern Recognition

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This chapter is an extension of our work presented where the problem of classifying audio signals using a supervised tolerance class learning algorithm (TCL) based on tolerance near sets was first proposed. In the tolerance near set method(TNS), tolerance classes are directly induced from the data set using a tolerance level and a distance function. The TNS method lends itself to applications where features are real-valued such as image data, audio and video signal data. Extensive experimentation with different audio-video data sets were performed to provide insights into the strengths and weaknesses of the TCL algorithm compared to granular (fuzzy and rough) and classical machine learning algorithms.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Banic, N.: Detection of commercials in video content based on logo presence without its prior knowledge. In: 2012 Proceedings of the 35th International Convention MIPRO, pp. 1713–1718 (2012) Banic, N.: Detection of commercials in video content based on logo presence without its prior knowledge. In: 2012 Proceedings of the 35th International Convention MIPRO, pp. 1713–1718 (2012)
2.
Zurück zum Zitat Bazan, J.G., Szczuka, M.: The rough set exploration system. Springer Transactions on Rough Sets III. LNCS, vol. 3400, pp. 37–56. Springer, Berlin (2005) Bazan, J.G., Szczuka, M.: The rough set exploration system. Springer Transactions on Rough Sets III. LNCS, vol. 3400, pp. 37–56. Springer, Berlin (2005)
3.
Zurück zum Zitat Bergstra, J., Kegl, B.: Meta-features and adaboost for music classification. In: Machine Learning Journal: Special Issue on Machine Learning in Music (2006) Bergstra, J., Kegl, B.: Meta-features and adaboost for music classification. In: Machine Learning Journal: Special Issue on Machine Learning in Music (2006)
4.
Zurück zum Zitat Bhatt, R., Krishnamoorthy, P., Kumar, S.: Efficient general genre video abstraction scheme for embedded devices using pure audio cues. In: 7th International Conference on ICT and Knowledge Engineering, pp. 63–67 (2009) Bhatt, R., Krishnamoorthy, P., Kumar, S.: Efficient general genre video abstraction scheme for embedded devices using pure audio cues. In: 7th International Conference on ICT and Knowledge Engineering, pp. 63–67 (2009)
5.
Zurück zum Zitat Covell, M., Baluja, S., Fink, M.: Advertisement detection and replacement using acoustic and visual repetition. In: 2006 IEEE Workshop on Multimedia Signal Processing, pp. 461–466 (2006) Covell, M., Baluja, S., Fink, M.: Advertisement detection and replacement using acoustic and visual repetition. In: 2006 IEEE Workshop on Multimedia Signal Processing, pp. 461–466 (2006)
6.
Zurück zum Zitat Eyben, F.: Classification by Large Audio Feature Space Extraction, chap. Acoustic Features and Modelling, pp. 9–122. Springer International Publishing (2016) Eyben, F.: Classification by Large Audio Feature Space Extraction, chap. Acoustic Features and Modelling, pp. 9–122. Springer International Publishing (2016)
7.
Zurück zum Zitat Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA data mining software: An update. SIGKDD Explorations 11(1), 10–18 (2009)CrossRef Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA data mining software: An update. SIGKDD Explorations 11(1), 10–18 (2009)CrossRef
8.
Zurück zum Zitat Haque, M.A., Kim, J.M.: An analysis of content-based classification of audio signals using a fuzzy c-means algorithm. Multimedia Tools Appl. 63(1), 77–92 (2013)CrossRef Haque, M.A., Kim, J.M.: An analysis of content-based classification of audio signals using a fuzzy c-means algorithm. Multimedia Tools Appl. 63(1), 77–92 (2013)CrossRef
9.
Zurück zum Zitat Hendrik, S.: Improving genre annotations for the million song dataset. In: 16th International Society for Music Information Retrieval Conference, pp. 63–70 (2015) Hendrik, S.: Improving genre annotations for the million song dataset. In: 16th International Society for Music Information Retrieval Conference, pp. 63–70 (2015)
10.
Zurück zum Zitat Henry, C., Ramanna, S.: Parallel computation in finding near neighbourhoods. In: Proceedings of the 6th International Conference on Rough Sets and Knowledge Technology (RSKT11). Lecture Notes in Computer Science, pp. 523–532. Springer, Berlin (2011) Henry, C., Ramanna, S.: Parallel computation in finding near neighbourhoods. In: Proceedings of the 6th International Conference on Rough Sets and Knowledge Technology (RSKT11). Lecture Notes in Computer Science, pp. 523–532. Springer, Berlin (2011)
11.
Zurück zum Zitat Henry, C., Ramanna, S.: Signature-based perceptual nearness. Application of near sets to image retrieval. Springer J. Math. Comput. Sci. 7, 71–85 (2013) Henry, C., Ramanna, S.: Signature-based perceptual nearness. Application of near sets to image retrieval. Springer J. Math. Comput. Sci. 7, 71–85 (2013)
12.
Zurück zum Zitat Henry, C.J.: Near sets: theory and applications. Ph.D. thesis, University of Manitoba (2011) Henry, C.J.: Near sets: theory and applications. Ph.D. thesis, University of Manitoba (2011)
13.
Zurück zum Zitat Hunt, M., Lennig, M., Mermelstein, P.: Experiments in syllable-based recognition of continuous speech. In: Proceedings of International Conference on Acoustics, Speech and Signal Processing, pp. 880–883 (1996) Hunt, M., Lennig, M., Mermelstein, P.: Experiments in syllable-based recognition of continuous speech. In: Proceedings of International Conference on Acoustics, Speech and Signal Processing, pp. 880–883 (1996)
14.
Zurück zum Zitat Jinsong, Z., Oussalah, M.: Automatic system for music genre classification (2006) (PGMNet2006) Jinsong, Z., Oussalah, M.: Automatic system for music genre classification (2006) (PGMNet2006)
15.
Zurück zum Zitat Kannao, R., Guha, P.: Tv commercial detection using success based locally weighted kernel combination. In: Proceedings on the the International Conference on MultiMedia Modeling, pp. 793–805. Springer International Publishing (2016) Kannao, R., Guha, P.: Tv commercial detection using success based locally weighted kernel combination. In: Proceedings on the the International Conference on MultiMedia Modeling, pp. 793–805. Springer International Publishing (2016)
16.
Zurück zum Zitat Kaur, K., Ramanna, S., Henry, C.: Measuring the nearness of layered flow graphs: application to content based image retrieval. Intell. Decis. Technol. J. 10, 165–181 (2016)CrossRef Kaur, K., Ramanna, S., Henry, C.: Measuring the nearness of layered flow graphs: application to content based image retrieval. Intell. Decis. Technol. J. 10, 165–181 (2016)CrossRef
17.
Zurück zum Zitat Kostek, B.: Perception-based data processing in acoustics, applications to music information retrieval and psychophysiology of hearing. Series on Cognitive Technologies. Springer, Berlin (2005) Kostek, B.: Perception-based data processing in acoustics, applications to music information retrieval and psychophysiology of hearing. Series on Cognitive Technologies. Springer, Berlin (2005)
18.
Zurück zum Zitat Lee, C.H., Shih, J.L., Yu, K.M., Lin, H.S.: Automatic music genre classification based on modulation spectral analysis of spectral and cepstral features. IEEE Trans. Multimed. 11(4), 670–682 (2009)CrossRef Lee, C.H., Shih, J.L., Yu, K.M., Lin, H.S.: Automatic music genre classification based on modulation spectral analysis of spectral and cepstral features. IEEE Trans. Multimed. 11(4), 670–682 (2009)CrossRef
19.
Zurück zum Zitat Lie, L., Hao, J., HongJiang, Z.: A robust audio classification and segmentation method. In: Proceedings of the Ninth ACM International Conference on Multimedia, MULTIMEDIA ’01, pp. 203–211 (2001) Lie, L., Hao, J., HongJiang, Z.: A robust audio classification and segmentation method. In: Proceedings of the Ninth ACM International Conference on Multimedia, MULTIMEDIA ’01, pp. 203–211 (2001)
20.
Zurück zum Zitat Liu, N., Zhao, Y., Zhu, Z., Lu, H.: Exploiting visual-audio-textual characteristics for automatic tv commercial block detection and segmentation. IEEE Trans. Multimed. 13(5), 961–973 (2011)CrossRef Liu, N., Zhao, Y., Zhu, Z., Lu, H.: Exploiting visual-audio-textual characteristics for automatic tv commercial block detection and segmentation. IEEE Trans. Multimed. 13(5), 961–973 (2011)CrossRef
21.
Zurück zum Zitat Logan, B.: Mel frequency cepstral coefficients for music modeling. In: Proceedings of 1st International Conference on Music Information Retrieval, Plymouth, MA (2000) Logan, B.: Mel frequency cepstral coefficients for music modeling. In: Proceedings of 1st International Conference on Music Information Retrieval, Plymouth, MA (2000)
22.
Zurück zum Zitat Bertin-Mahieux, T., Ellis, D.P., Whitman, B., Lamere, P.: The million song dataset. In: Proceedings of the 12th International Conference on Music Information Retrieval (ISMIR 2011), pp. 7–12 (2011) Bertin-Mahieux, T., Ellis, D.P., Whitman, B., Lamere, P.: The million song dataset. In: Proceedings of the 12th International Conference on Music Information Retrieval (ISMIR 2011), pp. 7–12 (2011)
23.
Zurück zum Zitat Mandel, M., Poliner, G., Ellis, D.: Support vector machine active learning for music retrieval. Multimed. Syst. 12(1), 3–13 (2006)CrossRef Mandel, M., Poliner, G., Ellis, D.: Support vector machine active learning for music retrieval. Multimed. Syst. 12(1), 3–13 (2006)CrossRef
24.
Zurück zum Zitat Orłowska, E.: Semantics of Vague Concepts. Applications of rough sets. In: Technical Report 469, Institute for Computer Science. Polish Academy of Sciences (1982) Orłowska, E.: Semantics of Vague Concepts. Applications of rough sets. In: Technical Report 469, Institute for Computer Science. Polish Academy of Sciences (1982)
25.
Zurück zum Zitat Orłowska, E.: Semantics of vague concepts. In: Dorn, G., Weingartner, P. (eds.) Foundations of Logic and Linguistics. Problems and Solutions, pp. 465–482. Plenum Press, London (1985) Orłowska, E.: Semantics of vague concepts. In: Dorn, G., Weingartner, P. (eds.) Foundations of Logic and Linguistics. Problems and Solutions, pp. 465–482. Plenum Press, London (1985)
26.
27.
Zurück zum Zitat Peters, J.: Tolerance near sets and image correspondence. Int. J. Bio-Inspired Comput. 1(4), 239–245 (2009)CrossRef Peters, J.: Tolerance near sets and image correspondence. Int. J. Bio-Inspired Comput. 1(4), 239–245 (2009)CrossRef
28.
Zurück zum Zitat Peters, J.: Corrigenda and addenda: tolerance near sets and image correspondence. Int. J. Bio-Inspired Comput. 2(5), 310–318 (2010)CrossRef Peters, J.: Corrigenda and addenda: tolerance near sets and image correspondence. Int. J. Bio-Inspired Comput. 2(5), 310–318 (2010)CrossRef
29.
Zurück zum Zitat Peters, J., Wasilewski, P.: Tolerance spaces: origins, theoretical aspects and applications. Inf. Sci.: An Int. J. 195(5), 211–225 (2012)CrossRefMATHMathSciNet Peters, J., Wasilewski, P.: Tolerance spaces: origins, theoretical aspects and applications. Inf. Sci.: An Int. J. 195(5), 211–225 (2012)CrossRefMATHMathSciNet
30.
Zurück zum Zitat Peters, J.F.: Near sets. General theory about nearness of objects. Appl. Math. Sci. 1(53), 2609–2629 (2007)MATHMathSciNet Peters, J.F.: Near sets. General theory about nearness of objects. Appl. Math. Sci. 1(53), 2609–2629 (2007)MATHMathSciNet
31.
Zurück zum Zitat Peters, J.F.: Near sets. Special theory about nearness of objects. Fundam. Inf. 75(1–4), 407–433 (2007) Peters, J.F.: Near sets. Special theory about nearness of objects. Fundam. Inf. 75(1–4), 407–433 (2007)
33.
Zurück zum Zitat Peters, J.F., Naimpally, S.: Applications of near sets. Am. Math. Soc. Not. 59(4), 536–542 (2012) Peters, J.F., Naimpally, S.: Applications of near sets. Am. Math. Soc. Not. 59(4), 536–542 (2012)
35.
Zurück zum Zitat Hoffmann, P., Kostek, B.: Music genre recognition in the rough set-based environment. In: Proceedings of 6th International Conference, PReMI 2015, pp. 377–386 (2015) Hoffmann, P., Kostek, B.: Music genre recognition in the rough set-based environment. In: Proceedings of 6th International Conference, PReMI 2015, pp. 377–386 (2015)
36.
Zurück zum Zitat Poincaré, J.: Sur certaines surfaces algébriques; troisième complément ‘a l’analysis situs. Bulletin de la Société de France 30, 49–70 (1902)CrossRefMATHMathSciNet Poincaré, J.: Sur certaines surfaces algébriques; troisième complément ‘a l’analysis situs. Bulletin de la Société de France 30, 49–70 (1902)CrossRefMATHMathSciNet
37.
Zurück zum Zitat Poli, G., Llapa, E., Cecatto, J., Saito, J., Peters, J., Ramanna, S., Nicoletti, M.: Solar flare detection system based on tolerance near sets in a GPU-CUDA framework. Knowl.-Based Syst. J. Elsevier 70, 345–360 (2014)CrossRef Poli, G., Llapa, E., Cecatto, J., Saito, J., Peters, J., Ramanna, S., Nicoletti, M.: Solar flare detection system based on tolerance near sets in a GPU-CUDA framework. Knowl.-Based Syst. J. Elsevier 70, 345–360 (2014)CrossRef
38.
Zurück zum Zitat Ramanna, S., Singh, A.: Tolerance-based approach to audio signal classification. In: Proceedings of 29th Canadian AI Conference, LNAI 9673, pp. 83–88 (2016) Ramanna, S., Singh, A.: Tolerance-based approach to audio signal classification. In: Proceedings of 29th Canadian AI Conference, LNAI 9673, pp. 83–88 (2016)
40.
Zurück zum Zitat Sadlier, D.A., Marlow, S., O’Connor, N.E., Murphy, N.: Automatic TV advertisement detection from MPEG bitstream. In: Proceedings of the 1st International Workshop on Pattern Recognition in Information Systems: In Conjunction with ICEIS 2001, PRIS ’01, pp. 14–25 (2001) Sadlier, D.A., Marlow, S., O’Connor, N.E., Murphy, N.: Automatic TV advertisement detection from MPEG bitstream. In: Proceedings of the 1st International Workshop on Pattern Recognition in Information Systems: In Conjunction with ICEIS 2001, PRIS ’01, pp. 14–25 (2001)
41.
Zurück zum Zitat Sengoz, C., Ramanna, S.: A semi-supervised learning algorithm for web information extraction with tolerance rough sets. In: Proceedings of Active Media Technology 2014. LNCS, vol. 8610, pp. 1–10 (2014) Sengoz, C., Ramanna, S.: A semi-supervised learning algorithm for web information extraction with tolerance rough sets. In: Proceedings of Active Media Technology 2014. LNCS, vol. 8610, pp. 1–10 (2014)
42.
Zurück zum Zitat Sengoz, C., Ramanna, S.: Learning relational facts from the web: a tolerance rough set approach. Pattern Recognit. Lett. Elsevier 67(2), 130–137 (2015)CrossRef Sengoz, C., Ramanna, S.: Learning relational facts from the web: a tolerance rough set approach. Pattern Recognit. Lett. Elsevier 67(2), 130–137 (2015)CrossRef
43.
Zurück zum Zitat Sossinsky, A.B.: Tolerance space theory and some applications. Acta Appl. Math.: Int. Survey J. Appl. Math. Math. Appl. 5(2), 137–167 (1986)CrossRefMathSciNet Sossinsky, A.B.: Tolerance space theory and some applications. Acta Appl. Math.: Int. Survey J. Appl. Math. Math. Appl. 5(2), 137–167 (1986)CrossRefMathSciNet
44.
Zurück zum Zitat Sturm, B.L.: An analysis of the GTZAN music genre dataset. In: Proceedings of the Second International ACM Workshop on Music Information Retrieval with User-centered and Multimodal Strategies, pp. 7–12 (2012) Sturm, B.L.: An analysis of the GTZAN music genre dataset. In: Proceedings of the Second International ACM Workshop on Music Information Retrieval with User-centered and Multimodal Strategies, pp. 7–12 (2012)
45.
Zurück zum Zitat Typke, R., Wiering, F., Veltkamp, R.C.: A survey of music information retrieval systems. In: ISMIR 2005, 6th International Conference on Music Information Retrieval, London, UK, 11–15 September 2005, Proceedings, pp. 153–160 (2005) Typke, R., Wiering, F., Veltkamp, R.C.: A survey of music information retrieval systems. In: ISMIR 2005, 6th International Conference on Music Information Retrieval, London, UK, 11–15 September 2005, Proceedings, pp. 153–160 (2005)
46.
Zurück zum Zitat Tzanetakis, G., Cook, P.: Musical genre classification of audio signals. IEEE Trans. Speech Audio Process. 10(5), 293–302 (2002)CrossRef Tzanetakis, G., Cook, P.: Musical genre classification of audio signals. IEEE Trans. Speech Audio Process. 10(5), 293–302 (2002)CrossRef
47.
Zurück zum Zitat Vyas, A., Kannao, R., Bhargava, V., Guha, P.: Commercial block detection in broadcast news videos. In: Proceedings of the 2014 Indian Conference on Computer Vision Graphics and Image Processing, No. 63 in ICVGIP ’14, pp. 63:1–63:7 (2014) Vyas, A., Kannao, R., Bhargava, V., Guha, P.: Commercial block detection in broadcast news videos. In: Proceedings of the 2014 Indian Conference on Computer Vision Graphics and Image Processing, No. 63 in ICVGIP ’14, pp. 63:1–63:7 (2014)
48.
Zurück zum Zitat Wasilewski, P., Peters, J.F., Ramanna, S.: Perceptual tolerance intersection. Trans. Rough Sets XIII Springer LNCS 6499, 159–174 (2011)CrossRefMATH Wasilewski, P., Peters, J.F., Ramanna, S.: Perceptual tolerance intersection. Trans. Rough Sets XIII Springer LNCS 6499, 159–174 (2011)CrossRefMATH
49.
Zurück zum Zitat Wold, E., Blum, T., Keislar, D., Wheaton, J.: Content-based classification, search, and retrieval of audio. IEEE Multimed. 3(2), 27–36 (1996)CrossRef Wold, E., Blum, T., Keislar, D., Wheaton, J.: Content-based classification, search, and retrieval of audio. IEEE Multimed. 3(2), 27–36 (1996)CrossRef
50.
Zurück zum Zitat Wolski, M.: Perception and classification. A note on near sets and rough sets. Fundam. Inf. 101, 143–155 (2010)MATHMathSciNet Wolski, M.: Perception and classification. A note on near sets and rough sets. Fundam. Inf. 101, 143–155 (2010)MATHMathSciNet
51.
52.
Zurück zum Zitat Zeeman, E.: The topology of the brain and visual perception. University of Georgia Institute Conference Proceedings, pp. 240–256 (1962); Fort, M.K., Jr. (ed.) Topology of 3-Manifolds and Related Topics. Prentice-Hall, Inc Zeeman, E.: The topology of the brain and visual perception. University of Georgia Institute Conference Proceedings, pp. 240–256 (1962); Fort, M.K., Jr. (ed.) Topology of 3-Manifolds and Related Topics. Prentice-Hall, Inc
53.
Zurück zum Zitat Zeeman, E., Buneman, O.: Tolerance spaces and the brain. In: Towards a Theoretical Biology, vol. 1, pp. 140–151; Published in Waddington, C.H. (ed.) Towards a Theoretical Biology. The Origin of Life, Aldine Pub, Co. (1968) Zeeman, E., Buneman, O.: Tolerance spaces and the brain. In: Towards a Theoretical Biology, vol. 1, pp. 140–151; Published in Waddington, C.H. (ed.) Towards a Theoretical Biology. The Origin of Life, Aldine Pub, Co. (1968)
54.
Zurück zum Zitat Zhang, T., Kuo, C.: Content-Based Audio Classification and Retrieval for Audiovisual Data Parsing. Kluwer Academic Publishers, Norwell (2001)CrossRefMATH Zhang, T., Kuo, C.: Content-Based Audio Classification and Retrieval for Audiovisual Data Parsing. Kluwer Academic Publishers, Norwell (2001)CrossRefMATH
55.
Zurück zum Zitat Zhouyu, F., Guojun, L., Kai, M., Dengsheng, Z.: A survey of audio-based music classification and annotation. IEEE Trans. Multimed. 13(2), 303–319 (2011)CrossRef Zhouyu, F., Guojun, L., Kai, M., Dengsheng, Z.: A survey of audio-based music classification and annotation. IEEE Trans. Multimed. 13(2), 303–319 (2011)CrossRef
56.
Zurück zum Zitat Zwan, P., Kostek, B., Kupryjanow, A.: Automatic classification of musical audio signals employing machine learning approach. Audio Engineering Society Convention, vol. 130, pp. 2–11 (2011) Zwan, P., Kostek, B., Kupryjanow, A.: Automatic classification of musical audio signals employing machine learning approach. Audio Engineering Society Convention, vol. 130, pp. 2–11 (2011)
57.
Zurück zum Zitat Zwicker, E., Fastl, H.: Psychoacoustics, facts and models. Springer Series in Information Sciences. Springer, Berlin (1990) Zwicker, E., Fastl, H.: Psychoacoustics, facts and models. Springer Series in Information Sciences. Springer, Berlin (1990)
Metadaten
Titel
Application of Tolerance Near Sets to Audio Signal Classification
verfasst von
Ashmeet Singh
Sheela Ramanna
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-319-67588-6_13