Skip to main content
Erschienen in: Information Systems Frontiers 6/2018

29.08.2017

TV-Program Retrieval and Classification: A Comparison of Approaches based on Machine Learning

verfasst von: Fedelucio Narducci, Cataldo Musto, Marco de Gemmis, Pasquale Lops, Giovanni Semeraro

Erschienen in: Information Systems Frontiers | Ausgabe 6/2018

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Electronic Program Guides (EPGs) are systems that allow users of media applications, such as web TVs, to navigate scheduling information about current and upcoming programming. Personalized EPGs help users to overcome information overload in this domain, by exploiting recommender systems that automatically compile lists of novel and diverse video assets, based on implicitly or explicitly defined user preferences. In this paper we introduce the concept of personal channel, on which Personalized EPGs are grounded, that provides users with potentially interesting programs and videos, by exploiting program genres (documentary, sports, …) and short textual descriptions of programs to find and categorize them. We investigate the problem of adopting appropriate algorithms for TV-program classification and retrieval, in the context of building personal channels, which is harder than a classical retrieval or classification task because of the short text available. The approach proposed to overcome this problem is the adoption of a new feature generation technique that enriches the textual program descriptions with additional features extracted from Wikipedia. Results of the experiments show that our approach actually improves the retrieval performance, while a limited positive effect is observed on classification accuracy.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Amatriain, X., & Basilico, J. (2015). Recommender systems in industry: A netflix case study. In Ricci, F., Rokach, L., Shapira, B., Kantor, P. B. (Ed.), Recommender systems handbook (pp. 385–419). Springer. Amatriain, X., & Basilico, J. (2015). Recommender systems in industry: A netflix case study. In Ricci, F., Rokach, L., Shapira, B., Kantor, P. B. (Ed.), Recommender systems handbook (pp. 385–419). Springer.
Zurück zum Zitat Baeza-Yates, R., & Ribeiro-Neto, B. (1999). Modern information retrieval. Boston: Addison-Wesley Longman Publishing Co., Inc. Baeza-Yates, R., & Ribeiro-Neto, B. (1999). Modern information retrieval. Boston: Addison-Wesley Longman Publishing Co., Inc.
Zurück zum Zitat Cremonesi, P., Turrin, R., & Airoldi, F. (2011). Hybrid algorithms for recommending new items. In Proceedings of the 2nd international workshop on information heterogeneity and fusion in recommender systems, Hetrec ’11 (pp. 33–40). New York: ACM. https://doi.org/10.1145/2039320.2039325. Cremonesi, P., Turrin, R., & Airoldi, F. (2011). Hybrid algorithms for recommending new items. In Proceedings of the 2nd international workshop on information heterogeneity and fusion in recommender systems, Hetrec ’11 (pp. 33–40). New York: ACM. https://​doi.​org/​10.​1145/​2039320.​2039325.
Zurück zum Zitat Davidson, J., Liebald, B., Liu, J., Nandy, P., Van Vleet, T., Gargi, U., Gupta, S., He, Y., Lambert, M., Livingston, B., & Sampath, D. (2010). The YouTube video recommendation system. In Proceedings of the fourth ACM conference on recommender systems, recsys ’10 (pp. 293–296). New York: ACM. https://doi.org/10.1145/1864708.1864770. Davidson, J., Liebald, B., Liu, J., Nandy, P., Van Vleet, T., Gargi, U., Gupta, S., He, Y., Lambert, M., Livingston, B., & Sampath, D. (2010). The YouTube video recommendation system. In Proceedings of the fourth ACM conference on recommender systems, recsys ’10 (pp. 293–296). New York: ACM. https://​doi.​org/​10.​1145/​1864708.​1864770.
Zurück zum Zitat Deerwester, S.C., Dumais, S.T., Landauer, T.K., Furnas, G.W., & Harshman, R.A. (1990). Indexing by latent semantic analysis. Journal of the American Society of Information Science, 41, 391–407.CrossRef Deerwester, S.C., Dumais, S.T., Landauer, T.K., Furnas, G.W., & Harshman, R.A. (1990). Indexing by latent semantic analysis. Journal of the American Society of Information Science, 41, 391–407.CrossRef
Zurück zum Zitat de Gemmis, M., Lops, P., Musto, C., Narducci, F., & Semeraro, G. (2015). Semantics-aware content-based recommender systems. In F. Ricci, L. Rokach, B. Shapira, & P. B. Kantor (Eds.) Recommender Systems Handbook (pp. 119–159). Springer. de Gemmis, M., Lops, P., Musto, C., Narducci, F., & Semeraro, G. (2015). Semantics-aware content-based recommender systems. In F. Ricci, L. Rokach, B. Shapira, & P. B. Kantor (Eds.) Recommender Systems Handbook (pp. 119–159). Springer.
Zurück zum Zitat Deldjoo, Y., Elahi, M., Cremonesi, P., Garzotto, F., Piazzolla, P., & Quadrana, M. (2016). Content-based video recommendation system based on stylistic visual features. Journal of Data Semantics, 2016 (online version), 1–15. Deldjoo, Y., Elahi, M., Cremonesi, P., Garzotto, F., Piazzolla, P., & Quadrana, M. (2016). Content-based video recommendation system based on stylistic visual features. Journal of Data Semantics, 2016 (online version), 1–15.
Zurück zum Zitat Ehrmantraut, M., Härder, T., Wittig, H., & Steinmetz, R. (1996). The personal electronic program guide - towards the pre-selection of individual TV programs. In Proceedings of the fifth international conference on information and knowledge management, CIKM ’96 (pp. 243–250). New York: ACM. https://doi.org/10.1145/238355.238505. Ehrmantraut, M., Härder, T., Wittig, H., & Steinmetz, R. (1996). The personal electronic program guide - towards the pre-selection of individual TV programs. In Proceedings of the fifth international conference on information and knowledge management, CIKM ’96 (pp. 243–250). New York: ACM. https://​doi.​org/​10.​1145/​238355.​238505.
Zurück zum Zitat Ferragina, P., & Scaiella, U. (2010). TAGME: On-the-fly annotation of short text fragments (by wikipedia entities). In Proceedings of the 19th ACM conference on information and knowledge management, CIKM 2010 (pp. 1625–1628). Toronto: ACM. https://doi.org/10.1145/1871437.1871689. Ferragina, P., & Scaiella, U. (2010). TAGME: On-the-fly annotation of short text fragments (by wikipedia entities). In Proceedings of the 19th ACM conference on information and knowledge management, CIKM 2010 (pp. 1625–1628). Toronto: ACM. https://​doi.​org/​10.​1145/​1871437.​1871689.
Zurück zum Zitat Gabrilovich, E., & Markovitch, S. (2006). Overcoming the brittleness bottleneck using Wikipedia: enhancing text categorization with encyclopedic knowledge, AAAI’06. In Proceedings of the 21st national conference on artificial intelligence (pp. 1301–1306): AAAI press. Gabrilovich, E., & Markovitch, S. (2006). Overcoming the brittleness bottleneck using Wikipedia: enhancing text categorization with encyclopedic knowledge, AAAI’06. In Proceedings of the 21st national conference on artificial intelligence (pp. 1301–1306): AAAI press.
Zurück zum Zitat Harris, Z.S. (1968). Mathematical structures of language. New York: Interscience. Harris, Z.S. (1968). Mathematical structures of language. New York: Interscience.
Zurück zum Zitat Joachims, T. (1998). Text categorization with support vector machines: learning with many relevant features. In C. Nédellec, & C. Rouveirol (Eds.) Proceedings of ECML-98, 10th European conference on machine learning, lecture notes in artificial intelligence, (Vol. 1398 pp. 137–142). Heidelberg: Springer. http://www.joachims98.ps. Joachims, T. (1998). Text categorization with support vector machines: learning with many relevant features. In C. Nédellec, & C. Rouveirol (Eds.) Proceedings of ECML-98, 10th European conference on machine learning, lecture notes in artificial intelligence, (Vol. 1398 pp. 137–142). Heidelberg: Springer. http://​www.​joachims98.​ps.
Zurück zum Zitat Kennedy, L., & Hauptmann, A. (2006). LSCOM Lexicon definitions and annotations version 1.0, DTO challenge workshop on large scale concept ontology for multimedia, Tech. rep., Columbia University. Kennedy, L., & Hauptmann, A. (2006). LSCOM Lexicon definitions and annotations version 1.0, DTO challenge workshop on large scale concept ontology for multimedia, Tech. rep., Columbia University.
Zurück zum Zitat Ko, H.G., Kim, E., Ko, I.Y., & Chang, D. (2014). Semantically-based recommendation by using semantic clusters of users’ viewing history. In International conference on big data and smart computing (BIGCOMP) (pp. 83–87). IEEE. Ko, H.G., Kim, E., Ko, I.Y., & Chang, D. (2014). Semantically-based recommendation by using semantic clusters of users’ viewing history. In International conference on big data and smart computing (BIGCOMP) (pp. 83–87). IEEE.
Zurück zum Zitat Lowe, W. (2001). Towards a theory of semantic space. In Proceedings of the 23rd annual meeting of the cognitive science society (pp. 576–581). Lowe, W. (2001). Towards a theory of semantic space. In Proceedings of the 23rd annual meeting of the cognitive science society (pp. 576–581).
Zurück zum Zitat Mohammad, S., & Hirst, G. (2012). Distributional measures of semantic distance: a survey. CoRR arXiv:1203.1858. Mohammad, S., & Hirst, G. (2012). Distributional measures of semantic distance: a survey. CoRR arXiv:1203.​1858.
Zurück zum Zitat Musto, C., Narducci, F., Lops, P., Semeraro, G., de Gemmis, M., Barbieri, M., Korst, J., Pronk, V., & Clout, R. (2012). Enhanced semantic TV-show representation for personalized electronic program guides. In Masthoff, J., Mobasher, B., Desmarais, M.C. & Nkambou, R. (Eds.) Proceedings of user modeling, adaptation, and personalization: 20th international conference, UMAP 2012, Montreal, Canada, July 16-20, 2012 (pp. 88–199). Berlin: Springer. ISBN 978-3-642-31454-4. https://doi.org/10.1007/978-3-642-31454-4_16.CrossRef Musto, C., Narducci, F., Lops, P., Semeraro, G., de Gemmis, M., Barbieri, M., Korst, J., Pronk, V., & Clout, R. (2012). Enhanced semantic TV-show representation for personalized electronic program guides. In Masthoff, J., Mobasher, B., Desmarais, M.C. & Nkambou, R. (Eds.) Proceedings of user modeling, adaptation, and personalization: 20th international conference, UMAP 2012, Montreal, Canada, July 16-20, 2012 (pp. 88–199). Berlin: Springer. ISBN 978-3-642-31454-4. https://​doi.​org/​10.​1007/​978-3-642-31454-4_​16.CrossRef
Zurück zum Zitat Musto, C., Semeraro, G., Lops, P., & de Gemmis, M. (2011). Random indexing and negative user preferences for enhancing content-based recommender systems. In E-commerce and web technologies - 12th international conference, EC-web 2011, Toulouse, France, August 30–September 1, 2011. Proceedings, lecture notes in business information processing Vol. 85 (pp. 270–281). Springer. Musto, C., Semeraro, G., Lops, P., & de Gemmis, M. (2011). Random indexing and negative user preferences for enhancing content-based recommender systems. In E-commerce and web technologies - 12th international conference, EC-web 2011, Toulouse, France, August 30–September 1, 2011. Proceedings, lecture notes in business information processing Vol. 85 (pp. 270–281). Springer.
Zurück zum Zitat Porter, M. (1980). An algorithm for suffix stripping. Program, 14(3), 130–137.CrossRef Porter, M. (1980). An algorithm for suffix stripping. Program, 14(3), 130–137.CrossRef
Zurück zum Zitat Rocchio, J. (1971). Relevance feedback in information retrieval. In G. Salton (Ed.) The SMART retrieval system (pp. 313–323). Rocchio, J. (1971). Relevance feedback in information retrieval. In G. Salton (Ed.) The SMART retrieval system (pp. 313–323).
Zurück zum Zitat Sebastiani, F. (2002). Machine learning in automated text categorization. ACM, Computing Surveys, 34(1), 1–47.CrossRef Sebastiani, F. (2002). Machine learning in automated text categorization. ACM, Computing Surveys, 34(1), 1–47.CrossRef
Zurück zum Zitat Smeaton, A.F., Murphy, N., O’Connor, N.E., Marlow, S., Lee, H., McDonald, K., Browne, P., & Ye, J. (2001). The Físchlár digital video system: a digital library of broadcast TV programmes. In Proceedings of the 1st ACM/IEEE-CS joint conference on Digital libraries (pp. 312–313). ACM. Smeaton, A.F., Murphy, N., O’Connor, N.E., Marlow, S., Lee, H., McDonald, K., Browne, P., & Ye, J. (2001). The Físchlár digital video system: a digital library of broadcast TV programmes. In Proceedings of the 1st ACM/IEEE-CS joint conference on Digital libraries (pp. 312–313). ACM.
Zurück zum Zitat Smyth, B., & Cotter, P. (2001). Personalized electronic programme guides. Artificial Intelligence Magazine, 22(2), 89–98.CrossRef Smyth, B., & Cotter, P. (2001). Personalized electronic programme guides. Artificial Intelligence Magazine, 22(2), 89–98.CrossRef
Zurück zum Zitat Snoek, C.G.M., Worring, M., van Gemert, J.C., Geusebroek, J.M., & Smeulders, A.W.M. (2006). The challenge problem for automated detection of 101 semantic concepts in multimedia. In Proceedings of the 14th annual ACM international conference on Multimedia, MULTIMEDIA ’06 (pp. 421–430). New York: ACM. https://doi.org/10.1145/1180639.1180727. Snoek, C.G.M., Worring, M., van Gemert, J.C., Geusebroek, J.M., & Smeulders, A.W.M. (2006). The challenge problem for automated detection of 101 semantic concepts in multimedia. In Proceedings of the 14th annual ACM international conference on Multimedia, MULTIMEDIA ’06 (pp. 421–430). New York: ACM. https://​doi.​org/​10.​1145/​1180639.​1180727.
Zurück zum Zitat Turney, P., & Pantel, P. (2010). From frequency to meaning: Vector space models of semantics. Journal of Artificial Intelligence Research (JAIR), 37, 141–188.CrossRef Turney, P., & Pantel, P. (2010). From frequency to meaning: Vector space models of semantics. Journal of Artificial Intelligence Research (JAIR), 37, 141–188.CrossRef
Zurück zum Zitat Xu, J., Zhang, L.J., Lu, H., & Li, Y. (2002). The development and prospect of personalized TV program recommendation systems. In Proceedings of the fourth IEEE international symposium on multimedia software engineering, MSE ’02 (p. 82). Washington: IEEE Computer Society. http://dl.acm.org/citation.cfm?id=824463.824803. Xu, J., Zhang, L.J., Lu, H., & Li, Y. (2002). The development and prospect of personalized TV program recommendation systems. In Proceedings of the fourth IEEE international symposium on multimedia software engineering, MSE ’02 (p. 82). Washington: IEEE Computer Society. http://​dl.​acm.​org/​citation.​cfm?​id=​824463.​824803.
Zurück zum Zitat Yanagawa, A., Chang, S.F., Kennedy, L., & Hsu, W. (2007). Columbia University’s baseline detectors for 374 LSCOM semantic visual concepts. Columbia University ADVENT Technical Report. Yanagawa, A., Chang, S.F., Kennedy, L., & Hsu, W. (2007). Columbia University’s baseline detectors for 374 LSCOM semantic visual concepts. Columbia University ADVENT Technical Report.
Zurück zum Zitat Yuan, Y. (2003). Research on video classification and retrieval. Xi’an: Ph.D. thesis, School of Electronic and Information Engineering, Xi’an Jiaotong University. Yuan, Y. (2003). Research on video classification and retrieval. Xi’an: Ph.D. thesis, School of Electronic and Information Engineering, Xi’an Jiaotong University.
Metadaten
Titel
TV-Program Retrieval and Classification: A Comparison of Approaches based on Machine Learning
verfasst von
Fedelucio Narducci
Cataldo Musto
Marco de Gemmis
Pasquale Lops
Giovanni Semeraro
Publikationsdatum
29.08.2017
Verlag
Springer US
Erschienen in
Information Systems Frontiers / Ausgabe 6/2018
Print ISSN: 1387-3326
Elektronische ISSN: 1572-9419
DOI
https://doi.org/10.1007/s10796-017-9780-0

Weitere Artikel der Ausgabe 6/2018

Information Systems Frontiers 6/2018 Zur Ausgabe

Premium Partner