Skip to main content
Top
Published in: Information Systems Frontiers 6/2018

29-08-2017

TV-Program Retrieval and Classification: A Comparison of Approaches based on Machine Learning

Authors: Fedelucio Narducci, Cataldo Musto, Marco de Gemmis, Pasquale Lops, Giovanni Semeraro

Published in: Information Systems Frontiers | Issue 6/2018

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Electronic Program Guides (EPGs) are systems that allow users of media applications, such as web TVs, to navigate scheduling information about current and upcoming programming. Personalized EPGs help users to overcome information overload in this domain, by exploiting recommender systems that automatically compile lists of novel and diverse video assets, based on implicitly or explicitly defined user preferences. In this paper we introduce the concept of personal channel, on which Personalized EPGs are grounded, that provides users with potentially interesting programs and videos, by exploiting program genres (documentary, sports, …) and short textual descriptions of programs to find and categorize them. We investigate the problem of adopting appropriate algorithms for TV-program classification and retrieval, in the context of building personal channels, which is harder than a classical retrieval or classification task because of the short text available. The approach proposed to overcome this problem is the adoption of a new feature generation technique that enriches the textual program descriptions with additional features extracted from Wikipedia. Results of the experiments show that our approach actually improves the retrieval performance, while a limited positive effect is observed on classification accuracy.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
go back to reference Amatriain, X., & Basilico, J. (2015). Recommender systems in industry: A netflix case study. In Ricci, F., Rokach, L., Shapira, B., Kantor, P. B. (Ed.), Recommender systems handbook (pp. 385–419). Springer. Amatriain, X., & Basilico, J. (2015). Recommender systems in industry: A netflix case study. In Ricci, F., Rokach, L., Shapira, B., Kantor, P. B. (Ed.), Recommender systems handbook (pp. 385–419). Springer.
go back to reference Baeza-Yates, R., & Ribeiro-Neto, B. (1999). Modern information retrieval. Boston: Addison-Wesley Longman Publishing Co., Inc. Baeza-Yates, R., & Ribeiro-Neto, B. (1999). Modern information retrieval. Boston: Addison-Wesley Longman Publishing Co., Inc.
go back to reference Cremonesi, P., Turrin, R., & Airoldi, F. (2011). Hybrid algorithms for recommending new items. In Proceedings of the 2nd international workshop on information heterogeneity and fusion in recommender systems, Hetrec ’11 (pp. 33–40). New York: ACM. https://doi.org/10.1145/2039320.2039325. Cremonesi, P., Turrin, R., & Airoldi, F. (2011). Hybrid algorithms for recommending new items. In Proceedings of the 2nd international workshop on information heterogeneity and fusion in recommender systems, Hetrec ’11 (pp. 33–40). New York: ACM. https://​doi.​org/​10.​1145/​2039320.​2039325.
go back to reference Davidson, J., Liebald, B., Liu, J., Nandy, P., Van Vleet, T., Gargi, U., Gupta, S., He, Y., Lambert, M., Livingston, B., & Sampath, D. (2010). The YouTube video recommendation system. In Proceedings of the fourth ACM conference on recommender systems, recsys ’10 (pp. 293–296). New York: ACM. https://doi.org/10.1145/1864708.1864770. Davidson, J., Liebald, B., Liu, J., Nandy, P., Van Vleet, T., Gargi, U., Gupta, S., He, Y., Lambert, M., Livingston, B., & Sampath, D. (2010). The YouTube video recommendation system. In Proceedings of the fourth ACM conference on recommender systems, recsys ’10 (pp. 293–296). New York: ACM. https://​doi.​org/​10.​1145/​1864708.​1864770.
go back to reference Deerwester, S.C., Dumais, S.T., Landauer, T.K., Furnas, G.W., & Harshman, R.A. (1990). Indexing by latent semantic analysis. Journal of the American Society of Information Science, 41, 391–407.CrossRef Deerwester, S.C., Dumais, S.T., Landauer, T.K., Furnas, G.W., & Harshman, R.A. (1990). Indexing by latent semantic analysis. Journal of the American Society of Information Science, 41, 391–407.CrossRef
go back to reference de Gemmis, M., Lops, P., Musto, C., Narducci, F., & Semeraro, G. (2015). Semantics-aware content-based recommender systems. In F. Ricci, L. Rokach, B. Shapira, & P. B. Kantor (Eds.) Recommender Systems Handbook (pp. 119–159). Springer. de Gemmis, M., Lops, P., Musto, C., Narducci, F., & Semeraro, G. (2015). Semantics-aware content-based recommender systems. In F. Ricci, L. Rokach, B. Shapira, & P. B. Kantor (Eds.) Recommender Systems Handbook (pp. 119–159). Springer.
go back to reference Deldjoo, Y., Elahi, M., Cremonesi, P., Garzotto, F., Piazzolla, P., & Quadrana, M. (2016). Content-based video recommendation system based on stylistic visual features. Journal of Data Semantics, 2016 (online version), 1–15. Deldjoo, Y., Elahi, M., Cremonesi, P., Garzotto, F., Piazzolla, P., & Quadrana, M. (2016). Content-based video recommendation system based on stylistic visual features. Journal of Data Semantics, 2016 (online version), 1–15.
go back to reference Ehrmantraut, M., Härder, T., Wittig, H., & Steinmetz, R. (1996). The personal electronic program guide - towards the pre-selection of individual TV programs. In Proceedings of the fifth international conference on information and knowledge management, CIKM ’96 (pp. 243–250). New York: ACM. https://doi.org/10.1145/238355.238505. Ehrmantraut, M., Härder, T., Wittig, H., & Steinmetz, R. (1996). The personal electronic program guide - towards the pre-selection of individual TV programs. In Proceedings of the fifth international conference on information and knowledge management, CIKM ’96 (pp. 243–250). New York: ACM. https://​doi.​org/​10.​1145/​238355.​238505.
go back to reference Ferragina, P., & Scaiella, U. (2010). TAGME: On-the-fly annotation of short text fragments (by wikipedia entities). In Proceedings of the 19th ACM conference on information and knowledge management, CIKM 2010 (pp. 1625–1628). Toronto: ACM. https://doi.org/10.1145/1871437.1871689. Ferragina, P., & Scaiella, U. (2010). TAGME: On-the-fly annotation of short text fragments (by wikipedia entities). In Proceedings of the 19th ACM conference on information and knowledge management, CIKM 2010 (pp. 1625–1628). Toronto: ACM. https://​doi.​org/​10.​1145/​1871437.​1871689.
go back to reference Gabrilovich, E., & Markovitch, S. (2006). Overcoming the brittleness bottleneck using Wikipedia: enhancing text categorization with encyclopedic knowledge, AAAI’06. In Proceedings of the 21st national conference on artificial intelligence (pp. 1301–1306): AAAI press. Gabrilovich, E., & Markovitch, S. (2006). Overcoming the brittleness bottleneck using Wikipedia: enhancing text categorization with encyclopedic knowledge, AAAI’06. In Proceedings of the 21st national conference on artificial intelligence (pp. 1301–1306): AAAI press.
go back to reference Harris, Z.S. (1968). Mathematical structures of language. New York: Interscience. Harris, Z.S. (1968). Mathematical structures of language. New York: Interscience.
go back to reference Joachims, T. (1998). Text categorization with support vector machines: learning with many relevant features. In C. Nédellec, & C. Rouveirol (Eds.) Proceedings of ECML-98, 10th European conference on machine learning, lecture notes in artificial intelligence, (Vol. 1398 pp. 137–142). Heidelberg: Springer. http://www.joachims98.ps. Joachims, T. (1998). Text categorization with support vector machines: learning with many relevant features. In C. Nédellec, & C. Rouveirol (Eds.) Proceedings of ECML-98, 10th European conference on machine learning, lecture notes in artificial intelligence, (Vol. 1398 pp. 137–142). Heidelberg: Springer. http://​www.​joachims98.​ps.
go back to reference Kennedy, L., & Hauptmann, A. (2006). LSCOM Lexicon definitions and annotations version 1.0, DTO challenge workshop on large scale concept ontology for multimedia, Tech. rep., Columbia University. Kennedy, L., & Hauptmann, A. (2006). LSCOM Lexicon definitions and annotations version 1.0, DTO challenge workshop on large scale concept ontology for multimedia, Tech. rep., Columbia University.
go back to reference Ko, H.G., Kim, E., Ko, I.Y., & Chang, D. (2014). Semantically-based recommendation by using semantic clusters of users’ viewing history. In International conference on big data and smart computing (BIGCOMP) (pp. 83–87). IEEE. Ko, H.G., Kim, E., Ko, I.Y., & Chang, D. (2014). Semantically-based recommendation by using semantic clusters of users’ viewing history. In International conference on big data and smart computing (BIGCOMP) (pp. 83–87). IEEE.
go back to reference Lowe, W. (2001). Towards a theory of semantic space. In Proceedings of the 23rd annual meeting of the cognitive science society (pp. 576–581). Lowe, W. (2001). Towards a theory of semantic space. In Proceedings of the 23rd annual meeting of the cognitive science society (pp. 576–581).
go back to reference Mohammad, S., & Hirst, G. (2012). Distributional measures of semantic distance: a survey. CoRR arXiv:1203.1858. Mohammad, S., & Hirst, G. (2012). Distributional measures of semantic distance: a survey. CoRR arXiv:1203.​1858.
go back to reference Musto, C., Narducci, F., Lops, P., Semeraro, G., de Gemmis, M., Barbieri, M., Korst, J., Pronk, V., & Clout, R. (2012). Enhanced semantic TV-show representation for personalized electronic program guides. In Masthoff, J., Mobasher, B., Desmarais, M.C. & Nkambou, R. (Eds.) Proceedings of user modeling, adaptation, and personalization: 20th international conference, UMAP 2012, Montreal, Canada, July 16-20, 2012 (pp. 88–199). Berlin: Springer. ISBN 978-3-642-31454-4. https://doi.org/10.1007/978-3-642-31454-4_16.CrossRef Musto, C., Narducci, F., Lops, P., Semeraro, G., de Gemmis, M., Barbieri, M., Korst, J., Pronk, V., & Clout, R. (2012). Enhanced semantic TV-show representation for personalized electronic program guides. In Masthoff, J., Mobasher, B., Desmarais, M.C. & Nkambou, R. (Eds.) Proceedings of user modeling, adaptation, and personalization: 20th international conference, UMAP 2012, Montreal, Canada, July 16-20, 2012 (pp. 88–199). Berlin: Springer. ISBN 978-3-642-31454-4. https://​doi.​org/​10.​1007/​978-3-642-31454-4_​16.CrossRef
go back to reference Musto, C., Semeraro, G., Lops, P., & de Gemmis, M. (2011). Random indexing and negative user preferences for enhancing content-based recommender systems. In E-commerce and web technologies - 12th international conference, EC-web 2011, Toulouse, France, August 30–September 1, 2011. Proceedings, lecture notes in business information processing Vol. 85 (pp. 270–281). Springer. Musto, C., Semeraro, G., Lops, P., & de Gemmis, M. (2011). Random indexing and negative user preferences for enhancing content-based recommender systems. In E-commerce and web technologies - 12th international conference, EC-web 2011, Toulouse, France, August 30–September 1, 2011. Proceedings, lecture notes in business information processing Vol. 85 (pp. 270–281). Springer.
go back to reference Porter, M. (1980). An algorithm for suffix stripping. Program, 14(3), 130–137.CrossRef Porter, M. (1980). An algorithm for suffix stripping. Program, 14(3), 130–137.CrossRef
go back to reference Rocchio, J. (1971). Relevance feedback in information retrieval. In G. Salton (Ed.) The SMART retrieval system (pp. 313–323). Rocchio, J. (1971). Relevance feedback in information retrieval. In G. Salton (Ed.) The SMART retrieval system (pp. 313–323).
go back to reference Sebastiani, F. (2002). Machine learning in automated text categorization. ACM, Computing Surveys, 34(1), 1–47.CrossRef Sebastiani, F. (2002). Machine learning in automated text categorization. ACM, Computing Surveys, 34(1), 1–47.CrossRef
go back to reference Smeaton, A.F., Murphy, N., O’Connor, N.E., Marlow, S., Lee, H., McDonald, K., Browne, P., & Ye, J. (2001). The Físchlár digital video system: a digital library of broadcast TV programmes. In Proceedings of the 1st ACM/IEEE-CS joint conference on Digital libraries (pp. 312–313). ACM. Smeaton, A.F., Murphy, N., O’Connor, N.E., Marlow, S., Lee, H., McDonald, K., Browne, P., & Ye, J. (2001). The Físchlár digital video system: a digital library of broadcast TV programmes. In Proceedings of the 1st ACM/IEEE-CS joint conference on Digital libraries (pp. 312–313). ACM.
go back to reference Smyth, B., & Cotter, P. (2001). Personalized electronic programme guides. Artificial Intelligence Magazine, 22(2), 89–98.CrossRef Smyth, B., & Cotter, P. (2001). Personalized electronic programme guides. Artificial Intelligence Magazine, 22(2), 89–98.CrossRef
go back to reference Snoek, C.G.M., Worring, M., van Gemert, J.C., Geusebroek, J.M., & Smeulders, A.W.M. (2006). The challenge problem for automated detection of 101 semantic concepts in multimedia. In Proceedings of the 14th annual ACM international conference on Multimedia, MULTIMEDIA ’06 (pp. 421–430). New York: ACM. https://doi.org/10.1145/1180639.1180727. Snoek, C.G.M., Worring, M., van Gemert, J.C., Geusebroek, J.M., & Smeulders, A.W.M. (2006). The challenge problem for automated detection of 101 semantic concepts in multimedia. In Proceedings of the 14th annual ACM international conference on Multimedia, MULTIMEDIA ’06 (pp. 421–430). New York: ACM. https://​doi.​org/​10.​1145/​1180639.​1180727.
go back to reference Turney, P., & Pantel, P. (2010). From frequency to meaning: Vector space models of semantics. Journal of Artificial Intelligence Research (JAIR), 37, 141–188.CrossRef Turney, P., & Pantel, P. (2010). From frequency to meaning: Vector space models of semantics. Journal of Artificial Intelligence Research (JAIR), 37, 141–188.CrossRef
go back to reference Yanagawa, A., Chang, S.F., Kennedy, L., & Hsu, W. (2007). Columbia University’s baseline detectors for 374 LSCOM semantic visual concepts. Columbia University ADVENT Technical Report. Yanagawa, A., Chang, S.F., Kennedy, L., & Hsu, W. (2007). Columbia University’s baseline detectors for 374 LSCOM semantic visual concepts. Columbia University ADVENT Technical Report.
go back to reference Yuan, Y. (2003). Research on video classification and retrieval. Xi’an: Ph.D. thesis, School of Electronic and Information Engineering, Xi’an Jiaotong University. Yuan, Y. (2003). Research on video classification and retrieval. Xi’an: Ph.D. thesis, School of Electronic and Information Engineering, Xi’an Jiaotong University.
Metadata
Title
TV-Program Retrieval and Classification: A Comparison of Approaches based on Machine Learning
Authors
Fedelucio Narducci
Cataldo Musto
Marco de Gemmis
Pasquale Lops
Giovanni Semeraro
Publication date
29-08-2017
Publisher
Springer US
Published in
Information Systems Frontiers / Issue 6/2018
Print ISSN: 1387-3326
Electronic ISSN: 1572-9419
DOI
https://doi.org/10.1007/s10796-017-9780-0

Other articles of this Issue 6/2018

Information Systems Frontiers 6/2018 Go to the issue

Premium Partner