Skip to main content
Top
Published in: International Journal of Multimedia Information Retrieval 2/2012

01-07-2012 | Regular Paper

Semantics-based selection of everyday concepts in visual lifelogging

Authors: Peng Wang, Alan F. Smeaton

Published in: International Journal of Multimedia Information Retrieval | Issue 2/2012

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Concept-based indexing, based on identifying various semantic concepts appearing in multimedia, is an attractive option for multimedia retrieval and much research tries to bridge the semantic gap between the media’s low-level features and high-level semantics. Research into concept-based multimedia retrieval has generally focussed on detecting concepts from high-quality media such as broadcast TV or movies, but it is not well addressed in other domains like lifelogging where the original data is captured with poorer quality. We argue that in noisy domains such as lifelogging, the management of data needs to include semantic reasoning in order to deduce a set of concepts to represent lifelog content for applications like searching, browsing or summarization. Using semantic concepts to manage lifelog data relies on the fusion of automatically detected concepts to provide a better understanding of the lifelog data. In this paper, we investigate the selection of semantic concepts for lifelogging which includes reasoning on semantic networks using a density-based approach. In a series of experiments we compare different semantic reasoning approaches and the experimental evaluations we report on lifelog data show the efficacy of our approach.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
4.
go back to reference Banerjee S, Pedersen T (2002) An adapted Lesk algorithm for word sense disambiguation using WordNet. In: CICLing ’02: Proceedings of the third international conference on computational linguistics and intelligent text processing. Springer, London, pp 136–145 Banerjee S, Pedersen T (2002) An adapted Lesk algorithm for word sense disambiguation using WordNet. In: CICLing ’02: Proceedings of the third international conference on computational linguistics and intelligent text processing. Springer, London, pp 136–145
5.
go back to reference Blum M, Pentland AS, Tröster G (2006) InSense: interest-based life logging. IEEE Multimed 13(4):40–48CrossRef Blum M, Pentland AS, Tröster G (2006) InSense: interest-based life logging. IEEE Multimed 13(4):40–48CrossRef
6.
go back to reference Bukhin M, DelGaudio M (2006) WayMarkr: acquiring perspective through continuous documentation. In: MUM ’06: Proceedings of the 5th international conference on mobile and ubiquitous multimedia, ACM, New York, USA, NY, p 9 Bukhin M, DelGaudio M (2006) WayMarkr: acquiring perspective through continuous documentation. In: MUM ’06: Proceedings of the 5th international conference on mobile and ubiquitous multimedia, ACM, New York, USA, NY, p 9
7.
go back to reference Bush V (1945) As we may think. The Atlantic Monthly Bush V (1945) As we may think. The Atlantic Monthly
8.
go back to reference Byrne D, Doherty AR, Snoek CGM, Jones GJF, Smeaton AF (2010) Everyday concept detection in visual lifelogs: validation, relationships and trends. Multimed Tools Appl 49(1):119–144CrossRef Byrne D, Doherty AR, Snoek CGM, Jones GJF, Smeaton AF (2010) Everyday concept detection in visual lifelogs: validation, relationships and trends. Multimed Tools Appl 49(1):119–144CrossRef
9.
go back to reference Chang SF, Hsu W, Jiang W, Kennedy L, Xu D, Yanagawa A, Zavesky E (2006) Evaluating the impact of 374 visual-based LSCOM concept detectors on automatic search. In: Proceedings of the 4th TRECVid workshop, USA, Gaithersburg Chang SF, Hsu W, Jiang W, Kennedy L, Xu D, Yanagawa A, Zavesky E (2006) Evaluating the impact of 374 visual-based LSCOM concept detectors on automatic search. In: Proceedings of the 4th TRECVid workshop, USA, Gaithersburg
10.
go back to reference Chilvers R, Corr S, Hayley S (2010) Investigation into the occupational lives of healthy older people through their use of time. Aust Occup Ther J 57(1):24–33CrossRef Chilvers R, Corr S, Hayley S (2010) Investigation into the occupational lives of healthy older people through their use of time. Aust Occup Ther J 57(1):24–33CrossRef
11.
go back to reference Christel MG, Hauptmann AG (2005) The use and utility of high-level semantic features in video retrieval. In: CIVR ’05: Proceedings of the international conference on video retrieval, Ireland, Dublin, pp 134–144 Christel MG, Hauptmann AG (2005) The use and utility of high-level semantic features in video retrieval. In: CIVR ’05: Proceedings of the international conference on video retrieval, Ireland, Dublin, pp 134–144
12.
go back to reference Doherty AR, Caprani N, O’Conaire C, Kalnikaite V, Gurrin C, O’Connor NE, Smeaton AF (2011) Passively recognising human activities through lifelogging. Comput Hum Behav 27(5):1948–1958CrossRef Doherty AR, Caprani N, O’Conaire C, Kalnikaite V, Gurrin C, O’Connor NE, Smeaton AF (2011) Passively recognising human activities through lifelogging. Comput Hum Behav 27(5):1948–1958CrossRef
13.
go back to reference Haubold A, Natsev A, Naphade M (2006) Semantic multimedia retrieval using lexical query expansion and model-based reranking. IEEE international conference on multimedia and expo, pp 1761– 1764 Haubold A, Natsev A, Naphade M (2006) Semantic multimedia retrieval using lexical query expansion and model-based reranking. IEEE international conference on multimedia and expo, pp 1761– 1764
14.
go back to reference Hauff C, Aly R, Hiemstra D (2007) The effectiveness of concept based search for video retrieval. In: Workshop information retrieval (FGIR 2007), Halle-Wittenberg, Germany, pp 205–212 Hauff C, Aly R, Hiemstra D (2007) The effectiveness of concept based search for video retrieval. In: Workshop information retrieval (FGIR 2007), Halle-Wittenberg, Germany, pp 205–212
15.
go back to reference Hirst G, St-Onge D (1998) Lexical chains as representation of context for the detection and correction malapropisms. In: Fellbaum C (ed) WordNet: an electronic lexical database. MIT Press, Cambridge, MA Hirst G, St-Onge D (1998) Lexical chains as representation of context for the detection and correction malapropisms. In: Fellbaum C (ed) WordNet: an electronic lexical database. MIT Press, Cambridge, MA
16.
go back to reference Hodges S, Williams L, Berry E, Izadi S, Srinivasan J, Butler A, Smyth G, Kapur N, Wood K (2006) Sense Cam: a retrospective memory aid. In: Proceedings of 8th international conference on Ubicomp, Orange County, CA, USA, pp 177–193 Hodges S, Williams L, Berry E, Izadi S, Srinivasan J, Butler A, Smyth G, Kapur N, Wood K (2006) Sense Cam: a retrospective memory aid. In: Proceedings of 8th international conference on Ubicomp, Orange County, CA, USA, pp 177–193
17.
go back to reference Hori T, Aizawa K (2003) Context-based video retrieval system for the life-log applications. In: Proceedings of the 5th ACM SIGMM international workshop on multimedia information retrieval, MIR ’03, ACM, New York, USA, NY, pp 31–38 Hori T, Aizawa K (2003) Context-based video retrieval system for the life-log applications. In: Proceedings of the 5th ACM SIGMM international workshop on multimedia information retrieval, MIR ’03, ACM, New York, USA, NY, pp 31–38
18.
go back to reference Huurnink B, Hofmann K, de Rijke M (2008) Assessing concept selection for video retrieval. In: MIR ’08: Proceeding of the 1st ACM international conference on multimedia information retrieval, ACM, New York, NY, USA, pp 459–466 Huurnink B, Hofmann K, de Rijke M (2008) Assessing concept selection for video retrieval. In: MIR ’08: Proceeding of the 1st ACM international conference on multimedia information retrieval, ACM, New York, NY, USA, pp 459–466
19.
go back to reference Kahneman D, Krueger AB, Schkade DA, Schwarz N, Stone AA (2004) A survey method for characterizing daily life experience: the day reconstruction method. Science 306(5702):1776– 1780CrossRef Kahneman D, Krueger AB, Schkade DA, Schwarz N, Stone AA (2004) A survey method for characterizing daily life experience: the day reconstruction method. Science 306(5702):1776– 1780CrossRef
20.
go back to reference Lakoff G (1990) Women, fire, and dangerous things. University of Chicago Press, Chicago Lakoff G (1990) Women, fire, and dangerous things. University of Chicago Press, Chicago
21.
go back to reference Leacock C, Chodorow M (1998) Combining local context and WordNet similarity for word sense identification. In: Fellbaum C (ed) WordNet: an electronic lexical, database. MIT Press, Cambridge, MA, pp 265–283 Leacock C, Chodorow M (1998) Combining local context and WordNet similarity for word sense identification. In: Fellbaum C (ed) WordNet: an electronic lexical, database. MIT Press, Cambridge, MA, pp 265–283
22.
go back to reference Li X, Wang D, Li J, Zhang B (2007) Video search in concept subspace: a text-like paradigm. In: Proceedings of the 6th ACM international conference on image and video retrieval, CIVR ’07, ACM, New York, NY, USA, pp 603–610 Li X, Wang D, Li J, Zhang B (2007) Video search in concept subspace: a text-like paradigm. In: Proceedings of the 6th ACM international conference on image and video retrieval, CIVR ’07, ACM, New York, NY, USA, pp 603–610
23.
go back to reference Liu H, Singh P (2004) Commonsense reasoning in and over natural language. In: Proceedings of the 8th international conference on knowledge-based intelligent information and engineering systems, Springer, Wellington, New Zealand Liu H, Singh P (2004) Commonsense reasoning in and over natural language. In: Proceedings of the 8th international conference on knowledge-based intelligent information and engineering systems, Springer, Wellington, New Zealand
24.
go back to reference Mann S (1997) Wearable computing: a first step toward personal imaging. Computer 30(2):25–32CrossRef Mann S (1997) Wearable computing: a first step toward personal imaging. Computer 30(2):25–32CrossRef
25.
go back to reference Mann S, Fung J, Aimone C, Sehgal A, Chen D (2005) Designing EyeTap digital eyeglasses for continuous lifelong capture and sharing of personal experiences. In: Proceedings of CHI 2005 conference on computer human interaction. ACM Press, Portland, Oregon, USA Mann S, Fung J, Aimone C, Sehgal A, Chen D (2005) Designing EyeTap digital eyeglasses for continuous lifelong capture and sharing of personal experiences. In: Proceedings of CHI 2005 conference on computer human interaction. ACM Press, Portland, Oregon, USA
26.
go back to reference Miller GA (1995) WordNet: a lexical database for English. Commun ACM 38(11):39–41CrossRef Miller GA (1995) WordNet: a lexical database for English. Commun ACM 38(11):39–41CrossRef
27.
go back to reference Naphade M, Smith JR, Tesic J, Chang SF, Hsu W, Kennedy L, Hauptmann A, Curtis J (2006) Large-scale concept ontology for multimedia. IEEE Multimed 13(3):86–91CrossRef Naphade M, Smith JR, Tesic J, Chang SF, Hsu W, Kennedy L, Hauptmann A, Curtis J (2006) Large-scale concept ontology for multimedia. IEEE Multimed 13(3):86–91CrossRef
28.
go back to reference Natsev AP, Haubold A, Tesic J, Xie L, Yan R (2007) Semantic concept-based query expansion and re-ranking for multimedia retrieval. In: MULTIMEDIA ’07: Proceedings of the 15th international conference on multimedia, ACM, New York, NY, USA, pp 991–1000 Natsev AP, Haubold A, Tesic J, Xie L, Yan R (2007) Semantic concept-based query expansion and re-ranking for multimedia retrieval. In: MULTIMEDIA ’07: Proceedings of the 15th international conference on multimedia, ACM, New York, NY, USA, pp 991–1000
29.
go back to reference Over P, Ianeva T, Kraaij W, Smeaton AF (2005) TRECVid 2005—an overview. In: Proceedings of TRECVid Over P, Ianeva T, Kraaij W, Smeaton AF (2005) TRECVid 2005—an overview. In: Proceedings of TRECVid
30.
go back to reference Rada R, Mili H, Bicknell E, Blettner M (1989) Development and application of a metric on semantic nets. IEEE Trans Syst Man Cybern 19(1):17–30CrossRef Rada R, Mili H, Bicknell E, Blettner M (1989) Development and application of a metric on semantic nets. IEEE Trans Syst Man Cybern 19(1):17–30CrossRef
31.
go back to reference Reddy S, Parker A, Hyman J, Burke J, Estrin D, Hansen M (2007) Image browsing, processing, and clustering for participatory sensing: lessons from a dietsense prototype. In: EmNets’07: Proceedings of the 4th workshop on embedded networked sensors, ACM Press, Cork, Ireland, pp 13–17 Reddy S, Parker A, Hyman J, Burke J, Estrin D, Hansen M (2007) Image browsing, processing, and clustering for participatory sensing: lessons from a dietsense prototype. In: EmNets’07: Proceedings of the 4th workshop on embedded networked sensors, ACM Press, Cork, Ireland, pp 13–17
32.
go back to reference Resnik P (1995) Using information content to evaluate semantic similarity in a taxonomy. In: Proceedings of the 14th international joint conference on artificial intelligence, Morgan Kaufmann, San Francisco, CA, USA, pp 448–453 Resnik P (1995) Using information content to evaluate semantic similarity in a taxonomy. In: Proceedings of the 14th international joint conference on artificial intelligence, Morgan Kaufmann, San Francisco, CA, USA, pp 448–453
33.
go back to reference Resnik P (1999) Semantic similarity in a taxonomy: an information-based measure and its application to problems of ambiguity in natural language. J Artif Intell Res 11:95–130MATH Resnik P (1999) Semantic similarity in a taxonomy: an information-based measure and its application to problems of ambiguity in natural language. J Artif Intell Res 11:95–130MATH
34.
go back to reference Richardson R, Smeaton AF (1995) Using WordNet in a knowledge-based approach to information retrieval. Tech Rep CA-0395, Dublin City University Richardson R, Smeaton AF (1995) Using WordNet in a knowledge-based approach to information retrieval. Tech Rep CA-0395, Dublin City University
35.
go back to reference Sellen A, Fogg A, Aitken M, Hodges S, Rother C, Wood K (2007) Do life-logging technologies support memory for the past? An experimental study using Sense Cam. In: Proceedings of CHI 2007, ACM Press, New York, NY, USA, pp 81–90 Sellen A, Fogg A, Aitken M, Hodges S, Rother C, Wood K (2007) Do life-logging technologies support memory for the past? An experimental study using Sense Cam. In: Proceedings of CHI 2007, ACM Press, New York, NY, USA, pp 81–90
36.
go back to reference Sellen AJ, Whittaker S (2010) Beyond total capture: a constructive critique of lifelogging. Commun ACM 53(5):70–77CrossRef Sellen AJ, Whittaker S (2010) Beyond total capture: a constructive critique of lifelogging. Commun ACM 53(5):70–77CrossRef
37.
go back to reference Smeaton AF, Over P, Kraaij W (2009) High-level feature detection from video in TRECVid: a 5-year retrospective of achievements. In: Divakaran A (ed) Multimedia content analysis, theory and applications. Springer, Berlin, pp 151–174 Smeaton AF, Over P, Kraaij W (2009) High-level feature detection from video in TRECVid: a 5-year retrospective of achievements. In: Divakaran A (ed) Multimedia content analysis, theory and applications. Springer, Berlin, pp 151–174
38.
go back to reference Smeaton AF, Quigley I (1996) Experiments on using semantic distances between words in image caption retrieval. In: Proceedings of the 19th annual international ACM SIGIR conference on research and development in information retrieval, SIGIR ’96, ACM, New York, NY, USA, pp 174–180 Smeaton AF, Quigley I (1996) Experiments on using semantic distances between words in image caption retrieval. In: Proceedings of the 19th annual international ACM SIGIR conference on research and development in information retrieval, SIGIR ’96, ACM, New York, NY, USA, pp 174–180
39.
go back to reference Snoek CGM, van Gemert JC, Gevers T, Huurnink B, Koelma DC, van Liempt M, de Rooij O, van de Sande KEA, Seinstra FJ, Smeulders AWM, Thean AHC, Veenman CJ, Worring M (2006) The MediaMill TRECVid 2006 semantic video search engine. In: Proceedings of the 4th TRECVid workshop, Gaithersburg, USA Snoek CGM, van Gemert JC, Gevers T, Huurnink B, Koelma DC, van Liempt M, de Rooij O, van de Sande KEA, Seinstra FJ, Smeulders AWM, Thean AHC, Veenman CJ, Worring M (2006) The MediaMill TRECVid 2006 semantic video search engine. In: Proceedings of the 4th TRECVid workshop, Gaithersburg, USA
40.
go back to reference Snoek CGM, Huurnink B, Hollink L, Rijke MD, Schreiber G, Worring M (2007) Adding semantics to detectors for video retrieval. IEEE Trans Multimed 9(5):975–986CrossRef Snoek CGM, Huurnink B, Hollink L, Rijke MD, Schreiber G, Worring M (2007) Adding semantics to detectors for video retrieval. IEEE Trans Multimed 9(5):975–986CrossRef
41.
go back to reference Snoek CGM, Worring M, van Gemert JC, Geusebroek JM, Smeulders AWM (2006) The challenge problem for automated detection of 101 semantic concepts in multimedia. In: Proceedings of the 14th annual ACM international conference on multimedia, MULTIMEDIA ’06, ACM, New York, NY, USA, pp 421–430 Snoek CGM, Worring M, van Gemert JC, Geusebroek JM, Smeulders AWM (2006) The challenge problem for automated detection of 101 semantic concepts in multimedia. In: Proceedings of the 14th annual ACM international conference on multimedia, MULTIMEDIA ’06, ACM, New York, NY, USA, pp 421–430
42.
go back to reference Wei XY, Ngo CW (2007) Ontology-enriched semantic space for video search. In: MULTIMEDIA ’07: Proceedings of the 15th international conference on multimedia, ACM, New York, NY, USA, pp 981–990 Wei XY, Ngo CW (2007) Ontology-enriched semantic space for video search. In: MULTIMEDIA ’07: Proceedings of the 15th international conference on multimedia, ACM, New York, NY, USA, pp 981–990
43.
go back to reference Wu Z, Palmer M (1994) Verb semantics and lexical selection. In: Proceedings of the 32nd annual meeting on Association for Computational Linguistics, Stroudsburg, PA, USA, pp 133–138 Wu Z, Palmer M (1994) Verb semantics and lexical selection. In: Proceedings of the 32nd annual meeting on Association for Computational Linguistics, Stroudsburg, PA, USA, pp 133–138
Metadata
Title
Semantics-based selection of everyday concepts in visual lifelogging
Authors
Peng Wang
Alan F. Smeaton
Publication date
01-07-2012
Publisher
Springer-Verlag
Published in
International Journal of Multimedia Information Retrieval / Issue 2/2012
Print ISSN: 2192-6611
Electronic ISSN: 2192-662X
DOI
https://doi.org/10.1007/s13735-012-0010-8

Other articles of this Issue 2/2012

International Journal of Multimedia Information Retrieval 2/2012 Go to the issue

Premium Partner