Skip to main content

01.04.2013

Feed and fly control of visual scanpaths for foveation image processing

verfasst von: Giuseppe Boccignone, Mario Ferraro

Erschienen in: Annals of Telecommunications | Ausgabe 3-4/2013

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Foveation-based processing and communication systems can exploit a more efficient representation of images and videos by removing or reducing visual information redundancy, provided that the sequence of foveation points, the visual scanpath, can be determined. However, one point that is neglected by the great majority of foveation models is the “noisy” variation of the random visual exploration exhibited by different observers when viewing the same scene, or even by the same subject along different trials. Here, a model for the generation and control of scanpaths that accounts for such issue is presented. In the model, the sequence of fixations and gaze shifts is controlled by a saliency-based, information foraging mechanism implemented through a dynamical system switching between two states, “feed” and “fly.” Results of the simulations are compared with experimental data derived from publicly available datasets.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat van Beers R (2007) The sources of variability in saccadic eye movements. J Neurosci 27(33):8757–8770CrossRef van Beers R (2007) The sources of variability in saccadic eye movements. J Neurosci 27(33):8757–8770CrossRef
2.
Zurück zum Zitat Begum M, Karray F (2011) Visual attention for robotic cognition: a survey. IEEE Trans Auton Mental Develop 3(1):92–105CrossRef Begum M, Karray F (2011) Visual attention for robotic cognition: a survey. IEEE Trans Auton Mental Develop 3(1):92–105CrossRef
3.
Zurück zum Zitat Bishop CM (2006) Pattern recognition and machine learning (Information Science and Statistics). Springer, New York Bishop CM (2006) Pattern recognition and machine learning (Information Science and Statistics). Springer, New York
4.
Zurück zum Zitat Boccignone G, Chianese A, Moscato V, Picariello A (2005) Foveated shot detection for video segmentation. IEEE Trans Circuits Syst Video Technol 15(3):365–377CrossRef Boccignone G, Chianese A, Moscato V, Picariello A (2005) Foveated shot detection for video segmentation. IEEE Trans Circuits Syst Video Technol 15(3):365–377CrossRef
5.
Zurück zum Zitat Boccignone G, Ferraro M (2004) Modelling gaze shift as a constrained random walk. Physica A: Statistical Mechanics and its Applications 331(1–2):207–218CrossRef Boccignone G, Ferraro M (2004) Modelling gaze shift as a constrained random walk. Physica A: Statistical Mechanics and its Applications 331(1–2):207–218CrossRef
6.
Zurück zum Zitat Boccignone G, Ferraro M (2011) Modelling eye-movement control via a constrained search approach. In: Proceedings of 3rd European workshop on visual information processing (EUVIP 2011). IEEE Press, Piscataway, pp 235–240CrossRef Boccignone G, Ferraro M (2011) Modelling eye-movement control via a constrained search approach. In: Proceedings of 3rd European workshop on visual information processing (EUVIP 2011). IEEE Press, Piscataway, pp 235–240CrossRef
7.
Zurück zum Zitat Boccignone G, Marcelli A, Napoletano P, Di Fiore G, Iacovoni G, Morsa S (2008) Bayesian integration of face and low-level cues for foveated video coding. IEEE Trans Circuits Syst Video Technol 18(12):1727–1740CrossRef Boccignone G, Marcelli A, Napoletano P, Di Fiore G, Iacovoni G, Morsa S (2008) Bayesian integration of face and low-level cues for foveated video coding. IEEE Trans Circuits Syst Video Technol 18(12):1727–1740CrossRef
9.
Zurück zum Zitat Brockmann D, Geisel T (2000) The ecology of gaze shifts. Neurocomputing 32(1):643–650CrossRef Brockmann D, Geisel T (2000) The ecology of gaze shifts. Neurocomputing 32(1):643–650CrossRef
10.
Zurück zum Zitat Cerf M, Frady E, Koch C (2009) Faces and text attract gaze independent of the task: experimental data and computer model. J Vis 9(12):10.1–10.15 Cerf M, Frady E, Koch C (2009) Faces and text attract gaze independent of the task: experimental data and computer model. J Vis 9(12):10.1–10.15
11.
Zurück zum Zitat Cerf M, Harel J, Einhäuser W, Koch C (2008) Predicting human gaze using low-level saliency combined with face detection. In: Advances in neural information processing systems, vol 20. MIT Press, Cambridge, pp 545–552 Cerf M, Harel J, Einhäuser W, Koch C (2008) Predicting human gaze using low-level saliency combined with face detection. In: Advances in neural information processing systems, vol 20. MIT Press, Cambridge, pp 545–552
12.
13.
Zurück zum Zitat Churchland P, Ramachandran V, Sejnowski T (1994) A critique of pure vision. MIT Press, Cambridge Churchland P, Ramachandran V, Sejnowski T (1994) A critique of pure vision. MIT Press, Cambridge
14.
Zurück zum Zitat Codling E, Plank M, Benhamou S (2008) Random walk models in biology. J R Soc Interface 5(25):813CrossRef Codling E, Plank M, Benhamou S (2008) Random walk models in biology. J R Soc Interface 5(25):813CrossRef
15.
Zurück zum Zitat Cotsaces C, Nikolaidis N, Pitas I (2006) Video shot detection and condensed representation. a review. IEEE Signal Process Mag 23(2):28–37CrossRef Cotsaces C, Nikolaidis N, Pitas I (2006) Video shot detection and condensed representation. a review. IEEE Signal Process Mag 23(2):28–37CrossRef
16.
Zurück zum Zitat Da Luz M, Buldyrev S, Havlin S, Raposo E, Stanley H, Viswanathan G (2001) Improvements in the statistical approach to random Lévy flight searches. Physica A: Statistical Mechanics and its Applications 295(1–2):89–92MATHCrossRef Da Luz M, Buldyrev S, Havlin S, Raposo E, Stanley H, Viswanathan G (2001) Improvements in the statistical approach to random Lévy flight searches. Physica A: Statistical Mechanics and its Applications 295(1–2):89–92MATHCrossRef
17.
Zurück zum Zitat Ellis S, Stark L (1986) Statistical dependency in visual scanning. Hum Factors 28(4):421–438 Ellis S, Stark L (1986) Statistical dependency in visual scanning. Hum Factors 28(4):421–438
18.
Zurück zum Zitat Frintrop S, Rome E, Christensen H (2010) Computational visual attention systems and their cognitive foundations: a survey. ACM Trans Appl Percept 7(1):1–39CrossRef Frintrop S, Rome E, Christensen H (2010) Computational visual attention systems and their cognitive foundations: a survey. ACM Trans Appl Percept 7(1):1–39CrossRef
19.
Zurück zum Zitat Gnedenko B, Kolmogórov A (1954) Limit distributions for sums of independent random variables. Addison-Wesley, ReadingMATH Gnedenko B, Kolmogórov A (1954) Limit distributions for sums of independent random variables. Addison-Wesley, ReadingMATH
20.
Zurück zum Zitat Harel J, Koch C, Perona P (2007) Graph-based visual saliency. In: Advances in neural information processing systems, vol 19. MIT Press, Cambridge, pp 545–552 Harel J, Koch C, Perona P (2007) Graph-based visual saliency. In: Advances in neural information processing systems, vol 19. MIT Press, Cambridge, pp 545–552
21.
Zurück zum Zitat Harris C (1998) On the optimal control of behaviour: a stochastic perspective. J Neurosci Methods 83(1):73–88CrossRef Harris C (1998) On the optimal control of behaviour: a stochastic perspective. J Neurosci Methods 83(1):73–88CrossRef
22.
Zurück zum Zitat Holmqvist K, Nyström M, Andersson R, Dewhurst R, Jarodzka H, Van de Weijer J (2011) Eye tracking: a comprehensive guide to methods and measures. Oxford University Press, Oxford Holmqvist K, Nyström M, Andersson R, Dewhurst R, Jarodzka H, Van de Weijer J (2011) Eye tracking: a comprehensive guide to methods and measures. Oxford University Press, Oxford
23.
Zurück zum Zitat Hou X, Zhang L (2007) Saliency detection: a spectral residual approach. In: Proceedings CVPR 07, vol 1, pp 1–8 Hou X, Zhang L (2007) Saliency detection: a spectral residual approach. In: Proceedings CVPR 07, vol 1, pp 1–8
24.
Zurück zum Zitat Itti L (2004) Automatic foveation for video compression using a neurobiological model of visual attention. IEEE Trans Image Process 13(10):1304–1318CrossRef Itti L (2004) Automatic foveation for video compression using a neurobiological model of visual attention. IEEE Trans Image Process 13(10):1304–1318CrossRef
25.
Zurück zum Zitat Itti L, Baldi P (2009) Bayesian surprise attracts human attention. Vis Res 49(10):1295–1306CrossRef Itti L, Baldi P (2009) Bayesian surprise attracts human attention. Vis Res 49(10):1295–1306CrossRef
26.
Zurück zum Zitat Itti L, Koch C, Niebur E (1998) A model of saliency-based visual attention for rapid scene analysis. IEEE Trans Pattern Anal Mach Intell 20:1254–1259CrossRef Itti L, Koch C, Niebur E (1998) A model of saliency-based visual attention for rapid scene analysis. IEEE Trans Pattern Anal Mach Intell 20:1254–1259CrossRef
27.
Zurück zum Zitat Jackson, J (1958) Evolution and dissolution of the nervous system (Croonian lectures). Published in parts in the British Medical Journal, Lancet pp 5–75 Jackson, J (1958) Evolution and dissolution of the nervous system (Croonian lectures). Published in parts in the British Medical Journal, Lancet pp 5–75
28.
Zurück zum Zitat Klein R, MacInnes W (1999) Inhibition of return is a foraging facilitator in visual search. Psychol Sci 10(4):346–352CrossRef Klein R, MacInnes W (1999) Inhibition of return is a foraging facilitator in visual search. Psychol Sci 10(4):346–352CrossRef
29.
30.
Zurück zum Zitat Kowler E (2011) Eye movements: the past 25 years. Vis Res 51(13):1457–1483. 50th Anniversary Special Issue of Vision Research - vol 2 Kowler E (2011) Eye movements: the past 25 years. Vis Res 51(13):1457–1483. 50th Anniversary Special Issue of Vision Research - vol 2
31.
Zurück zum Zitat Kustov A, Robinson D (1996) Shared neural control of attentional shifts and eye movements. Nature 384:74–77CrossRef Kustov A, Robinson D (1996) Shared neural control of attentional shifts and eye movements. Nature 384:74–77CrossRef
32.
Zurück zum Zitat Lee J, De Simone F, Ebrahimi T (2011) Efficient video coding based on audio-visual focus of attention. J Vis Commun Image Represent 22(8):704–711CrossRef Lee J, De Simone F, Ebrahimi T (2011) Efficient video coding based on audio-visual focus of attention. J Vis Commun Image Represent 22(8):704–711CrossRef
34.
Zurück zum Zitat Nolan J (1997) Numerical calculation of stable densities and distribution functions. Commun Stat Stoch Models 13(4):759–774MathSciNetMATHCrossRef Nolan J (1997) Numerical calculation of stable densities and distribution functions. Commun Stat Stoch Models 13(4):759–774MathSciNetMATHCrossRef
35.
Zurück zum Zitat Noton D, Stark L (1971) Scanpaths in eye movements during pattern perception. Science 171(968):308–311CrossRef Noton D, Stark L (1971) Scanpaths in eye movements during pattern perception. Science 171(968):308–311CrossRef
36.
Zurück zum Zitat Plank M, James A (2008) Optimal foraging: Lévy pattern or process? J R Soc Interface 5(26):1077CrossRef Plank M, James A (2008) Optimal foraging: Lévy pattern or process? J R Soc Interface 5(26):1077CrossRef
37.
Zurück zum Zitat Privitera CM, Stark LW (2000) Algorithms for defining visual regions-of-interest: comparison with eye fixations. IEEE Trans Pattern Anal Mach Intell 22(9):970–982CrossRef Privitera CM, Stark LW (2000) Algorithms for defining visual regions-of-interest: comparison with eye fixations. IEEE Trans Pattern Anal Mach Intell 22(9):970–982CrossRef
38.
Zurück zum Zitat Reynolds A (2008) Optimal random Lévy-loop searching: new insights into the searching behaviours of central-place foragers. EPL (Europhysics Letters) 82(2):20001.1–20001.6CrossRef Reynolds A (2008) Optimal random Lévy-loop searching: new insights into the searching behaviours of central-place foragers. EPL (Europhysics Letters) 82(2):20001.1–20001.6CrossRef
39.
Zurück zum Zitat Schütz A, Braun D, Gegenfurtner K (2011) Eye movements and perception: a selective review. J Vis 11(5):9CrossRef Schütz A, Braun D, Gegenfurtner K (2011) Eye movements and perception: a selective review. J Vis 11(5):9CrossRef
40.
Zurück zum Zitat Seo H, Milanfar P (2009) Static and space-time visual saliency detection by self-resemblance. J Vis 9(12):1–27CrossRef Seo H, Milanfar P (2009) Static and space-time visual saliency detection by self-resemblance. J Vis 9(12):1–27CrossRef
41.
Zurück zum Zitat Stark L, Privitera C, Yang H, Azzariti M, Ho Y, Blackmon T, Chernyak D (2001) Representation of human vision in the brain: how does human perception recognize images? J Electron Imaging 10:123–151CrossRef Stark L, Privitera C, Yang H, Azzariti M, Ho Y, Blackmon T, Chernyak D (2001) Representation of human vision in the brain: how does human perception recognize images? J Electron Imaging 10:123–151CrossRef
42.
Zurück zum Zitat Stephen D, Mirman D, Magnuson J, Dixon J (2009) Lévy-like diffusion in eye movements during spoken-language comprehension. Phys Rev E 79(5):056114.1–056114.6CrossRef Stephen D, Mirman D, Magnuson J, Dixon J (2009) Lévy-like diffusion in eye movements during spoken-language comprehension. Phys Rev E 79(5):056114.1–056114.6CrossRef
43.
Zurück zum Zitat Tatler B, Vincent B (2009) The prominence of behavioural biases in eye guidance. Vis Cogn 17(6–7):1029–1054CrossRef Tatler B, Vincent B (2009) The prominence of behavioural biases in eye guidance. Vis Cogn 17(6–7):1029–1054CrossRef
44.
Zurück zum Zitat Vandekerckhove J, Tuerlinckx F, Lee M (2011) Hierarchical diffusion models for two-choice response times. Psychol Methods 16(1):44CrossRef Vandekerckhove J, Tuerlinckx F, Lee M (2011) Hierarchical diffusion models for two-choice response times. Psychol Methods 16(1):44CrossRef
45.
Zurück zum Zitat Viswanathan G, Afanasyev V, Buldyrev S, Havlin S, Da Luz M, Raposo E, Stanley H (2000) Lévy flights in random searches. Physica A: Statistical Mechanics and its Applications 282(1–2):1–12CrossRef Viswanathan G, Afanasyev V, Buldyrev S, Havlin S, Da Luz M, Raposo E, Stanley H (2000) Lévy flights in random searches. Physica A: Statistical Mechanics and its Applications 282(1–2):1–12CrossRef
46.
Zurück zum Zitat Wang Z, Lu L, Bovik AC (2003) Foveation scalable video coding with automatic fixation selection. IEEE Trans Image Process 12:1–12MathSciNetMATHCrossRef Wang Z, Lu L, Bovik AC (2003) Foveation scalable video coding with automatic fixation selection. IEEE Trans Image Process 12:1–12MathSciNetMATHCrossRef
47.
Zurück zum Zitat You J, Reiter U, Hannuksela M, Gabbouj M, Perkis A (2010) Perceptual-based quality assessment for audio–visual services: a survey. Signal Process Image Commun 25(7):482–501CrossRef You J, Reiter U, Hannuksela M, Gabbouj M, Perkis A (2010) Perceptual-based quality assessment for audio–visual services: a survey. Signal Process Image Commun 25(7):482–501CrossRef
Metadaten
Titel
Feed and fly control of visual scanpaths for foveation image processing
verfasst von
Giuseppe Boccignone
Mario Ferraro
Publikationsdatum
01.04.2013
Verlag
Springer-Verlag
Erschienen in
Annals of Telecommunications / Ausgabe 3-4/2013
Print ISSN: 0003-4347
Elektronische ISSN: 1958-9395
DOI
https://doi.org/10.1007/s12243-012-0316-9