Skip to main content

2018 | OriginalPaper | Buchkapitel

Multiple-Gaze Geometry: Inferring Novel 3D Locations from Gazes Observed in Monocular Video

verfasst von : Ernesto Brau, Jinyan Guan, Tanya Jeffries, Kobus Barnard

Erschienen in: Computer Vision – ECCV 2018

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

We develop using person gaze direction for scene understanding. In particular, we use intersecting gazes to learn 3D locations that people tend to look at, which is analogous to having multiple camera views. The 3D locations that we discover need not be visible to the camera. Conversely, knowing 3D locations of scene elements that draw visual attention, such as other people in the scene, can help infer gaze direction. We provide a Bayesian generative model for the temporal scene that captures the joint probability of camera parameters, locations of people, their gaze, what they are looking at, and locations of visual attention. Both the number of people in the scene and the number of extra objects that draw attention are unknown and need to be inferred. To execute this joint inference we use a probabilistic data association approach that enables principled comparison of model hypotheses. We use MCMC for inference over the discrete correspondence variables, and approximate the marginalization over continuous parameters using the Metropolis-Laplace approximation, using Hamiltonian (Hybrid) Monte Carlo for maximization. As existing data sets do not provide the 3D locations of what people are looking at, we contribute a small data set that does. On this data set, we infer what people are looking at with 59% precision compared with 13% for a baseline approach, and where those objects are within about 0.58 m.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Alameda-Pineda, X., et al.: Salsa: a novel dataset for multimodal group behavior analysis. IEEE Trans. Pattern Anal. Mach. Intell. 38(8), 1707–1720 (2016)CrossRef Alameda-Pineda, X., et al.: Salsa: a novel dataset for multimodal group behavior analysis. IEEE Trans. Pattern Anal. Mach. Intell. 38(8), 1707–1720 (2016)CrossRef
2.
Zurück zum Zitat Andriluka, M., Roth, S., Schiele, B.: People-tracking-by-detection and people-detection-by-tracking. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2008, pp. 1–8. IEEE (2008) Andriluka, M., Roth, S., Schiele, B.: People-tracking-by-detection and people-detection-by-tracking. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2008, pp. 1–8. IEEE (2008)
4.
Zurück zum Zitat Andriyenko, A., Schindler, K., Roth, S.: Discrete-continuous optimization for multi-target tracking. In: CVPR, pp. 1926–1933 (2012) Andriyenko, A., Schindler, K., Roth, S.: Discrete-continuous optimization for multi-target tracking. In: CVPR, pp. 1926–1933 (2012)
5.
Zurück zum Zitat Ba, S.O., Hung, H., Odobez, J.M.: Visual activity context for focus of attention estimation in dynamic meetings. In: IEEE International Conference on Multimedia and Expo, ICME 2009, pp. 1424–1427. IEEE (2009) Ba, S.O., Hung, H., Odobez, J.M.: Visual activity context for focus of attention estimation in dynamic meetings. In: IEEE International Conference on Multimedia and Expo, ICME 2009, pp. 1424–1427. IEEE (2009)
7.
Zurück zum Zitat Ba, S.O., Odobez, J.M.: Recognizing visual focus of attention from head pose in natural meetings. IEEE Trans. Syst. Man Cybern. Part B Cybern. 39(1), 16–33 (2009)CrossRef Ba, S.O., Odobez, J.M.: Recognizing visual focus of attention from head pose in natural meetings. IEEE Trans. Syst. Man Cybern. Part B Cybern. 39(1), 16–33 (2009)CrossRef
8.
Zurück zum Zitat Ba, S.O., Odobez, J.M.: Multiperson visual focus of attention from head pose and meeting contextual cues. IEEE Trans. Pattern Anal. Mach. Intell. 33(1), 101–116 (2011)CrossRef Ba, S.O., Odobez, J.M.: Multiperson visual focus of attention from head pose and meeting contextual cues. IEEE Trans. Pattern Anal. Mach. Intell. 33(1), 101–116 (2011)CrossRef
9.
Zurück zum Zitat Benfold, B., Reid, I.: Stable multi-target tracking in real-time surveillance video. In: CVPR, pp. 3457–3464 (2011) Benfold, B., Reid, I.: Stable multi-target tracking in real-time surveillance video. In: CVPR, pp. 3457–3464 (2011)
10.
Zurück zum Zitat Benfold, B., Reid, I.: Guiding visual surveillance by tracking human attention. In: BMVC, pp. 1–11 (2009) Benfold, B., Reid, I.: Guiding visual surveillance by tracking human attention. In: BMVC, pp. 1–11 (2009)
11.
Zurück zum Zitat Beymer, D.J.: Face recognition under varying pose. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 1994, pp. 756–761. IEEE (1994) Beymer, D.J.: Face recognition under varying pose. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 1994, pp. 756–761. IEEE (1994)
12.
Zurück zum Zitat Blanz, V., Vetter, T.: Face recognition based on fitting a 3D morphable model. IEEE Trans. Pattern Anal. Mach. Intell. 25(9), 1063–1074 (2003)CrossRef Blanz, V., Vetter, T.: Face recognition based on fitting a 3D morphable model. IEEE Trans. Pattern Anal. Mach. Intell. 25(9), 1063–1074 (2003)CrossRef
13.
Zurück zum Zitat Brau, E., Guan, J., Simek, K., Del Pero, L., Dawson, C.R., Barnard, K.: Bayesian 3D tracking from monocular video. In: 2013 IEEE International Conference on Computer Vision (ICCV), pp. 3368–3375. IEEE (2013) Brau, E., Guan, J., Simek, K., Del Pero, L., Dawson, C.R., Barnard, K.: Bayesian 3D tracking from monocular video. In: 2013 IEEE International Conference on Computer Vision (ICCV), pp. 3368–3375. IEEE (2013)
14.
Zurück zum Zitat Chen, C., Heili, A., Odobez, J.M.: A joint estimation of head and body orientation cues in surveillance video. In: 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops), pp. 860–867. IEEE (2011) Chen, C., Heili, A., Odobez, J.M.: A joint estimation of head and body orientation cues in surveillance video. In: 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops), pp. 860–867. IEEE (2011)
15.
Zurück zum Zitat Chen, C., Odobez, J.M.: We are not contortionists: coupled adaptive learning for head and body orientation estimation in surveillance video. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1544–1551. IEEE (2012) Chen, C., Odobez, J.M.: We are not contortionists: coupled adaptive learning for head and body orientation estimation in surveillance video. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1544–1551. IEEE (2012)
16.
Zurück zum Zitat Cristani, M., et al.: Social interaction discovery by statistical analysis of F-formations. In: BMVC (2011) Cristani, M., et al.: Social interaction discovery by statistical analysis of F-formations. In: BMVC (2011)
17.
Zurück zum Zitat Dehghan, A., Assari, S.M., Shah, M.: GMMCP tracker: globally optimal generalized maximum multi clique problem for multiple object tracking. In: CVPR, vol. 1, p. 2 (2015) Dehghan, A., Assari, S.M., Shah, M.: GMMCP tracker: globally optimal generalized maximum multi clique problem for multiple object tracking. In: CVPR, vol. 1, p. 2 (2015)
18.
Zurück zum Zitat Del Pero, L., Guan, J., Brau, E., Schlecht, J., Barnard, K.: Sampling bedrooms. In: CVPR, pp. 2009–2016 (2011) Del Pero, L., Guan, J., Brau, E., Schlecht, J., Barnard, K.: Sampling bedrooms. In: CVPR, pp. 2009–2016 (2011)
19.
Zurück zum Zitat Duffner, S., Garcia, C.: Visual focus of attention estimation with unsupervised incremental learning. IEEE Trans. Circuits Syst. Video Technol. 26(12), 2264–2272 (2016)CrossRef Duffner, S., Garcia, C.: Visual focus of attention estimation with unsupervised incremental learning. IEEE Trans. Circuits Syst. Video Technol. 26(12), 2264–2272 (2016)CrossRef
20.
Zurück zum Zitat Felzenszwalb, P., Girshick, R., McAllester, D., Ramanan, D.: Object detection with discriminatively trained part-based models. In: IEEE PAMI (2009) Felzenszwalb, P., Girshick, R., McAllester, D., Ramanan, D.: Object detection with discriminatively trained part-based models. In: IEEE PAMI (2009)
21.
Zurück zum Zitat Gee, A., Cipolla, R.: Determining the gaze of faces in images. Image Vis. Comput. 12(10), 639–647 (1994)CrossRef Gee, A., Cipolla, R.: Determining the gaze of faces in images. Image Vis. Comput. 12(10), 639–647 (1994)CrossRef
22.
Zurück zum Zitat Gu, L., Kanade, T.: 3D alignment of face in a single image. In: 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 1, pp. 1305–1312. IEEE (2006) Gu, L., Kanade, T.: 3D alignment of face in a single image. In: 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 1, pp. 1305–1312. IEEE (2006)
23.
Zurück zum Zitat Hartley, R., Zisserman, A.: Multiple View Geometry in Computer Vision. Cambridge University Press, New York (2000)MATH Hartley, R., Zisserman, A.: Multiple View Geometry in Computer Vision. Cambridge University Press, New York (2000)MATH
24.
Zurück zum Zitat Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning; Data Mining, Inference, and Prediction. Springer Series in Statistics. Springer, New York (2001)MATH Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning; Data Mining, Inference, and Prediction. Springer Series in Statistics. Springer, New York (2001)MATH
25.
Zurück zum Zitat Horprasert, T., Yacoob, Y., Davis, L.S.: Computing 3d head orientation from a monocular image sequence. In: 25th Annual AIPR Workshop on Emerging Applications of Computer Vision, pp. 244–252. International Society for Optics and Photonics (1997) Horprasert, T., Yacoob, Y., Davis, L.S.: Computing 3d head orientation from a monocular image sequence. In: 25th Annual AIPR Workshop on Emerging Applications of Computer Vision, pp. 244–252. International Society for Optics and Photonics (1997)
26.
Zurück zum Zitat Huang, J., Shao, X., Wechsler, H.: Face pose discrimination using support vector machines (SVM). In: Proceedings of the Fourteenth International Conference on Pattern Recognition, vol. 1, pp. 154–156. IEEE (1998) Huang, J., Shao, X., Wechsler, H.: Face pose discrimination using support vector machines (SVM). In: Proceedings of the Fourteenth International Conference on Pattern Recognition, vol. 1, pp. 154–156. IEEE (1998)
27.
Zurück zum Zitat Huang, Y., Duan, D., Cui, J., Davoine, F., Wang, L., Zha, H.: Joint estimation of head pose and visual focus of attention. In: 2014 IEEE International Conference on Image Processing (ICIP), pp. 3332–3336. IEEE (2014) Huang, Y., Duan, D., Cui, J., Davoine, F., Wang, L., Zha, H.: Joint estimation of head pose and visual focus of attention. In: 2014 IEEE International Conference on Image Processing (ICIP), pp. 3332–3336. IEEE (2014)
28.
Zurück zum Zitat Isard, M., MacCormick, J.: BraMBLe: a Bayesian multiple-blob tracker. In: ICCV, pp. 34–41 (2001) Isard, M., MacCormick, J.: BraMBLe: a Bayesian multiple-blob tracker. In: ICCV, pp. 34–41 (2001)
29.
Zurück zum Zitat Jayagopi, D.B., et al.: The vernissage corpus: a multimodal human-robot-interaction dataset. Technical report (2012) Jayagopi, D.B., et al.: The vernissage corpus: a multimodal human-robot-interaction dataset. Technical report (2012)
31.
Zurück zum Zitat Kuo, C., Huang, C., Nevatia, R.: Multi-target tracking by on-line learned discriminative appearance models. In: CVPR, pp. 685–692 (2010) Kuo, C., Huang, C., Nevatia, R.: Multi-target tracking by on-line learned discriminative appearance models. In: CVPR, pp. 685–692 (2010)
32.
Zurück zum Zitat La Cascia, M., Sclaroff, S., Athitsos, V.: Fast, reliable head tracking under varying illumination: an approach based on registration of texture-mapped 3d models. IEEE Trans. Pattern Anal. Mach. Intell. 22(4), 322–336 (2000)CrossRef La Cascia, M., Sclaroff, S., Athitsos, V.: Fast, reliable head tracking under varying illumination: an approach based on registration of texture-mapped 3d models. IEEE Trans. Pattern Anal. Mach. Intell. 22(4), 322–336 (2000)CrossRef
33.
Zurück zum Zitat Li, Y., Gong, S., Liddell, H.: Support vector regression and classification based multi-view face detection and recognition. In: Proceedings of the Fourth IEEE International Conference on Automatic Face and Gesture Recognition, pp. 300–305. IEEE (2000) Li, Y., Gong, S., Liddell, H.: Support vector regression and classification based multi-view face detection and recognition. In: Proceedings of the Fourth IEEE International Conference on Automatic Face and Gesture Recognition, pp. 300–305. IEEE (2000)
34.
Zurück zum Zitat Li, Y., Gong, S., Sherrah, J., Liddell, H.: Support vector machine based multi-view face detection and recognition. Image Vis. Comput. 22(5), 413–427 (2004)CrossRef Li, Y., Gong, S., Sherrah, J., Liddell, H.: Support vector machine based multi-view face detection and recognition. Image Vis. Comput. 22(5), 413–427 (2004)CrossRef
35.
Zurück zum Zitat Liu, C.: Exploring new representations and applications for motion analysis. Ph.D. thesis, M.I.T (2009) Liu, C.: Exploring new representations and applications for motion analysis. Ph.D. thesis, M.I.T (2009)
36.
Zurück zum Zitat Massé, B., Ba, S., Horaud, R.: Simultaneous estimation of gaze direction and visual focus of attention for multi-person-to-robot interaction. In: 2016 IEEE International Conference on Multimedia and Expo (ICME), pp. 1–6. IEEE (2016) Massé, B., Ba, S., Horaud, R.: Simultaneous estimation of gaze direction and visual focus of attention for multi-person-to-robot interaction. In: 2016 IEEE International Conference on Multimedia and Expo (ICME), pp. 1–6. IEEE (2016)
37.
Zurück zum Zitat Milan, A., Leal-Taixé, L., Schindler, K., Reid, I.: Joint tracking and segmentation of multiple targets. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5397–5406 (2015) Milan, A., Leal-Taixé, L., Schindler, K., Reid, I.: Joint tracking and segmentation of multiple targets. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5397–5406 (2015)
38.
Zurück zum Zitat Murphy-Chutorian, E., Trivedi, M.M.: Head pose estimation in computer vision: a survey. IEEE Trans. Pattern Anal. Mach. Intell. 31(4), 607–626 (2009)CrossRef Murphy-Chutorian, E., Trivedi, M.M.: Head pose estimation in computer vision: a survey. IEEE Trans. Pattern Anal. Mach. Intell. 31(4), 607–626 (2009)CrossRef
39.
Zurück zum Zitat Niyogi, S., Freeman, W.T.: Example-based head tracking. In: Proceedings of the Second International Conference on Automatic Face and Gesture Recognition, pp. 374–378. IEEE (1996) Niyogi, S., Freeman, W.T.: Example-based head tracking. In: Proceedings of the Second International Conference on Automatic Face and Gesture Recognition, pp. 374–378. IEEE (1996)
40.
Zurück zum Zitat Oh, S.: Bayesian formulation of data association and Markov chain Monte Carlo data association. In: Robotics: Science and Systems Conference (RSS) Workshop Inside Data association (2008) Oh, S.: Bayesian formulation of data association and Markov chain Monte Carlo data association. In: Robotics: Science and Systems Conference (RSS) Workshop Inside Data association (2008)
41.
Zurück zum Zitat Oh, S., Russell, S., Sastry, S.: Markov chain Monte Carlo data association for general multiple target tracking problems (2004) Oh, S., Russell, S., Sastry, S.: Markov chain Monte Carlo data association for general multiple target tracking problems (2004)
42.
Zurück zum Zitat Otsuka, K., Takemae, Y., Yamato, J.: A probabilistic inference of multiparty-conversation structure based on Markov-switching models of gaze patterns, head directions, and utterances. In: Proceedings of the 7th International Conference on Multimodal Interfaces, pp. 191–198. ACM (2005) Otsuka, K., Takemae, Y., Yamato, J.: A probabilistic inference of multiparty-conversation structure based on Markov-switching models of gaze patterns, head directions, and utterances. In: Proceedings of the 7th International Conference on Multimodal Interfaces, pp. 191–198. ACM (2005)
43.
Zurück zum Zitat Otsuka, K., Yamato, J., Takemae, Y., Murase, H.: Conversation scene analysis with dynamic Bayesian network basedon visual head tracking. In: 2006 IEEE International Conference on Multimedia and Expo, pp. 949–952. IEEE (2006) Otsuka, K., Yamato, J., Takemae, Y., Murase, H.: Conversation scene analysis with dynamic Bayesian network basedon visual head tracking. In: 2006 IEEE International Conference on Multimedia and Expo, pp. 949–952. IEEE (2006)
44.
Zurück zum Zitat Pirsiavash, H., Ramanan, D., Fowlkes, C.: Globally-optimal greedy algorithms for tracking a variable number of objects. In: CVPR, pp. 1201–1208 (2011) Pirsiavash, H., Ramanan, D., Fowlkes, C.: Globally-optimal greedy algorithms for tracking a variable number of objects. In: CVPR, pp. 1201–1208 (2011)
45.
Zurück zum Zitat Sankaranarayanan, K., Chang, M.C., Krahnstoever, N.: Tracking gaze direction from far-field surveillance cameras. In: 2011 IEEE Workshop on Applications of Computer Vision (WACV), pp. 519–526. IEEE (2011) Sankaranarayanan, K., Chang, M.C., Krahnstoever, N.: Tracking gaze direction from far-field surveillance cameras. In: 2011 IEEE Workshop on Applications of Computer Vision (WACV), pp. 519–526. IEEE (2011)
46.
Zurück zum Zitat Segal, A.V., Reid, I.: Latent data association: Bayesian model selection for multi-target tracking. In: 2013 IEEE International Conference on Computer Vision (ICCV), pp. 2904–2911. IEEE (2013) Segal, A.V., Reid, I.: Latent data association: Bayesian model selection for multi-target tracking. In: 2013 IEEE International Conference on Computer Vision (ICCV), pp. 2904–2911. IEEE (2013)
48.
Zurück zum Zitat Smith, K., Ba, S.O., Gatica-Perez, D., Odobez, J.M.: Tracking the multi person wandering visual focus of attention. In: Proceedings of the 8th International Conference on Multimodal Interfaces, pp. 265–272. ACM (2006) Smith, K., Ba, S.O., Gatica-Perez, D., Odobez, J.M.: Tracking the multi person wandering visual focus of attention. In: Proceedings of the 8th International Conference on Multimodal Interfaces, pp. 265–272. ACM (2006)
49.
Zurück zum Zitat Smith, K., Ba, S.O., Odobez, J.M., Gatica-Perez, D.: Tracking the visual focus of attention for a varying number of wandering people. IEEE Trans. Pattern Anal. Mach. Intell. 30(7), 1212–1229 (2008)CrossRef Smith, K., Ba, S.O., Odobez, J.M., Gatica-Perez, D.: Tracking the visual focus of attention for a varying number of wandering people. IEEE Trans. Pattern Anal. Mach. Intell. 30(7), 1212–1229 (2008)CrossRef
51.
Zurück zum Zitat Stiefelhagen, R., Yang, J., Waibel, A.: Modeling focus of attention for meeting indexing. In: Proceedings of the seventh ACM International Conference on Multimedia (Part 1), pp. 3–10. ACM (1999) Stiefelhagen, R., Yang, J., Waibel, A.: Modeling focus of attention for meeting indexing. In: Proceedings of the seventh ACM International Conference on Multimedia (Part 1), pp. 3–10. ACM (1999)
52.
Zurück zum Zitat Stiefelhagen, R., Yang, J., Waibel, A.: Modeling focus of attention for meeting indexing based on multiple cues. IEEE Trans. Neural Netw. 13(4), 928–938 (2002)CrossRef Stiefelhagen, R., Yang, J., Waibel, A.: Modeling focus of attention for meeting indexing based on multiple cues. IEEE Trans. Neural Netw. 13(4), 928–938 (2002)CrossRef
53.
Zurück zum Zitat Stiefelhagen, R., Zhu, J.: Head orientation and gaze direction in meetings. In: Extended Abstracts on Human Factors in Computing Systems, CHI 2002, pp. 858–859. ACM (2002) Stiefelhagen, R., Zhu, J.: Head orientation and gaze direction in meetings. In: Extended Abstracts on Human Factors in Computing Systems, CHI 2002, pp. 858–859. ACM (2002)
54.
Zurück zum Zitat Tang, S., Andres, B., Andriluka, M., Schiele, B.: Subgraph decomposition for multi-target tracking. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5033–5041 (2015) Tang, S., Andres, B., Andriluka, M., Schiele, B.: Subgraph decomposition for multi-target tracking. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5033–5041 (2015)
55.
Zurück zum Zitat Titsias, M.K., Lawrence, N.D., Rattray, M.: Efficient sampling for Gaussian Process inference using control variables. In: Advances in Neural Information Processing Systems, vol. 21, pp. 1681–1688. Curran Associates Inc., Vancouver, British Columbia, Canada (2008) Titsias, M.K., Lawrence, N.D., Rattray, M.: Efficient sampling for Gaussian Process inference using control variables. In: Advances in Neural Information Processing Systems, vol. 21, pp. 1681–1688. Curran Associates Inc., Vancouver, British Columbia, Canada (2008)
56.
Zurück zum Zitat Valenti, R., Sebe, N., Gevers, T.: Combining head pose and eye location information for gaze estimation. IEEE Trans. Image Process. 21(2), 802–815 (2012)MathSciNetCrossRef Valenti, R., Sebe, N., Gevers, T.: Combining head pose and eye location information for gaze estimation. IEEE Trans. Image Process. 21(2), 802–815 (2012)MathSciNetCrossRef
57.
58.
Zurück zum Zitat Voit, M., Stiefelhagen, R.: Deducing the visual focus of attention from head pose estimation in dynamic multi-view meeting scenarios. In: Proceedings of the 10th International Conference on Multimodal Interfaces, pp. 173–180. ACM (2008) Voit, M., Stiefelhagen, R.: Deducing the visual focus of attention from head pose estimation in dynamic multi-view meeting scenarios. In: Proceedings of the 10th International Conference on Multimodal Interfaces, pp. 173–180. ACM (2008)
59.
Zurück zum Zitat Voit, M., Stiefelhagen, R.: 3D user-perspective, voxel-based estimation of visual focus of attention in dynamic meeting scenarios. In: International Conference on Multimodal Interfaces and the Workshop on Machine Learning for Multimodal Interaction, p. 51. ACM (2010) Voit, M., Stiefelhagen, R.: 3D user-perspective, voxel-based estimation of visual focus of attention in dynamic meeting scenarios. In: International Conference on Multimodal Interfaces and the Workshop on Machine Learning for Multimodal Interaction, p. 51. ACM (2010)
61.
Zurück zum Zitat Wei, P., Zhao, Y., Zheng, N., Zhu, S.C.: Modeling 4d human-object interactions for joint event segmentation, recognition, and object localization. IEEE Trans Pattern Anal. Mach. Intell. 39, 1165–1179 (2016)CrossRef Wei, P., Zhao, Y., Zheng, N., Zhu, S.C.: Modeling 4d human-object interactions for joint event segmentation, recognition, and object localization. IEEE Trans Pattern Anal. Mach. Intell. 39, 1165–1179 (2016)CrossRef
62.
Zurück zum Zitat Wu, Y., Toyama, K.: Wide-range, person-and illumination-insensitive head orientation estimation. In: Proceedings of the Fourth IEEE International Conference on Automatic Face and Gesture Recognition, pp. 183–188. IEEE (2000) Wu, Y., Toyama, K.: Wide-range, person-and illumination-insensitive head orientation estimation. In: Proceedings of the Fourth IEEE International Conference on Automatic Face and Gesture Recognition, pp. 183–188. IEEE (2000)
63.
Zurück zum Zitat Xiao, J., Moriyama, T., Kanade, T., Cohn, J.F.: Robust full-motion recovery of head by dynamic templates and re-registration techniques. Int. J. Imaging Syst. Technol. 13(1), 85–94 (2003)CrossRef Xiao, J., Moriyama, T., Kanade, T., Cohn, J.F.: Robust full-motion recovery of head by dynamic templates and re-registration techniques. Int. J. Imaging Syst. Technol. 13(1), 85–94 (2003)CrossRef
64.
Zurück zum Zitat Xie, D., Todorovicy, S., Zhu, S.C.: Inferring “dark matter” and “dark energy” from videos. In: ICCV (2013) Xie, D., Todorovicy, S., Zhu, S.C.: Inferring “dark matter” and “dark energy” from videos. In: ICCV (2013)
65.
Zurück zum Zitat Yang, R., Zhang, Z.: Model-based head pose tracking with stereovision. In: Proceedings of the Fifth IEEE International Conference on Automatic Face and Gesture Recognition, pp. 255–260. IEEE (2002) Yang, R., Zhang, Z.: Model-based head pose tracking with stereovision. In: Proceedings of the Fifth IEEE International Conference on Automatic Face and Gesture Recognition, pp. 255–260. IEEE (2002)
66.
Zurück zum Zitat Yi, Y., Xu, H.: Hierarchical data association framework with occlusion handling for multiple targets tracking. IEEE Signal Process. Lett. 21(3), 288–291 (2014)MathSciNetCrossRef Yi, Y., Xu, H.: Hierarchical data association framework with occlusion handling for multiple targets tracking. IEEE Signal Process. Lett. 21(3), 288–291 (2014)MathSciNetCrossRef
67.
Zurück zum Zitat Yücel, Z., Salah, A.A., Mericli, C., Meriçli, T., Valenti, R., Gevers, T.: Joint attention by gaze interpolation and saliency. IEEE Trans. Cybern. 43(3), 829–842 (2013)CrossRef Yücel, Z., Salah, A.A., Mericli, C., Meriçli, T., Valenti, R., Gevers, T.: Joint attention by gaze interpolation and saliency. IEEE Trans. Cybern. 43(3), 829–842 (2013)CrossRef
68.
Zurück zum Zitat Zen, G., Lepri, B., Ricci, E., Lanz, O.: Space speaks: towards socially and personality aware visual surveillance. In: 1st ACM International Workshop on Multimodal Pervasive Video Analysis, pp. 37–42. ACM, Firenze, Italy (2010) Zen, G., Lepri, B., Ricci, E., Lanz, O.: Space speaks: towards socially and personality aware visual surveillance. In: 1st ACM International Workshop on Multimodal Pervasive Video Analysis, pp. 37–42. ACM, Firenze, Italy (2010)
69.
Zurück zum Zitat Zhang, L., Li, Y., Nevatia, R.: Global data association for multi-object tracking using network flows. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2008, pp. 1–8. IEEE (2008) Zhang, L., Li, Y., Nevatia, R.: Global data association for multi-object tracking using network flows. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2008, pp. 1–8. IEEE (2008)
70.
Zurück zum Zitat Zhao, G., Chen, L., Song, J., Chen, G.: Large head movement tracking using sift-based registration. In: Proceedings of the 15th International Conference on Multimedia, pp. 807–810. ACM (2007) Zhao, G., Chen, L., Song, J., Chen, G.: Large head movement tracking using sift-based registration. In: Proceedings of the 15th International Conference on Multimedia, pp. 807–810. ACM (2007)
71.
Zurück zum Zitat Zhu, X., Ramanan, D.: Face detection, pose estimation, and landmark localization in the wild. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2879–2886. IEEE (2012) Zhu, X., Ramanan, D.: Face detection, pose estimation, and landmark localization in the wild. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2879–2886. IEEE (2012)
Metadaten
Titel
Multiple-Gaze Geometry: Inferring Novel 3D Locations from Gazes Observed in Monocular Video
verfasst von
Ernesto Brau
Jinyan Guan
Tanya Jeffries
Kobus Barnard
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-030-01225-0_38