skip to main content
research-article

Where2Stand: A Human Position Recommendation System for Souvenir Photography

Published:07 October 2015Publication History
Skip Abstract Section

Abstract

People often take photographs at tourist sites and these pictures usually have two main elements: a person in the foreground and scenery in the background. This type of “souvenir photo” is one of the most common photos clicked by tourists. Although algorithms that aid a user-photographer in taking a well-composed picture of a scene exist [Ni et al. 2013], few studies have addressed the issue of properly positioning human subjects in photographs. In photography, the common guidelines of composing portrait images exist. However, these rules usually do not consider the background scene. Therefore, in this article, we investigate human-scenery positional relationships and construct a photographic assistance system to optimize the position of human subjects in a given background scene, thereby assisting the user in capturing high-quality souvenir photos. We collect thousands of well-composed portrait photographs to learn human-scenery aesthetic composition rules. In addition, we define a set of negative rules to exclude undesirable compositions. Recommendation results are achieved by combining the first learned positive rule with our proposed negative rules. We implement the proposed system on an Android platform in a smartphone. The system demonstrates its efficacy by producing well-composed souvenir photos.

References

  1. Radhakrishna Achanta, Francisco Estrada, Patricia Wils, and Sabine Süsstrunk. 2008. Salient region detection and segmentation. In Computer Vision Systems. Springer, 66--75. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Jongmin Baek, Dawid Pajak, Kihwan Kim, Kari Pulli, and Marc Levoy. 2013. WYSIWYG computational photography via viewfinder editing. ACM Transactions on Graphics 32, 6 (2013), 198. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Dana H Ballard. 1981. Generalizing the Hough transform to detect arbitrary shapes. Pattern Recognition 13, 2 (1981), 111--122.Google ScholarGoogle ScholarCross RefCross Ref
  4. Subhabrata Bhattacharya, Rahul Sukthankar, and Mubarak Shah. 2010. A framework for photo-quality assessment and enhancement based on visual aesthetics. In Proceedings of the International Conference on Multimedia. ACM, 271--280. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Dmitri Bitouk, Neeraj Kumar, Samreen Dhillon, Peter Belhumeur, and Shree K. Nayar. 2008. Face swapping: Automatically replacing faces in photographs. ACM Transactions on Graphics 27, 3 (2008), 39. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Charles A. Bouman, Michael Shapiro, G. W. Cook, C. Brian Atkins, and Hui Cheng. 1997. Cluster: An Unsupervised Algorithm for Modeling Gaussian Mixtures. https://engineering.purdue.edu/∼bouman/software/cluster/. (1997).Google ScholarGoogle Scholar
  7. Tao Chen, Ming-Ming Cheng, Ping Tan, Ariel Shamir, and Shi-Min Hu. 2009. Sketch2Photo: Internet image montage. ACM Transactions on Graphics 28, 5 (2009), 124. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Ming-Ming Cheng, Guo-Xin Zhang, Niloy J. Mitra, Xiaolei Huang, and Shi-Min Hu. 2011. Global contrast based salient region detection. In Proceedings of Computer Vision and Pattern Recognition. IEEE, 409--416. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Franklin C. Crow. 1984. Summed-area tables for texture mapping. In Proceedings of ACM SIGGRAPH, Vol. 18. ACM, 207--212. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Arthur P. Dempster, Nan M. Laird, and Donald B. Rubin. 1977. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society. Series B (1977), 1--38.Google ScholarGoogle Scholar
  11. Zeev Farbman, Gil Hoffer, Yaron Lipman, Daniel Cohen-Or, and Dani Lischinski. 2009. Coordinates for instant image cloning. ACM Transactions on Graphics 28, 3 (2009), 67. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Douglas Hoffman. 2013. Portrait Photography Tip—Avoid Putting Peoples Head in the Horizon Line. http://mauiphototours.net/?p=389. (2013).Google ScholarGoogle Scholar
  13. Jiaya Jia, Jian Sun, Chi-Keung Tang, and Heung-Yeung Shum. 2006. Drag-and-drop pasting. ACM Transactions on Graphics 25, 3 (2006), 631--637. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Tapas Kanungo, David M. Mount, Nathan S. Netanyahu, Christine D. Piatko, Ruth Silverman, and Angela Y. Wu. 2002. An efficient k-means clustering algorithm: Analysis and implementation. IEEE Transactions on Pattern Analysis and Machine Intelligence 24, 7 (2002), 881--892. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Yan Ke, Xiaoou Tang, and Feng Jing. 2006. The design of high-level features for photo quality assessment. In Proceedings of Computer Vision and Pattern Recognition, Vol. 1. IEEE, 419--426. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Bert Krages. 2005. Photography: The art of composition. (1st ed.). Allworth Press.Google ScholarGoogle Scholar
  17. Daniel Kuettel and Vittorio Ferrari. 2012. Figure-ground segmentation by transferring window masks. In Proc. of Computer Vision and Pattern Recognition. IEEE, 558--565. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Ivan O. Kyrgyzov, Olexiy O. Kyrgyzov, Henri Maître, and Marine Campedel. 2007. Kernel MDL to determine the number of clusters. In Machine Learning and Data Mining in Pattern Recognition. Springer, 203--217. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Ligang Liu, Renjie Chen, Lior Wolf, and Daniel Cohen-Or. 2010a. Optimizing photo composition. Computer Graphics Forum 29, 2 (2010), 469--478.Google ScholarGoogle ScholarCross RefCross Ref
  20. Ligang Liu, Yong Jin, and Qingbiao Wu. 2010b. Realtime aesthetic image retargeting. In Proceedings of the International Conference on Computational Aesthetics in Graphics, Visualization and Imaging. Eurographics Association, 1--8. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Wei Luo, Xiaogang Wang, and Xiaoou Tang. 2011. Content-based photo quality assessment. In Proceedings of the International Conference on Computer Vision. IEEE, 2206--2213. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Yiwen Luo and Xiaoou Tang. 2008. Photo and video quality evaluation: Focusing on the subject. In Proceedings of the European Conference on Computer Vision. Springer, 386--399. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Bingbing Ni, Mengdi Xu, Bin Cheng, Meng Wang, Shuicheng Yan Yan, and Qi Tian. 2013. Learning to photograph: A compositional perspective. IEEE Transactions on Multimedia 15, 5 (2013), 1138--1151. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Masashi Nishiyama, Takahiro Okabe, Yoichi Sato, and Imari Sato. 2009. Sensation-based photo cropping. In Proceedings of the 17th ACM International Conference on Multimedia. ACM, 669--672. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Patrick Pérez, Michel Gangnet, and Andrew Blake. 2003. Poisson image editing. ACM Transactions on Graphics 22, 3 (2003), 313--318. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Carsten Rother. 2002. A new approach to vanishing point detection in architectural environments. Image and Vision Computing 20, 9 (2002), 647--655.Google ScholarGoogle ScholarCross RefCross Ref
  27. Michael W. Tao, Micah K. Johnson, and Sylvain Paris. 2010. Error-tolerant image compositing. In Proceedings of the European Conference on Computer Vision. Springer, 31--44. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Paul Viola and Michael Jones. 2001. Rapid object detection using a boosted cascade of simple features. In Proceedings of the 2001 IEEE Conference on Computer Vision and Pattern Recognition, Vol. 1. I-511--I-518.Google ScholarGoogle ScholarCross RefCross Ref
  29. Jianzhou Yan, Stephen Lin, Sing Bing Kang, and Xiaoou Tang. 2013. Learning the change for automatic image cropping. In Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 971--978. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Luming Zhang, Mingli Song, Qi Zhao, Xiao Liu, Jiajun Bu, and Chun Chen. 2013. Probabilistic graphlet transfer for photo cropping. IEEE Transactions on Image Processing 22, 2 (2013), 802--815. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Yanhao Zhang, Xiaoshuai Sun, Hongxun Yao, Lei Qin, and Qingming Huang. 2012. Aesthetic composition representation for portrait photographing recommendation. In Proceedings of the International Conference on Image Processing. IEEE, 2753--2756.Google ScholarGoogle Scholar
  32. David Ziser. 2010. Captured by the Light: The Essential Guide to Creating Extraordinary Wedding Photography. Pearson Education.Google ScholarGoogle Scholar
  33. Monte Zucker. 2007. Monte Zucker’s Portrait Photography Handbook. Amherst Media, Inc.Google ScholarGoogle Scholar

Index Terms

  1. Where2Stand: A Human Position Recommendation System for Souvenir Photography

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      • Published in

        cover image ACM Transactions on Intelligent Systems and Technology
        ACM Transactions on Intelligent Systems and Technology  Volume 7, Issue 1
        October 2015
        293 pages
        ISSN:2157-6904
        EISSN:2157-6912
        DOI:10.1145/2830012
        • Editor:
        • Yu Zheng
        Issue’s Table of Contents

        Copyright © 2015 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 7 October 2015
        • Accepted: 1 April 2015
        • Revised: 1 March 2015
        • Received: 1 November 2014
        Published in tist Volume 7, Issue 1

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Research
        • Refereed

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader