skip to main content
research-article
Public Access

Transform recipes for efficient cloud photo enhancement

Published:02 November 2015Publication History
Skip Abstract Section

Abstract

Cloud image processing is often proposed as a solution to the limited computing power and battery life of mobile devices: it allows complex algorithms to run on powerful servers with virtually unlimited energy supply. Unfortunately, this overlooks the time and energy cost of uploading the input and downloading the output images. When transfer overhead is accounted for, processing images on a remote server becomes less attractive and many applications do not benefit from cloud offloading. We aim to change this in the case of image enhancements that preserve the overall content of an image. Our key insight is that, in this case, the server can compute and transmit a description of the transformation from input to output, which we call a transform recipe. At equivalent quality, our recipes are much more compact than JPEG images: this reduces the client's download. Furthermore, recipes can be computed from highly compressed inputs which significantly reduces the data uploaded to the server. The client reconstructs a high-fidelity approximation of the output by applying the recipe to its local high-quality input. We demonstrate our results on 168 images and 10 image processing applications, showing that our recipes form a compact representation for a diverse set of image filters. With an equivalent transmission budget, they provide higher-quality results than JPEG-compressed input/output images, with a gain of the order of 10 dB in many cases. We demonstrate the utility of recipes on a mobile phone by profiling the energy consumption and latency for both local and cloud computation: a transform recipe-based pipeline runs 2--4x faster and uses 2--7x less energy than local or naive cloud computation.

Skip Supplemental Material Section

Supplemental Material

References

  1. Aubry, M., Paris, S., Hasinoff, S. W., Kautz, J., and Durand, F. 2014. Fast local laplacian filters: Theory and applications. ACM Trans. Graph. 33, 5 (Sept.), 167:1--167:14. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Barr, K. C., and Asanović, K. 2006. Energy-aware lossless data compression. ACM Trans. Comput. Syst. 24, 3 (Aug.), 250--291. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Berthouzoz, F., Li, W., Dontcheva, M., and Agrawala, M. 2011. A framework for content-adaptive photo manipulation macros: Application to face, landscape, and global manipulations. ACM Transactions on Graphics 30, 5. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Burt, P. J., and Adelson, E. H. 1983. The laplacian pyramid as a compact image code. IEEE Transactions on Communications 31, 4, 532--540.Google ScholarGoogle ScholarCross RefCross Ref
  5. Bychkovsky, V., Paris, S., Chan, E., and Durand, F. 2011. Learning photographic global tonal adjustment with a database of input / output image pairs. In IEEE Conference on Computer Vision and Pattern Recognition. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Chen, Q., Li, D., and Tang, C.-K. 2012. Knn matting. In IEEE Conference on Computer Vision and Pattern Recognition, 869--876. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Deutsch, P., 1996. Deflate compressed data format specification version 1.3.Google ScholarGoogle Scholar
  8. Farbman, Z., Fattal, R., Lischinski, D., and Szeliski, R. 2008. Edge-preserving decompositions for multi-scale tone and detail manipulation. In ACM Transaction on Graphics (SIGGRAPH), ACM, New York, NY, USA, SIGGRAPH '08, 67:1--67:10. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Farbman, Z., Fattal, R., and Lischinski, D. 2011. Convolution pyramids. ACM Transactions on Graphics (Proc. of SIGGRAPH Asia) 30, 6. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Freeman, W. T., and Torralba, A. 2002. Shape recipes: Scene representations that refer to the image. In Vision Sciences Society Annual Meeting, MIT Press, 25--47.Google ScholarGoogle Scholar
  11. Hamilton, E. 1992. Jpeg file interchange format. C-Cube Microsystems.Google ScholarGoogle Scholar
  12. Heeger, D. J., and Bergen, J. R. 1995. Pyramid-based texture analysis/synthesis. In Proceedings of the 22Nd Annual Conference on Computer Graphics and Interactive Techniques, ACM, New York, NY, USA, SIGGRAPH '95, 229--238. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Huang, J., Qian, F., Gerber, A., Mao, Z. M., Sen, S., and Spatscheck, O., 2012. A close examination of performance and power characteristics of 4g lte networks.Google ScholarGoogle Scholar
  14. Huffman, D. 1952. A method for the construction of minimum-redundancy codes. Proceedings of the IRE 40, 9 (Sept), 1098--1101.Google ScholarGoogle ScholarCross RefCross Ref
  15. Jeong, W.-K., Johnson, M. K., Yu, I., Kautz, J., Pfister, H., and Paris, S. 2011. Display-aware image editing. In International Conference on Computational Photography.Google ScholarGoogle Scholar
  16. Kaufman, L., Lischinski, D., and Werman, M. 2012. Content-aware automatic photo enhancement. Computer Graphics Forum 31, 8, 2528--2540. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Kim, J.-H., Jang, W.-D., Sim, J.-Y., and Kim, C.-S. 2013. Optimized contrast enhancement for real-time image and video dehazing. J. Vis. Comun. Image Represent. 24, 3 (Apr.), 410--425. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Kumar, K., Liu, J., Lu, Y.-H., and Bhargava, B. 2013. A survey of computation offloading for mobile systems. Mob. Netw. Appl. 18, 1 (Feb.), 129--140. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Laffont, P.-Y., Ren, Z., Tao, X., Qian, C., and Hays, J. 2014. Transient attributes for high-level understanding and editing of outdoor scenes. ACM Transaction on Graphics (SIGGRAPH) 33, 4 (July), 149:1--149:11. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Lee, K., Chu, D., Cuervo, E., Kopf, J., Grizan, S., Wolman, A., and Flinn, J. 2014. Outatime: Using speculation to enable low-latency continuous interaction for cloud gaming. Tech. Rep. MSR-TR-2014-115.Google ScholarGoogle Scholar
  21. Levin, A., Lischinski, D., and Weiss, Y. 2004. Colorization using optimization. ACM Transaction on Graphics (SIGGRAPH) 23, 3 (Aug.), 689--694. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Levin, A., Lischinski, D., and Weiss, Y. 2008. A closed-form solution to natural image matting. IEEE Transactions on Pattern Analysis and Machine Intelligence 30, 2 (Feb), 228--242. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Levoy, M. 1995. Polygon-assisted jpeg and mpeg compression of synthetic images. In Computer Graphics and Interactive Techniques, ACM, New York, NY, USA, SIGGRAPH '95, 21--28. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. LiKamWa, R., Priyantha, B., Philipose, M., Zhong, L., and Bahl, P. 2013. Energy characterization and optimization of image sensing toward continuous mobile vision. In Proc. of International Conference on Mobile Systems, Applications, and Services, ACM, 69--82. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Mantiuk, R., and Seidel, H.-P. 2008. Modeling a generic tone-mapping operator. Computer Graphics Forum (Proc. of Eurographics) 27, 2.Google ScholarGoogle ScholarCross RefCross Ref
  26. Paris, S., Hasinoff, S. W., and Kautz, J. 2011. Local laplacian filters: Edge-aware image processing with a laplacian pyramid. In ACM Transaction on Graphics (SIGGRAPH), ACM, New York, NY, USA, SIGGRAPH '11, 68:1--68:12. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Rabbani, M., and Jones, P. W. 1991. Digital Image Compression Techniques, 1st ed. Society of Photo-Optical Instrumentation Engineers (SPIE), Bellingham, WA, USA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Ragan-Kelley, J., Adams, A., Paris, S., Levoy, M., Amarasinghe, S., and Durand, F. 2012. Decoupling algorithms from schedules for easy optimization of image processing pipelines. ACM Transactions on Graphics 31, 4 (July), 32:1--32:12. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Ragan-Kelley, J., Barnes, C., Adams, A., Paris, S., Durand, F., and Amarasinghe, S. 2013. Halide: A language and compiler for optimizing parallelism, locality, and recomputation in image processing pipelines. In Proceedings of the 34th ACM SIGPLAN Conference on Programming Language Design and Implementation, ACM, New York, NY, USA, PLDI, 519--530. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Rhemann, C., Rother, C., Wang, J., Gelautz, M., Kohli, P., and Rott, P. 2009. A perceptually motivated online benchmark for image matting. In IEEE Conference on Computer Vision and Pattern Recognition, 1826--1833.Google ScholarGoogle Scholar
  31. Shih, Y., Paris, S., Durand, F., and Freeman, W. T. 2013. Data-driven hallucination of different times of day from a single outdoor photo. ACM Transaction on Graphics (SIGGRAPH) 32, 6 (Nov.), 200:1--200:11. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Shih, Y., Paris, S., Barnes, C., Freeman, W. T., and Durand, F. 2014. Style transfer for headshot portraits. ACM Transaction on Graphics (SIGGRAPH) 33, 4 (July), 148:1--148:14. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Skodras, A., Christopoulos, C., and Ebrahimi, T. 2001. The jpeg 2000 still image compression standard. IEEE Signal Processing Magazine 18, 5, 36--58.Google ScholarGoogle ScholarCross RefCross Ref
  34. Tibshirani, R. 1994. Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society, Series B 58, 267--288.Google ScholarGoogle Scholar
  35. Torralba, A., and Freeman, W. 2003. Properties and applications of shape recipes. In IEEE Conference on Computer Vision and Pattern Recognition, vol. 2, II-383-90 vol.2.Google ScholarGoogle Scholar
  36. Wallace, G. 1992. The jpeg still picture compression standard. IEEE Transactions on Consumer Electronics 38, 1 (Feb), xviii--xxxiv. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Welch, T. 1984. A technique for high-performance data compression. Computer 17, 6 (June), 8--19. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. Witten, I. H., Neal, R. M., and Cleary, J. G. 1987. Arithmetic coding for data compression. Communications of the ACM 30, 6, 520--540. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. Xu, L., Lu, C., Xu, Y., and Jia, J. 2011. Image smoothing via l0 gradient minimization. ACM Transaction on Graphics (SIGGRAPH) 30, 6 (Dec.), 174:1--174:12. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. Ziv, J., and Lempel, A. 1977. A universal algorithm for sequential data compression. IEEE Transactions on Information Theory 23, 3 (May), 337--343. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Transform recipes for efficient cloud photo enhancement

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      • Published in

        cover image ACM Transactions on Graphics
        ACM Transactions on Graphics  Volume 34, Issue 6
        November 2015
        944 pages
        ISSN:0730-0301
        EISSN:1557-7368
        DOI:10.1145/2816795
        Issue’s Table of Contents

        Copyright © 2015 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 2 November 2015
        Published in tog Volume 34, Issue 6

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader