ABSTRACT
Live migration is one of the key technologies to improve data center utilization, power efficiency, and maintenance. Various live migration algorithms have been proposed; each exhibiting distinct characteristics in terms of completion time, amount of data transferred, virtual machine (VM) downtime, and VM performance degradation. To make matters worse, not only the migration algorithm but also the applications running inside the migrated VM affect the different performance metrics. With service-level agreements and operational constraints in place, choosing the optimal live migration technique has so far been an open question. In this work, we propose an adaptive machine learning-based model that is able to predict with high accuracy the key characteristics of live migration in dependence of the migration algorithm and the workload running inside the VM. We discuss the important input parameters for accurately modeling the target metrics, and describe how to profile them with little overhead. Compared to existing work, we are not only able to model all commonly used migration algorithms but also predict important metrics that have not been considered so far such as the performance degradation of the VM. In a comparison with the state-of-the-art, we show that the proposed model outperforms existing work by a factor 2 to 5.
- Sherif Akoush, Ripduman Sohan, Andrew Rice, Andrew W. Moore, and Andy Hopper. 2010. Predicting the Performance of Virtual Machine Migration. In Proceedings of the 2010 IEEE International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems (MASCOTS '10). IEEE Computer Society, Washington, DC, USA, 37--46. Google ScholarDigital Library
- Arwa Aldhalaan and Daniel A. Menascé. 2013. Analytic Performance Modeling and Optimization of Live VM Migration. In Proceedings of 10th European Workshop (EPEW '13). 28--42. Google ScholarCross Ref
- Amazon EC2 - Virtual Server Hosting 2017. https://aws.amazon.com/ec2/. (2017). Online; accessed August 2017.Google Scholar
- Fabrice Bellard. 2005. QEMU, a Fast and Portable Dynamic Translator. In Proceedings of the Annual Conference on USENIX Annual Technical Conference (ATEC '05). USENIX Association, Berkeley, CA, USA, 41--41. http://dl.acm.org/citation.cfm?id=1247360.1247401Google ScholarDigital Library
- Christian Bienia, Sanjeev Kumar, Jaswinder Pal Singh, and Kai Li. 2008. The PARSEC Benchmark Suite: Characterization and Architectural Implications. In Proceedings of the 17th International Conference on Parallel Architectures and Compilation Techniques (PACT '08). ACM, New York, NY, USA, 72--81. Google ScholarDigital Library
- Stephen M. Blackburn, Robin Garner, Chris Hoffmann, Asjad M. Khang, Kathryn S. McKinley, Rotem Bentzur, Amer Diwan, Daniel Feinberg, Daniel Frampton, Samuel Z. Guyer, Martin Hirzel, Antony Hosking, Maria Jump, Han Lee, J. Eliot B. Moss, Aashish Phansalkar, Darko Stefanović, Thomas VanDrunen, Daniel von Dincklage, and Ben Wiedermann. 2006. The DaCapo Benchmarks: Java Benchmarking Development and Analysis. In Proceedings of the 21st Annual ACM SIGPLAN Conference on Object-oriented Programming Systems, Languages, and Applications (OOPSLA '06). ACM, New York, NY, USA, 169--190. Google ScholarDigital Library
- Leo Breiman. 1996. Bagging predictors. Machine Learning 24, 2 (1996), 123--140. Google ScholarDigital Library
- A. Colin Cameron and Frank A.G. Windmeijer. 1997. An R-squared measure of goodness of fit for some common nonlinear regression models. Journal of Econometrics 77, 2 (1997), 329 -- 342. Google ScholarCross Ref
- Ron C. Chiang, Jinho Hwang, H. Howie Huang, and Timothy Wood. 2014. Matrix: Achieving Predictable Virtual Machine Performance in the Clouds. In 11th International Conference on Autonomic Computing (ICAC 14). USENIX Association, Philadelphia, PA, 45--56. https://www.usenix.org/conference/icac14/technical-sessions/presentation/chiangGoogle Scholar
- Christopher Clark, Keir Fraser, Steven Hand, Jacob Gorm Hansen, Eric Jul, Christian Limpach, Ian Pratt, and Andrew Warfield. 2005. Live Migration of Virtual Machines. In Proceedings of the 2nd Conference on Symposium on Networked Systems Design & Implementation - Volume 2 (NSDI'05). USENIX Association, Berkeley, CA, USA, 273--286. http://dl.acm.org/citation.cfm?id=1251203.1251223Google ScholarDigital Library
- Christina Delimitrou and Christos Kozyrakis. 2013. QoS-Aware Scheduling in Heterogeneous Datacenters with Paragon. ACM Trans. Comput. Syst. 31, 4, Article 12 (Dec. 2013), 34 pages. Google ScholarDigital Library
- Christina Delimitrou and Christos Kozyrakis. 2014. Quasar: Resource-efficient and QoS-aware Cluster Management. In Proceedings of the 19th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS '14). ACM, New York, NY, USA, 127--144. Google ScholarDigital Library
- Li Deng, Hai Jin, Huacai Chen, and Song Wu. 2013. Migration Cost Aware Mitigating Hot Nodes in the Cloud. In Proceedings of the 2013 International Conference on Cloud Computing and Big Data (CLOUDCOM-ASIA '13). IEEE Computer Society, Washington, DC, USA, 197--204. Google ScholarDigital Library
- Djellel Eddine Difallah, Andrew Pavlo, Carlo Curino, and Philippe Cudre-Mauroux. 2013. OLTP-Bench: An Extensible Testbed for Benchmarking Relational Databases. Proc. VLDB Endow. 7, 4 (Dec. 2013), 277--288. Google ScholarDigital Library
- Jim Gao and Ratnesh Jamidar. 2014. Machine learning applications for data center optimization. Google White Paper (2014).Google Scholar
- Google Compute Engine 2017. https://cloud.google.com/compute. (2017). Online; accessed August 2017.Google Scholar
- Google Compute Engine uses Live Migration technology to service infrastructure without application downtime 2017. https://goo.gl/Ui3HFd. (2017). Online; accessed August 2017.Google Scholar
- Fabien Hermenier, Xavier Lorca, Jean-Marc Menaud, Gilles Muller, and Julia Lawall. 2009. Entropy: A Consolidation Manager for Clusters. In Proceedings of the 2009 ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments (VEE '09). ACM, New York, NY, USA, 41--50. Google ScholarDigital Library
- Michael R. Hines and Kartik Gopalan. 2009. Post-copy Based Live Virtual Machine Migration Using Adaptive Pre-paging and Dynamic Self-ballooning. In Proceedings of the 2009 ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments (VEE '09). ACM, New York, NY, USA, 51--60. Google ScholarDigital Library
- Hai Jin, Li Deng, Song Wu, Xuanhua Shi, and Xiaodong Pan. 2009. Live virtual machine migration with adaptive, memory compression. In 2009 IEEE International Conference on Cluster Computing and Workshops. 1--10. Google ScholarCross Ref
- Changyeon Jo and Bernhard Egger. 2013. Optimizing Live Migration for Virtual Desktop Clouds. In IEEE 5th International Conference on Cloud Computing Technology and Science (CloudCom '13), Vol. 1. 104--111. Google ScholarDigital Library
- Changyeon Jo, Erik Gustafsson, Jeongseok Son, and Bernhard Egger. 2013. Efficient Live Migration of Virtual Machines Using Shared Storage. In Proceedings of the 9th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments (VEE '13). ACM, New York, NY, USA, 41--50. Google ScholarDigital Library
- Jonathan Koomey. 2011. Growth in data center electricity use 2005 to 2010. A report by Analytical Press, completed at the request of The New York Times 9 (2011).Google Scholar
- Sajib Kundu, Raju Rangaswami, Ajay Gulati, Ming Zhao, and Kaushik Dutta. 2012. Modeling Virtualized Applications Using Machine Learning Techniques. In Proceedings of the 8th ACM SIGPLAN/SIGOPS Conference on Virtual Execution Environments (VEE '12). ACM, New York, NY, USA, 3--14. Google ScholarDigital Library
- Jianxin Li, Jieyu Zhao, Yi Li, Lei Cui, Bo Li, Lu Liu, and John Panneerselvam. 2014. iMIG: Toward an Adaptive Live Migration Method for KVM Virtual Machines. Comput. J. 58, 6 (2014), 1227. Google ScholarCross Ref
- Haikun Liu and Bingsheng He. 2015. VMbuddies: Coordinating Live Migration of Multi-Tier Applications in Cloud Environments. IEEE Transactions on Parallel and Distributed Systems 26, 4 (April 2015), 1192--1205. Google ScholarDigital Library
- Haikun Liu, Hai Jin, Cheng-Zhong Xu, and Xiaofei Liao. 2013. Performance and energy modeling for live migration of virtual machines. Cluster Computing 16, 2 (2013), 249--264. Google ScholarDigital Library
- Zhaobin Liu, Wenyu Qu, Weijiang Liu, and Keqiu Li. 2010. Xen Live Migration with Slowdown Scheduling Algorithm. In Proceedings of the 2010 International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT '10). IEEE Computer Society, Washington, DC, USA, 215--221. Google ScholarDigital Library
- Vijay Mann, Akanksha Gupta, Partha Dutta, Anilkumar Vishnoi, Parantapa Bhattacharya, Rishabh Poddar, and Aakash Iyer. 2012. Remedy: Network-Aware Steady State VM Management for Data Centers. Springer Berlin Heidelberg, Berlin, Heidelberg, 190--204. Google ScholarDigital Library
- Memcached - a distributed memory object caching system 2017. https://memcached.org/. (2017). Online; accessed August 2017.Google Scholar
- Microsoft Azure - Virtual Machines 2017. https://azure.microsoft.com/en-us/services/virtual-machines/. (2017). Online; accessed August 2017.Google Scholar
- M. Mishra, A. Das, P. Kulkarni, and A. Sahoo. 2012. Dynamic resource management using virtual machine migrations. IEEE Communications Magazine 50, 9 (September 2012), 34--40. Google ScholarCross Ref
- MPlayer - The Movie Player 2017. http://www.mplayerhq.hu/design7/news.html. (2017). Online; accessed August 2017.Google Scholar
- Senthil Nathan, Umesh Bellur, and Purushottam Kulkarni. 2015. Towards a Comprehensive Performance Model of Virtual Machine Live Migration. In Proceedings of the Sixth ACM Symposium on Cloud Computing (SoCC '15). ACM, New York, NY, USA, 288--301. Google ScholarDigital Library
- Senthil Nathan, Umesh Bellur, and Purushottam Kulkarni. 2016. On Selecting the Right Optimizations for Virtual Machine Migration. In Proceedings of the 12th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments (VEE '16). ACM, New York, NY, USA, 37--49. Google ScholarDigital Library
- Senthil Nathan, Purushottam Kulkarni, and Umesh Bellur. 2013. Resource Availability Based Performance Benchmarking of Virtual Machine Migrations. In Proceedings of the 4th ACM/SPEC International Conference on Performance Engineering (ICPE '13). ACM, New York, NY, USA, 387--398. Google ScholarDigital Library
- Michael Nelson, Beng-Hong Lim, and Greg Hutchins. 2005. Fast Transparent Migration for Virtual Machines. In Proceedings of the Annual Conference on USENIX Annual Technical Conference (ATEC '05). USENIX Association, Berkeley, CA, USA, 25--25. http://dl.acm.org/citation.cfm?id=1247360.1247385Google ScholarDigital Library
- Hiep Nguyen, Zhiming Shen, Xiaohui Gu, Sethuraman Subbiah, and John Wilkes. 2013. AGILE: Elastic Distributed Resource Scaling for Infrastructure-as-a-Service. In Proceedings of the 10th International Conference on Autonomic Computing (ICAC 13). USENIX, San Jose, CA, 69--82. https://www.usenix.org/conference/icac13/technical-sessions/presentation/nguyenGoogle Scholar
- Dejan Novaković, Nedeljko Vasić, Stanko Novaković, Dejan Kostić, and Ricardo Bianchini. 2013. DeepDive: Transparently Identifying and Managing Performance Interference in Virtualized Environments. In Proceedings of the 2013 USENIX Conference on Annual Technical Conference (USENIX ATC'13). USENIX Association, Berkeley, CA, USA, 219--230. http://dl.acm.org/citation.cfm?id=2535461.2535489Google Scholar
- Greg Schulz. 2009. The green and virtual data center. Auerbach Publications Boston.Google Scholar
- scikit-learn - Machine Learning in Python 2017. http://scikit-learn.org/stable/. (2017). Online; accessed August 2017.Google Scholar
- C. E. Shannon. 2001. A Mathematical Theory of Communication. SIGMOBILE Mob. Comput. Commun. Rev. 5, 1 (Jan. 2001), 3--55. Google ScholarDigital Library
- SPECweb2009 2017. https://www.spec.org/web2009/. (2017). Online; accessed August 2017.Google Scholar
- Petter Svärd, Benoit Hudzia, Johan Tordsson, and Erik Elmroth. 2011. Evaluation of Delta Compression Techniques for Efficient Live Migration of Large Virtual Machines. In Proceedings of the 7th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments (VEE '11). ACM, New York, NY, USA, 111--120. Google ScholarDigital Library
- Petter Svärd, Benoit Hudzia, Steve Walsh, Johan Tordsson, and Erik Elmroth. 2015. Principles and Performance Characteristics of Algorithms for Live VM Migration. SIGOPS Operating Systems Review - Special Issue on Repeatability and Sharing of Experimental Artifacts 49, 1 (Jan. 2015), 142--155. Google ScholarDigital Library
- The Xen Project 2017. https://www.xenproject.org/. (2017). Online; accessed August 2017.Google Scholar
- Marvin M. Theimer, Keith A. Lantz, and David R. Cheriton. 1985. Preemptable Remote Execution Facilities for the V-system. In Proceedings of the Tenth ACM Symposium on Operating Systems Principles (SOSP '85). ACM, New York, NY, USA, 2--12. Google ScholarDigital Library
- VMware Virtualization Solutions 2017. http://www.vmware.com/. (2017). Online; accessed August 2017.Google Scholar
- William Voorsluys, James Broberg, Srikumar Venugopal, and Rajkumar Buyya. 2009. Cost of Virtual Machine Live Migration in Clouds: A Performance Evaluation. In Proceedings of the 1st International Conference on Cloud Computing (CloudCom '09). Springer-Verlag, Berlin, Heidelberg, 254--265. Google ScholarDigital Library
- Yangyang Wu and Ming Zhao. 2011. Performance Modeling of Virtual Machine Live Migration. In Proceedings of the 2011 IEEE 4th International Conference on Cloud Computing (CLOUD '11). IEEE Computer Society, Washington, DC, USA, 492--499. Google ScholarDigital Library
- Fei Xu, Fangming Liu, Linghui Liu, Hai Jin, Bo Li, and Baochun Li. 2014. iAware: Making Live Migration of Virtual Machines Interference-Aware in the Cloud. IEEE Trans. Comput. 63, 12 (Dec 2014), 3012--3025. Google ScholarDigital Library
- Kejiang Ye, Zhaohui Wu, Chen Wang, Bing Bing Zhou, Weisheng Si, Xiaohong Jiang, and Albert Y Zomaya. 2015. Profiling-Based Workload Consolidation and Migration in Virtualized Data Centers. IEEE Transactions on Parallel and Distributed Systems 26, 3 (March 2015), 878--890. Google ScholarDigital Library
- Jiao Zhang, Fengyuan Ren, and Chuang Lin. 2014. Delay guaranteed live migration of Virtual Machines. In IEEE INFOCOM 2014 - IEEE Conference on Computer Communications. 574--582. Google ScholarCross Ref
- Jie Zheng, TS Ng, Kunwadee Sripanidkulchai, and Zhaolei Liu. 2013. Pacer: A Progress Management System for Live Virtual Machine Migration in Cloud Computing. IEEE Transactions on Network and Service Management 10, 4 (December 2013), 369--382. Google ScholarCross Ref
Index Terms
- A machine learning approach to live migration modeling
Recommendations
A quantitative study of virtual machine live migration
CAC '13: Proceedings of the 2013 ACM Cloud and Autonomic Computing ConferenceVirtual machine (VM) live migration is a critical feature for managing virtualized environments, enabling dynamic load balancing, consolidation for power management, preparation for planned maintenance, and other management features. However, not all ...
Improving Total Migration Time in Live Virtual Machine Migration
ICCCT '15: Proceedings of the Sixth International Conference on Computer and Communication Technology 2015Virtualization is the key underlying technology enabling cloud providers to host services for a large number of customers. Live migration is an essential feature of virtualization that allows transfer of virtual machines from one physical server to ...
A Hypervisor Approach to Enable Live Migration with Passthrough SR-IOV Network Devices
Special TopicsSingle-Root I/O Virtualization (SR-IOV) is a specification that allows a single PCI Express (PCIe) device (physical function or PF) to be used as multiple PCIe devices (virtual functions or VF). In a virtualization system, each VF can be directly ...
Comments