Skip to main content

2019 | OriginalPaper | Buchkapitel

Profit Maximization and Time Minimization Admission Control and Resource Scheduling for Cloud-Based Big Data Analytics-as-a-Service Platforms

verfasst von : Yali Zhao, Rodrigo N. Calheiros, Athanasios V. Vasilakos, James Bailey, Richard O. Sinnott

Erschienen in: Web Services – ICWS 2019

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Big data analytics typically requires large amounts of resources to process ever-increasing data volumes. This can be time consuming and result in considerable expenses. Analytics-as-a-Service (AaaS) platforms provide a way to tackle expensive resource costs and lengthy data processing times by leveraging automatic resource management with a pay-per-use service delivery model. This paper explores optimization of resource management algorithms for AaaS platforms to automatically and elastically provision cloud resources to execute queries with Service Level Agreement (SLA) guarantees. We present admission control and cloud resource scheduling algorithms that serve multiple objectives including profit maximization for AaaS platform providers and query time minimization for users. Moreover, to enable queries that require timely responses and/or have constrained budgets, we apply data sampling-based admission control and resource scheduling where accuracy can be traded-off for reduced costs and quicker responses when necessary. We conduct extensive experimental evaluations for the algorithm performances compared to state-of-the-art algorithms. Experiment results show that our proposed algorithms perform significantly better in increasing query admission rates, consuming less resources and hence reducing costs, and ultimately provide a more flexible resource management solution for fast, cost-effective, and reliable big data processing.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Assunção, M.D., Calheiros, R.N., Bianchi, S., Netto, M.A., Buyya, R.: Big data computing and clouds: trends and future directions. J. Parallel Distrib. Comput. 79, 3–15 (2015)CrossRef Assunção, M.D., Calheiros, R.N., Bianchi, S., Netto, M.A., Buyya, R.: Big data computing and clouds: trends and future directions. J. Parallel Distrib. Comput. 79, 3–15 (2015)CrossRef
2.
Zurück zum Zitat Hashem, I.A.T., Yaqoob, I., Anuar, N.B., Mokhtar, S., Gani, A., Khan, U.: The rise of ‘big data’ on cloud computing: review and open research issues. Inf. Syst. 47, 98–115 (2014)CrossRef Hashem, I.A.T., Yaqoob, I., Anuar, N.B., Mokhtar, S., Gani, A., Khan, U.: The rise of ‘big data’ on cloud computing: review and open research issues. Inf. Syst. 47, 98–115 (2014)CrossRef
3.
Zurück zum Zitat Zhao, Y., Calheiros, R.N., Bailey, J., Sinnott, R.: SLA-based profit optimization for resource management of big data analytics-as-a-service platforms in cloud computing environments. In: Proceedings of the IEEE International Conference on Big Data, pp. 432–441 (2016) Zhao, Y., Calheiros, R.N., Bailey, J., Sinnott, R.: SLA-based profit optimization for resource management of big data analytics-as-a-service platforms in cloud computing environments. In: Proceedings of the IEEE International Conference on Big Data, pp. 432–441 (2016)
4.
Zurück zum Zitat Chaudhuri, S., Das, G., Narasayya, V.: Optimized stratified sampling for approximate query processing. ACM Trans. Database Syst. (TODS) 32(2), 9 (2007)CrossRef Chaudhuri, S., Das, G., Narasayya, V.: Optimized stratified sampling for approximate query processing. ACM Trans. Database Syst. (TODS) 32(2), 9 (2007)CrossRef
5.
Zurück zum Zitat Benayoun, R., De Montgolfier, J., Tergny, J., Laritchev, O.: Linear programming with multiple objective functions: step method (STEM). Math. Program. 1, 366–375 (1971)MathSciNetCrossRef Benayoun, R., De Montgolfier, J., Tergny, J., Laritchev, O.: Linear programming with multiple objective functions: step method (STEM). Math. Program. 1, 366–375 (1971)MathSciNetCrossRef
6.
Zurück zum Zitat Isermann, H.: Linear lexicographic optimization. OR Spektrum 4, 223–228 (1982)CrossRef Isermann, H.: Linear lexicographic optimization. OR Spektrum 4, 223–228 (1982)CrossRef
7.
Zurück zum Zitat Calheiros, R.N., Ranjan, R., Beloglazov, A., De Rose, C.A., Buyya, R.: CloudSim: a toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms. Softw. Pract. Exp. 41(1), 23–50 (2011)CrossRef Calheiros, R.N., Ranjan, R., Beloglazov, A., De Rose, C.A., Buyya, R.: CloudSim: a toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms. Softw. Pract. Exp. 41(1), 23–50 (2011)CrossRef
11.
Zurück zum Zitat Agarwal, S., Mozafari, B., Panda, A., Milner, H., Madden, S., Stoica, I.: BlinkDB: queries with bounded errors and bounded response times on very large data. In: Proceedings of the 8th ACM European Conference on Computer Systems, p. 29 (2013) Agarwal, S., Mozafari, B., Panda, A., Milner, H., Madden, S., Stoica, I.: BlinkDB: queries with bounded errors and bounded response times on very large data. In: Proceedings of the 8th ACM European Conference on Computer Systems, p. 29 (2013)
12.
Zurück zum Zitat Zhao, Y., Calheiros, R.N., Gange, G., Ramamohanarao, K., Buyya, R.: SLA-based resource scheduling for big data analytics as a service in cloud computing environments. In: Proceedings of the 44th IEEE International Conference on Parallel Processing, pp. 510–519 (2015) Zhao, Y., Calheiros, R.N., Gange, G., Ramamohanarao, K., Buyya, R.: SLA-based resource scheduling for big data analytics as a service in cloud computing environments. In: Proceedings of the 44th IEEE International Conference on Parallel Processing, pp. 510–519 (2015)
13.
Zurück zum Zitat Zhao, Y., Calheiros, R.N., Gange, G., Bailey, J., Sinnott, R.: SLA-based profit optimization for resource scheduling of big data analytics-as-a-service in cloud computing environments. IEEE Trans. Cloud Comput. 1–18 (2018) Zhao, Y., Calheiros, R.N., Gange, G., Bailey, J., Sinnott, R.: SLA-based profit optimization for resource scheduling of big data analytics-as-a-service in cloud computing environments. IEEE Trans. Cloud Comput. 1–18 (2018)
14.
Zurück zum Zitat Zhang, H., Zhao, Y., Pang, C., He, J.: Splitting large medical data sets based on normal distribution in cloud environment. IEEE Trans. Cloud Comput. (99), 1 (2015, in press) Zhang, H., Zhao, Y., Pang, C., He, J.: Splitting large medical data sets based on normal distribution in cloud environment. IEEE Trans. Cloud Comput. (99), 1 (2015, in press)
15.
Zurück zum Zitat Tordini, F., Aldinucci, M., Viviani, P., Merelli, I., Lio, P.: Scientific workflows on clouds with heterogeneous and preemptible instances. In: Proceedings of the International Conference on Parallel Computing, pp. 605–614 (2017) Tordini, F., Aldinucci, M., Viviani, P., Merelli, I., Lio, P.: Scientific workflows on clouds with heterogeneous and preemptible instances. In: Proceedings of the International Conference on Parallel Computing, pp. 605–614 (2017)
16.
Zurück zum Zitat Mian, R., Martin, P., Vazquez-Poletti, J.: Provisioning data analytic workloads in a cloud. Future Gener. Comput. Syst. 29(6), 1452–1458 (2013)CrossRef Mian, R., Martin, P., Vazquez-Poletti, J.: Provisioning data analytic workloads in a cloud. Future Gener. Comput. Syst. 29(6), 1452–1458 (2013)CrossRef
17.
Zurück zum Zitat Wang, K., Zhou, X., Li, T., Zhao, D., Lang, M., Raicu, I.: Optimizing load balancing and data-locality with data-aware scheduling. In: Proceedings of the 2014 IEEE International Conference on Big Data, pp. 119–128 (2015) Wang, K., Zhou, X., Li, T., Zhao, D., Lang, M., Raicu, I.: Optimizing load balancing and data-locality with data-aware scheduling. In: Proceedings of the 2014 IEEE International Conference on Big Data, pp. 119–128 (2015)
18.
Zurück zum Zitat Xia, Q., Xu, Z., Liang, W., Zomaya, A.Y.: Collaboration-and fairness-aware big data management in distributed clouds. IEEE Trans. Parallel Distrib. Syst. 27(7), 1941–1953 (2015)CrossRef Xia, Q., Xu, Z., Liang, W., Zomaya, A.Y.: Collaboration-and fairness-aware big data management in distributed clouds. IEEE Trans. Parallel Distrib. Syst. 27(7), 1941–1953 (2015)CrossRef
19.
Zurück zum Zitat Garg, S.K., Toosi, A.N., Gopalaiyengar, S.K., Buyya, R.: SLA-based virtual machine management for heterogeneous workloads in a cloud datacenter. J. Netw. Comput. Appl. 45, 108–120 (2014)CrossRef Garg, S.K., Toosi, A.N., Gopalaiyengar, S.K., Buyya, R.: SLA-based virtual machine management for heterogeneous workloads in a cloud datacenter. J. Netw. Comput. Appl. 45, 108–120 (2014)CrossRef
20.
Zurück zum Zitat Gu, L., Zeng, D., Li, P., Guo, S.: Cost minimization for big data processing in geo-distributed data centers. IEEE Trans. Emerg. Top. Comput. 2(3), 314–323 (2014)CrossRef Gu, L., Zeng, D., Li, P., Guo, S.: Cost minimization for big data processing in geo-distributed data centers. IEEE Trans. Emerg. Top. Comput. 2(3), 314–323 (2014)CrossRef
21.
Zurück zum Zitat Zheng, W., Qin, Y., Bugingo, E., Zhang, D., Chen, J.: Cost optimization for deadline-aware scheduling of big data processing jobs on clouds. Future Gener. Comput. Syst. 82, 244–255 (2018)CrossRef Zheng, W., Qin, Y., Bugingo, E., Zhang, D., Chen, J.: Cost optimization for deadline-aware scheduling of big data processing jobs on clouds. Future Gener. Comput. Syst. 82, 244–255 (2018)CrossRef
22.
Zurück zum Zitat Dai, W., Qiu, L., Wu, A., Qiu, M.: Cloud infrastructure resource allocation for big data applications. IEEE Trans. Big Data 4(3), 313–324 (2018)CrossRef Dai, W., Qiu, L., Wu, A., Qiu, M.: Cloud infrastructure resource allocation for big data applications. IEEE Trans. Big Data 4(3), 313–324 (2018)CrossRef
23.
Zurück zum Zitat Zhou, A.C., He, B., Cheng, X., Lau, C.T.: A declarative optimization engine for resource provisioning of scientific workflows in IaaS clouds. In: Proceedings of the 24th International Symposium on High Performance Parallel Distributed Computing, pp. 223–234 (2015) Zhou, A.C., He, B., Cheng, X., Lau, C.T.: A declarative optimization engine for resource provisioning of scientific workflows in IaaS clouds. In: Proceedings of the 24th International Symposium on High Performance Parallel Distributed Computing, pp. 223–234 (2015)
24.
Zurück zum Zitat Mao, M., Humphrey, M.: Auto-scaling to minimize cost and meet application deadlines in cloud workflows. In: Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis, Seatle, WA, pp. 1–12 (2011) Mao, M., Humphrey, M.: Auto-scaling to minimize cost and meet application deadlines in cloud workflows. In: Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis, Seatle, WA, pp. 1–12 (2011)
25.
Zurück zum Zitat Chen, J., Wang, C., Zhou, B.B., Sun, L., Lee, Y.C., Zomaya, A.Y.: Tradeoffs between profit and customer satisfaction for service provisioning in the cloud. In: Proceedings of the 20th International Symposium on High Performance Distributed Computing, pp. 229–238 (2011) Chen, J., Wang, C., Zhou, B.B., Sun, L., Lee, Y.C., Zomaya, A.Y.: Tradeoffs between profit and customer satisfaction for service provisioning in the cloud. In: Proceedings of the 20th International Symposium on High Performance Distributed Computing, pp. 229–238 (2011)
Metadaten
Titel
Profit Maximization and Time Minimization Admission Control and Resource Scheduling for Cloud-Based Big Data Analytics-as-a-Service Platforms
verfasst von
Yali Zhao
Rodrigo N. Calheiros
Athanasios V. Vasilakos
James Bailey
Richard O. Sinnott
Copyright-Jahr
2019
DOI
https://doi.org/10.1007/978-3-030-23499-7_3

Premium Partner