Skip to main content
Top
Published in: Cluster Computing 4/2018

25-08-2018

Planning of distributed data production for High Energy and Nuclear Physics

Authors: Dzmitry Makatun, Jérôme Lauret, Hana Rudová

Published in: Cluster Computing | Issue 4/2018

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Modern experiments in High Energy and Nuclear Physics heavily rely on distributed computations using multiple computational facilities across the world. One of the essential types of the computations is a distributed data production where petabytes of raw files from a single source has to be processed once (per production campaign) using thousands of CPUs at distant locations and the output has to be transferred back to that source. The data distribution over a large system does not necessary match the distribution of storage, network and CPU capacity. Therefore, bottlenecks may appear and lead to increased latency and degraded performance. In this paper we propose a new scheduling approach for distributed data production which is based on the network flow maximization model. In our approach a central planner defines how much input and output data should be transferred over each network link in order to maximize the computational throughput. Such plans are created periodically for a fixed planning time interval using up-to-date information on network, storage and CPU resources. The centrally created plans are executed in a distributed manner by dedicated services running at participating sites. Our simulations based on the log records from the data production framework of the experiment STAR (Solenoid Tracker at RHIC) have shown that the proposed model systematically provides a better performance compared to the simulated traditional techniques.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Dean, J., Ghemawat, S.: MapReduce: simplified data processing on large clusters. Commun. ACM 51, 107–113 (2008)CrossRef Dean, J., Ghemawat, S.: MapReduce: simplified data processing on large clusters. Commun. ACM 51, 107–113 (2008)CrossRef
2.
go back to reference Ghemawat, S., Gobioff, H., Leung, S.T.: The Google file system. In: ACM SIGOPS Operating Systems Review, vol. 37, pp. 29–43. ACM (2003) Ghemawat, S., Gobioff, H., Leung, S.T.: The Google file system. In: ACM SIGOPS Operating Systems Review, vol. 37, pp. 29–43. ACM (2003)
3.
go back to reference Isard, M., Budiu, M., Yu, Y., Birrell, A., Fetterly, D.: Dryad: distributed data-parallel programs from sequential building blocks. In: ACM SIGOPS Operating Systems Review, vol. 41, pp. 59–72. ACM (2007) Isard, M., Budiu, M., Yu, Y., Birrell, A., Fetterly, D.: Dryad: distributed data-parallel programs from sequential building blocks. In: ACM SIGOPS Operating Systems Review, vol. 41, pp. 59–72. ACM (2007)
4.
go back to reference Shvachko, K., Kuang, H., Radia, S., Chansler, R.: The Hadoop distributed file system. In: IEEE 26th Symposium on Mass Storage Systems and Technologies. IEEE (2010) Shvachko, K., Kuang, H., Radia, S., Chansler, R.: The Hadoop distributed file system. In: IEEE 26th Symposium on Mass Storage Systems and Technologies. IEEE (2010)
5.
go back to reference Weil, S.A., Brandt, S.A., Miller, E.L., Long, D.D., Maltzahn, C.: Ceph: a scalable, high-performance distributed file system. In: Proceedings of the 7th Symposium on Operating Systems Design and Implementation, pp. 307–320. USENIX Association (2006) Weil, S.A., Brandt, S.A., Miller, E.L., Long, D.D., Maltzahn, C.: Ceph: a scalable, high-performance distributed file system. In: Proceedings of the 7th Symposium on Operating Systems Design and Implementation, pp. 307–320. USENIX Association (2006)
6.
go back to reference White, T.: Hadoop: The Definitive Guide. O’Reilly Media, Inc., Beijing (2012) White, T.: Hadoop: The Definitive Guide. O’Reilly Media, Inc., Beijing (2012)
7.
go back to reference Zaharia, M., Chowdhury, M., Das, T., Dave, A., Ma, J., McCauley, M., Franklin, M.J., Shenker, S., Stoica, I.: Resilient distributed datasets: a fault-tolerant abstraction for in-memory cluster computing. In: Proceedings of the 9th USENIX Conference on Networked Systems Design and Implementation. USENIX Association (2012) Zaharia, M., Chowdhury, M., Das, T., Dave, A., Ma, J., McCauley, M., Franklin, M.J., Shenker, S., Stoica, I.: Resilient distributed datasets: a fault-tolerant abstraction for in-memory cluster computing. In: Proceedings of the 9th USENIX Conference on Networked Systems Design and Implementation. USENIX Association (2012)
8.
go back to reference Hu, H., Wen, Y., Chua, T.S., Li, X.: Toward scalable systems for big data analytics: a technology tutorial. IEEE Access 2, 652–687 (2014)CrossRef Hu, H., Wen, Y., Chua, T.S., Li, X.: Toward scalable systems for big data analytics: a technology tutorial. IEEE Access 2, 652–687 (2014)CrossRef
9.
go back to reference Snir, M.: MPI—The Complete Reference: The MPI Core, vol. 1. MIT Press, Cambridge (1998) Snir, M.: MPI—The Complete Reference: The MPI Core, vol. 1. MIT Press, Cambridge (1998)
10.
go back to reference Lopes, R.V., Menascé, D.: A taxonomy of job scheduling on distributed computing systems. IEEE Trans. Parallel Distrib. Syst. 27, 3412–3428 (2016)CrossRef Lopes, R.V., Menascé, D.: A taxonomy of job scheduling on distributed computing systems. IEEE Trans. Parallel Distrib. Syst. 27, 3412–3428 (2016)CrossRef
11.
go back to reference Burns, B., Grant, B., Oppenheimer, D., Brewer, E., Wilkes, J.: Borg, Omega, and Kubernetes. Commun. ACM 59, 50–57 (2016)CrossRef Burns, B., Grant, B., Oppenheimer, D., Brewer, E., Wilkes, J.: Borg, Omega, and Kubernetes. Commun. ACM 59, 50–57 (2016)CrossRef
12.
go back to reference Zhou, S.: LSF: load sharing in large heterogeneous distributed systems. In: 1st Workshop on Cluster Computing, vol. 136 (1992) Zhou, S.: LSF: load sharing in large heterogeneous distributed systems. In: 1st Workshop on Cluster Computing, vol. 136 (1992)
13.
go back to reference Hindman, B., Konwinski, A., Zaharia, M., Ghodsi, A., Joseph, A.D., Katz, R.H., Shenker, S., Stoica, I.: Mesos: a platform for fine-grained resource sharing in the data center. In: Proceedings of the USENIX Conference on Networked Systems Design and Implementation, vol. 11 (2011) Hindman, B., Konwinski, A., Zaharia, M., Ghodsi, A., Joseph, A.D., Katz, R.H., Shenker, S., Stoica, I.: Mesos: a platform for fine-grained resource sharing in the data center. In: Proceedings of the USENIX Conference on Networked Systems Design and Implementation, vol. 11 (2011)
14.
go back to reference Bode, B., Halstead, D.M., Kendall, R., Lei, Z., Jackson, D.: The Portable Batch Scheduler and the Maui scheduler on Linux clusters. In: Annual Linux Showcase and Conference (2000) Bode, B., Halstead, D.M., Kendall, R., Lei, Z., Jackson, D.: The Portable Batch Scheduler and the Maui scheduler on Linux clusters. In: Annual Linux Showcase and Conference (2000)
15.
go back to reference Isard, M., Prabhakaran, V., Currey, J., Wieder, U., Talwar, K., Goldberg, A.: Quincy: fair scheduling for distributed computing clusters. In: Proceedings of the ACM SIGOPS 22nd Symposium on Operating Systems Principles, pp. 261–276. ACM (2009) Isard, M., Prabhakaran, V., Currey, J., Wieder, U., Talwar, K., Goldberg, A.: Quincy: fair scheduling for distributed computing clusters. In: Proceedings of the ACM SIGOPS 22nd Symposium on Operating Systems Principles, pp. 261–276. ACM (2009)
16.
go back to reference Bagnasco, S.: AliEn: ALICE environment on the grid. J. Phys. Conf. Ser. 119, 062012 (2008)CrossRef Bagnasco, S.: AliEn: ALICE environment on the grid. J. Phys. Conf. Ser. 119, 062012 (2008)CrossRef
17.
go back to reference Bockelman, B., et al.: Commissioning the HTCondor-CE for the open science grid. J. Phys. Conf. Ser. 664, 062003 (2015)CrossRef Bockelman, B., et al.: Commissioning the HTCondor-CE for the open science grid. J. Phys. Conf. Ser. 664, 062003 (2015)CrossRef
18.
go back to reference Couvares, P., Kosar, T., Roy, A., Weber, J., Wenger, K.: Workflow management in Condor. In: Workflows for e-Science pp. 357–375. Springer, Cham (2007) Couvares, P., Kosar, T., Roy, A., Weber, J., Wenger, K.: Workflow management in Condor. In: Workflows for e-Science pp. 357–375. Springer, Cham (2007)
19.
go back to reference Thain, D., Tannenbaum, T., Livny, M.: Distributed computing in practice: the Condor experience. Concurr. Comput. Pract. Exp. 17, 323–356 (2005)CrossRef Thain, D., Tannenbaum, T., Livny, M.: Distributed computing in practice: the Condor experience. Concurr. Comput. Pract. Exp. 17, 323–356 (2005)CrossRef
20.
go back to reference Paterson, S.K., Tsaregorodtsev, A.: DIRAC optimized workload management. J. Phys. Conf. Ser. 119, 062040 (2008)CrossRef Paterson, S.K., Tsaregorodtsev, A.: DIRAC optimized workload management. J. Phys. Conf. Ser. 119, 062040 (2008)CrossRef
21.
go back to reference Ludäscher, B., Altintas, I., Berkley, C., Higgins, D., Jaeger, E., Jones, M., Lee, E.A., Tao, J., Zhao, Y.: Scientific workflow management and the Kepler system. Concurr. Comput. Pract. Exp. 18(10), 1039–1065 (2006)CrossRef Ludäscher, B., Altintas, I., Berkley, C., Higgins, D., Jaeger, E., Jones, M., Lee, E.A., Tao, J., Zhao, Y.: Scientific workflow management and the Kepler system. Concurr. Comput. Pract. Exp. 18(10), 1039–1065 (2006)CrossRef
22.
go back to reference De, K.: The future of PanDA in ATLAS distributed computing. J. Phys. Conf. Ser. 664, 062035 (2015)CrossRef De, K.: The future of PanDA in ATLAS distributed computing. J. Phys. Conf. Ser. 664, 062035 (2015)CrossRef
23.
go back to reference Deelman, E., Vahi, K., Juve, G., Rynge, M., Callaghan, S., Maechling, P.J., Mayani, R., Chen, W., da Silva, R.F., Livny, M.: Pegasus, a workflow management system for science automation. Future Gener. Comput. Syst. 46, 17–35 (2015)CrossRef Deelman, E., Vahi, K., Juve, G., Rynge, M., Callaghan, S., Maechling, P.J., Mayani, R., Chen, W., da Silva, R.F., Livny, M.: Pegasus, a workflow management system for science automation. Future Gener. Comput. Syst. 46, 17–35 (2015)CrossRef
24.
go back to reference Qureshi, M.B., Dehnavi, M.M., Min-Allah, N., Qureshi, M.S., Hussain, H., Rentifis, I., Tziritas, N., Loukopoulos, T., Khan, S.U., Xu, C.Z.: Survey on grid resource allocation mechanisms. J. Grid Comput. 12, 399–441 (2014)CrossRef Qureshi, M.B., Dehnavi, M.M., Min-Allah, N., Qureshi, M.S., Hussain, H., Rentifis, I., Tziritas, N., Loukopoulos, T., Khan, S.U., Xu, C.Z.: Survey on grid resource allocation mechanisms. J. Grid Comput. 12, 399–441 (2014)CrossRef
25.
go back to reference Kołodziej, J., Khan, S.U.: Data scheduling in data grids and data centers: a short taxonomy of problems and intelligent resolution techniques. In: Transactions on Computational Collective Intelligence X, pp. 103–119. Springer, Berlin (2013)CrossRef Kołodziej, J., Khan, S.U.: Data scheduling in data grids and data centers: a short taxonomy of problems and intelligent resolution techniques. In: Transactions on Computational Collective Intelligence X, pp. 103–119. Springer, Berlin (2013)CrossRef
26.
go back to reference Venugopal, S., Buyya, R., Ramamohanarao, K.: A taxonomy of data grids for distributed data sharing, management, and processing. ACM Comput. Surv. 38(1), Article 3 (2006)CrossRef Venugopal, S., Buyya, R., Ramamohanarao, K.: A taxonomy of data grids for distributed data sharing, management, and processing. ACM Comput. Surv. 38(1), Article 3 (2006)CrossRef
27.
go back to reference Yu, J., Buyya, R.: A taxonomy of workflow management systems for grid computing. J. Grid Comput. 3, 171–200 (2005)CrossRef Yu, J., Buyya, R.: A taxonomy of workflow management systems for grid computing. J. Grid Comput. 3, 171–200 (2005)CrossRef
28.
go back to reference Rodriguez, M.A., Buyya, R.: A taxonomy and survey on scheduling algorithms for scientific workflows in IaaS cloud computing environments. Concurr. Comput. Pract. Exp. 29(8), e4041 (2017)CrossRef Rodriguez, M.A., Buyya, R.: A taxonomy and survey on scheduling algorithms for scientific workflows in IaaS cloud computing environments. Concurr. Comput. Pract. Exp. 29(8), e4041 (2017)CrossRef
29.
go back to reference Smanchat, S., Viriyapant, K.: Taxonomies of workflow scheduling problem and techniques in the cloud. Future Gener. Comput. Syst. 52, 1–12 (2015)CrossRef Smanchat, S., Viriyapant, K.: Taxonomies of workflow scheduling problem and techniques in the cloud. Future Gener. Comput. Syst. 52, 1–12 (2015)CrossRef
30.
go back to reference Vijaya, C., Srinivasan, D.: A survey on resource scheduling in cloud computing. Int. J. Pharm. Technol. 8(4), 26142–26162 (2016) Vijaya, C., Srinivasan, D.: A survey on resource scheduling in cloud computing. Int. J. Pharm. Technol. 8(4), 26142–26162 (2016)
31.
go back to reference Etsion, Y., Tsafrir, D.: A Short Survey of Commercial Cluster Batch Schedulers. School of Computer Science and Engineering, The Hebrew University of Jerusalem 44221 (2005) Etsion, Y., Tsafrir, D.: A Short Survey of Commercial Cluster Batch Schedulers. School of Computer Science and Engineering, The Hebrew University of Jerusalem 44221 (2005)
32.
go back to reference Maheswaran, M., Ali, S., Siegal, H., Hensgen, D., Freund, R.F.: Dynamic matching and scheduling of a class of independent tasks onto heterogeneous computing systems. In: Heterogeneous Computing Workshop (HCW’99) Proceedings, pp. 30–44. IEEE (1999) Maheswaran, M., Ali, S., Siegal, H., Hensgen, D., Freund, R.F.: Dynamic matching and scheduling of a class of independent tasks onto heterogeneous computing systems. In: Heterogeneous Computing Workshop (HCW’99) Proceedings, pp. 30–44. IEEE (1999)
33.
go back to reference Casanova, H., Legrand, A., Zagorodnov, D., Berman, F.: Heuristics for scheduling parameter sweep applications in grid environments. In: Proceedings of the 9th Heterogeneous Computing Workshop, pp. 349–363. IEEE (2000) Casanova, H., Legrand, A., Zagorodnov, D., Berman, F.: Heuristics for scheduling parameter sweep applications in grid environments. In: Proceedings of the 9th Heterogeneous Computing Workshop, pp. 349–363. IEEE (2000)
34.
go back to reference Santos-Neto, E., Cirne, W., Brasileiro, F., Lima, A.: Exploiting replication and data reuse to efficiently schedule data-intensive applications on grids. In: Job Scheduling Strategies for Parallel Processing, pp. 210–232. Springer, Berlin (2005)CrossRef Santos-Neto, E., Cirne, W., Brasileiro, F., Lima, A.: Exploiting replication and data reuse to efficiently schedule data-intensive applications on grids. In: Job Scheduling Strategies for Parallel Processing, pp. 210–232. Springer, Berlin (2005)CrossRef
35.
go back to reference Garonne, V., Stewart, G.A., Lassnig, M., Molfetas, A., Barisits, M., Beermann, T., Nairz, A., Goossens, L., Megino, F.B., Serfon, C.: The ATLAS distributed data management project: past and future. J. Phys. Conf. Ser. 396, 032045 (2012)CrossRef Garonne, V., Stewart, G.A., Lassnig, M., Molfetas, A., Barisits, M., Beermann, T., Nairz, A., Goossens, L., Megino, F.B., Serfon, C.: The ATLAS distributed data management project: past and future. J. Phys. Conf. Ser. 396, 032045 (2012)CrossRef
36.
go back to reference Thain, D., Basney, J., Son, S.C., Livny, M.: The kangaroo approach to data movement on the grid. In: 10th IEEE International Symposium on High Performance Distributed Computing, 2001. Proceedings, pp. 325–333. IEEE (2001) Thain, D., Basney, J., Son, S.C., Livny, M.: The kangaroo approach to data movement on the grid. In: 10th IEEE International Symposium on High Performance Distributed Computing, 2001. Proceedings, pp. 325–333. IEEE (2001)
37.
go back to reference Rehn, J., Barrass, T., Bonacorsi, D., Hernandez, J., Semeniouk, I., Tuura, L., Wu, Y.: PhEDEx high-throughput data transfer management system. In: Computing in High Energy and Nuclear Physics (CHEP) 2006 (2006) Rehn, J., Barrass, T., Bonacorsi, D., Hernandez, J., Semeniouk, I., Tuura, L., Wu, Y.: PhEDEx high-throughput data transfer management system. In: Computing in High Energy and Nuclear Physics (CHEP) 2006 (2006)
38.
go back to reference Garonne, V., Vigne, R., Stewart, G., Barisits, M., Lassnig, M., Serfon, C., Goossens, L., Nairz, A.: Rucio–the next generation of large scale distributed system for ATLAS data management. J. Phys. Conf. Ser. 513, 042021 (2014)CrossRef Garonne, V., Vigne, R., Stewart, G., Barisits, M., Lassnig, M., Serfon, C., Goossens, L., Nairz, A.: Rucio–the next generation of large scale distributed system for ATLAS data management. J. Phys. Conf. Ser. 513, 042021 (2014)CrossRef
39.
go back to reference Kosar, T., Livny, M.: Stork: making data placement a first class citizen in the grid. In: 24th International Conference on Distributed Computing Systems, 2004. Proceedings, pp. 342–349. IEEE (2004) Kosar, T., Livny, M.: Stork: making data placement a first class citizen in the grid. In: 24th International Conference on Distributed Computing Systems, 2004. Proceedings, pp. 342–349. IEEE (2004)
40.
go back to reference Bharathi, S., Chervenak, A.: Data staging strategies and their impact on the execution of scientific workflows. In: Proceedings of the Second International Workshop on Data-Aware Distributed Computing. ACM (2009) Bharathi, S., Chervenak, A.: Data staging strategies and their impact on the execution of scientific workflows. In: Proceedings of the Second International Workshop on Data-Aware Distributed Computing. ACM (2009)
41.
go back to reference Chervenak, A.L., Sim, A., Gu, J., Schuler, R., Hirpathak, N.: Efficient data staging using performance-based adaptation and policy-based resource allocation. In: Euromicro International Conference on Parallel, Distributed and Network-Based Processing, pp. 244–247. IEEE (2014) Chervenak, A.L., Sim, A., Gu, J., Schuler, R., Hirpathak, N.: Efficient data staging using performance-based adaptation and policy-based resource allocation. In: Euromicro International Conference on Parallel, Distributed and Network-Based Processing, pp. 244–247. IEEE (2014)
42.
go back to reference Chervenak, A.L., Sim, A., Gu, J., Schuler, R.E., Hirpathak, N.: Adaptation and policy-based resource allocation for efficient bulk data transfers in high performance computing environments. In: Proceedings of the Fourth International Workshop on Network-Aware Data Management. IEEE Press (2014) Chervenak, A.L., Sim, A., Gu, J., Schuler, R.E., Hirpathak, N.: Adaptation and policy-based resource allocation for efficient bulk data transfers in high performance computing environments. In: Proceedings of the Fourth International Workshop on Network-Aware Data Management. IEEE Press (2014)
43.
go back to reference Grace, R.K., Manimegalai, R.: Dynamic replica placement and selection strategies in data grids–a comprehensive survey. J. Parallel Distrib. Comput. 74, 2099–2108 (2014)CrossRef Grace, R.K., Manimegalai, R.: Dynamic replica placement and selection strategies in data grids–a comprehensive survey. J. Parallel Distrib. Comput. 74, 2099–2108 (2014)CrossRef
44.
go back to reference Hamrouni, T., Slimani, S., Charrada, F.B.: A survey of dynamic replication and replica selection strategies based on data mining techniques in data grids. Eng. Appl. Artif. Intell. 48, 140–158 (2016)CrossRef Hamrouni, T., Slimani, S., Charrada, F.B.: A survey of dynamic replication and replica selection strategies based on data mining techniques in data grids. Eng. Appl. Artif. Intell. 48, 140–158 (2016)CrossRef
45.
go back to reference Mokadem, R., Hameurlain, A.: Data replication strategies with performance objective in data grid systems: a survey. Int. J. Grid Util. Comput. 6, 30–46 (2014)CrossRef Mokadem, R., Hameurlain, A.: Data replication strategies with performance objective in data grid systems: a survey. Int. J. Grid Util. Comput. 6, 30–46 (2014)CrossRef
46.
go back to reference Ranganathan, K., Foster, I.: Decoupling computation and data scheduling in distributed data-intensive applications. In: 11th IEEE International Symposium on High Performance Distributed Computing, pp. 352–358 (2002) Ranganathan, K., Foster, I.: Decoupling computation and data scheduling in distributed data-intensive applications. In: 11th IEEE International Symposium on High Performance Distributed Computing, pp. 352–358 (2002)
47.
go back to reference Tos, U., Mokadem, R., Hameurlain, A., Ayav, T., Bora, S.: Dynamic replication strategies in data grid systems: a survey. J. Supercomput. 71, 4116–4140 (2015)CrossRef Tos, U., Mokadem, R., Hameurlain, A., Ayav, T., Bora, S.: Dynamic replication strategies in data grid systems: a survey. J. Supercomput. 71, 4116–4140 (2015)CrossRef
48.
go back to reference Bird, I., et al.: Update of the Computing Models of the WLCG and the LHC Experiments. Technical Report CERN-LHCC-2014-014, LCG-TDR-002. CERN, Geneva (2014) Bird, I., et al.: Update of the Computing Models of the WLCG and the LHC Experiments. Technical Report CERN-LHCC-2014-014, LCG-TDR-002. CERN, Geneva (2014)
49.
go back to reference Ackermann, K.H.: STAR detector overview. Nucl. Instrum. Methods Phys. Res. A499, 624–632 (2003)CrossRef Ackermann, K.H.: STAR detector overview. Nucl. Instrum. Methods Phys. Res. A499, 624–632 (2003)CrossRef
50.
go back to reference Balewski, J., Lauret, J., Olson, D., Sakrejda, I., Arkhipkin, D.: Offloading peak processing to virtual farm by STAR experiment at RHIC. J. Phys. Conf. Ser. 368, 012011 (2012)CrossRef Balewski, J., Lauret, J., Olson, D., Sakrejda, I., Arkhipkin, D.: Offloading peak processing to virtual farm by STAR experiment at RHIC. J. Phys. Conf. Ser. 368, 012011 (2012)CrossRef
51.
go back to reference Hajdu, L., et al.: STAR experience with automated high efficiency Grid based data production framework at KISTI/Korea. In: HEPiX Spring 2015 Workshop. Oxford University (2015) Hajdu, L., et al.: STAR experience with automated high efficiency Grid based data production framework at KISTI/Korea. In: HEPiX Spring 2015 Workshop. Oxford University (2015)
52.
go back to reference Zerola, M.: Distributed data management in experiments at RHIC and LHC. PhD Thesis, Czech Technical University (2012) Zerola, M.: Distributed data management in experiments at RHIC and LHC. PhD Thesis, Czech Technical University (2012)
53.
go back to reference Barczyk, A.: Advanced networking for scientific applications. In: Grid and Cloud Computing: Concepts and Practical Applications, vol. 192 (2016) Barczyk, A.: Advanced networking for scientific applications. In: Grid and Cloud Computing: Concepts and Practical Applications, vol. 192 (2016)
54.
go back to reference Lapadatescu, V., Melo, A., Mughal, A., Newman, H., Barczyk, A., Sheldon, P., Voicu, R., Wildish, T., De, K., Legrand, I., et al.: Integrating network-awareness and network-management into PhEDEx. In: Proceedings of Science (2016) Lapadatescu, V., Melo, A., Mughal, A., Newman, H., Barczyk, A., Sheldon, P., Voicu, R., Wildish, T., De, K., Legrand, I., et al.: Integrating network-awareness and network-management into PhEDEx. In: Proceedings of Science (2016)
55.
go back to reference Kreutz, D., Ramos, F.M., Verissimo, P.E., Rothenberg, C.E., Azodolmolky, S., Uhlig, S.: Software-defined networking: a comprehensive survey. Proc. IEEE 103, 14–76 (2015)CrossRef Kreutz, D., Ramos, F.M., Verissimo, P.E., Rothenberg, C.E., Azodolmolky, S., Uhlig, S.: Software-defined networking: a comprehensive survey. Proc. IEEE 103, 14–76 (2015)CrossRef
56.
go back to reference Nunes, B.A.A., Mendonca, M., Nguyen, X.N., Obraczka, K., Turletti, T.: A survey of software-defined networking: past, present, and future of programmable networks. IEEE Commun. Surv. Tutor. 16, 1617–1634 (2014)CrossRef Nunes, B.A.A., Mendonca, M., Nguyen, X.N., Obraczka, K., Turletti, T.: A survey of software-defined networking: past, present, and future of programmable networks. IEEE Commun. Surv. Tutor. 16, 1617–1634 (2014)CrossRef
57.
go back to reference Al-Fares, M., Radhakrishnan, S., Raghavan, B., Huang, N., Vahdat, A.: Hedera: dynamic flow scheduling for data center networks. In: Proceedings of the 7th USENIX Conference on Networked Systems Design and Implementation, vol. 10 (2010) Al-Fares, M., Radhakrishnan, S., Raghavan, B., Huang, N., Vahdat, A.: Hedera: dynamic flow scheduling for data center networks. In: Proceedings of the 7th USENIX Conference on Networked Systems Design and Implementation, vol. 10 (2010)
58.
go back to reference Bredel, M., Bozakov, Z., Barczyk, A., Newman, H.: Flow-based load balancing in multipathed layer-2 networks using OpenFlow and multipath-TCP. In: Proceedings of the Third Workshop on Hot Topics in Software Defined Networking, pp. 213–214. ACM (2014) Bredel, M., Bozakov, Z., Barczyk, A., Newman, H.: Flow-based load balancing in multipathed layer-2 networks using OpenFlow and multipath-TCP. In: Proceedings of the Third Workshop on Hot Topics in Software Defined Networking, pp. 213–214. ACM (2014)
59.
go back to reference Foster, I., Roy, A., Sander, V.: A quality of service architecture that combines resource reservation and application adaptation. In: Proceedings of the 8th International Workshop on Quality of Service, pp. 181–188. IEEE (2000) Foster, I., Roy, A., Sander, V.: A quality of service architecture that combines resource reservation and application adaptation. In: Proceedings of the 8th International Workshop on Quality of Service, pp. 181–188. IEEE (2000)
60.
go back to reference Legrand, I.: Monitoring and control of large-scale distributed systems. In: Grid and Cloud Computing: Concepts and Practical Applications, vol. 192 (2016) Legrand, I.: Monitoring and control of large-scale distributed systems. In: Grid and Cloud Computing: Concepts and Practical Applications, vol. 192 (2016)
61.
go back to reference Xie, D., Ding, N., Hu, Y.C., Kompella, R.: The only constant is change: incorporating time-varying network reservations in data centers. ACM SIGCOMM Comput. Commun. Rev. 42, 199–210 (2012)CrossRef Xie, D., Ding, N., Hu, Y.C., Kompella, R.: The only constant is change: incorporating time-varying network reservations in data centers. ACM SIGCOMM Comput. Commun. Rev. 42, 199–210 (2012)CrossRef
63.
go back to reference Ahlgren, B., Dannewitz, C., Imbrenda, C., Kutscher, D., Ohlman, B.: A survey of information-centric networking. IEEE Commun. Mag. 50(7), 26–36 (2012)CrossRef Ahlgren, B., Dannewitz, C., Imbrenda, C., Kutscher, D., Ohlman, B.: A survey of information-centric networking. IEEE Commun. Mag. 50(7), 26–36 (2012)CrossRef
64.
go back to reference Xylomenos, G., Ververidis, C.N., Siris, V.A., Fotiou, N., Tsilopoulos, C., Vasilakos, X., Katsaros, K.V., Polyzos, G.C.: A survey of information-centric networking research. IEEE Commun. Surv. Tutor. 16, 1024–1049 (2014)CrossRef Xylomenos, G., Ververidis, C.N., Siris, V.A., Fotiou, N., Tsilopoulos, C., Vasilakos, X., Katsaros, K.V., Polyzos, G.C.: A survey of information-centric networking research. IEEE Commun. Surv. Tutor. 16, 1024–1049 (2014)CrossRef
65.
go back to reference Kliazovich, D., Bouvry, P., Khan, S.U.: DENS: data center energy-efficient network-aware scheduling. Clust. Comput. 16, 65–75 (2013)CrossRef Kliazovich, D., Bouvry, P., Khan, S.U.: DENS: data center energy-efficient network-aware scheduling. Clust. Comput. 16, 65–75 (2013)CrossRef
66.
go back to reference Beaumont, O., Carter, L., Ferrante, J., Legrand, A., Robert, Y.: Bandwidth-centric allocation of independent tasks on heterogeneous platforms. In: International Parallel and Distributed Processing Symposium, Proceedings. IEEE (2001) Beaumont, O., Carter, L., Ferrante, J., Legrand, A., Robert, Y.: Bandwidth-centric allocation of independent tasks on heterogeneous platforms. In: International Parallel and Distributed Processing Symposium, Proceedings. IEEE (2001)
67.
go back to reference Gog, I., Schwarzkopf, M., Gleave, A., Watson, R.N.M., Hand, S.: Firmament: fast, centralized cluster scheduling at scale. In: Proceedings of the 12th USENIX Conference on Operating Systems Design and Implementation, pp. 99–115. USENIX Association, Berkeley (2016) Gog, I., Schwarzkopf, M., Gleave, A., Watson, R.N.M., Hand, S.: Firmament: fast, centralized cluster scheduling at scale. In: Proceedings of the 12th USENIX Conference on Operating Systems Design and Implementation, pp. 99–115. USENIX Association, Berkeley (2016)
68.
go back to reference Zerola, M., Lauret, J., Barták, R., Šumbera, M.: One click dataset transfer: toward efficient coupling of distributed storage resources and CPUs. J. Phys. Conf. Ser. 368, 012022 (2012)CrossRef Zerola, M., Lauret, J., Barták, R., Šumbera, M.: One click dataset transfer: toward efficient coupling of distributed storage resources and CPUs. J. Phys. Conf. Ser. 368, 012022 (2012)CrossRef
69.
go back to reference Makatun, D., Lauret, J., Rudová, H., Šumbera, M.: Model for planning of distributed data production. In: Proceedings of the 7th Multidisciplinary International Scheduling Conference (MISTA), pp. 699–703 (2015) Makatun, D., Lauret, J., Rudová, H., Šumbera, M.: Model for planning of distributed data production. In: Proceedings of the 7th Multidisciplinary International Scheduling Conference (MISTA), pp. 699–703 (2015)
70.
go back to reference Makatun, D., Lauret, J., Rudová, H., Šumbera, M.: Simulations and study of a new scheduling approach for distributed data production. J. Phys. Conf. Ser. 762(1), 012023 (2016)CrossRef Makatun, D., Lauret, J., Rudová, H., Šumbera, M.: Simulations and study of a new scheduling approach for distributed data production. J. Phys. Conf. Ser. 762(1), 012023 (2016)CrossRef
71.
go back to reference Makatun, D., Lauret, J., Rudová, H., Šumbera, M.: Network flows for data distribution and computation. In: Proceedings of the IEEE Symposium on Computational Intelligence in Scheduling and Network Design (2016) Makatun, D., Lauret, J., Rudová, H., Šumbera, M.: Network flows for data distribution and computation. In: Proceedings of the IEEE Symposium on Computational Intelligence in Scheduling and Network Design (2016)
72.
go back to reference Casajus, A., Graciani, R., Paterson, S., Tsaregorodtsev, A.: DIRAC pilot framework and the DIRAC workload management system. J. Phys. Conf. Ser. 219, 062049 (2010)CrossRef Casajus, A., Graciani, R., Paterson, S., Tsaregorodtsev, A.: DIRAC pilot framework and the DIRAC workload management system. J. Phys. Conf. Ser. 219, 062049 (2010)CrossRef
73.
go back to reference Legrand, I., Newman, H., Voicu, R., Cirstoiu, C., Grigoras, C., Dobre, C., Muraru, A., Costan, A., Dediu, M., Stratan, C.: MonALISA: an agent based, dynamic service system to monitor, control and optimize distributed systems. Comput. Phys. Commun. 180, 2472–2498 (2009)CrossRef Legrand, I., Newman, H., Voicu, R., Cirstoiu, C., Grigoras, C., Dobre, C., Muraru, A., Costan, A., Dediu, M., Stratan, C.: MonALISA: an agent based, dynamic service system to monitor, control and optimize distributed systems. Comput. Phys. Commun. 180, 2472–2498 (2009)CrossRef
74.
go back to reference Magoules, F., Nguyen, T.M.H., Yu, L.: Grid Resource Management: Towards Virtual and Services Compliant Grid Computing, 1st edn. CRC Press, Inc., Boca Raton (2008)CrossRef Magoules, F., Nguyen, T.M.H., Yu, L.: Grid Resource Management: Towards Virtual and Services Compliant Grid Computing, 1st edn. CRC Press, Inc., Boca Raton (2008)CrossRef
75.
go back to reference Ahuja, R.K., Magnati, T.L., Orlin, J.B.: Network Flows: Theory, Algorithms, and Applications. Prentice Hall, Englewood Cliffs (1993) Ahuja, R.K., Magnati, T.L., Orlin, J.B.: Network Flows: Theory, Algorithms, and Applications. Prentice Hall, Englewood Cliffs (1993)
76.
go back to reference Buyya, R., Murshed, M.: GridSim: a toolkit for the modeling and simulation of distributed resource management and scheduling for grid computing. J. Concurr. Comput. Pract. Exp. 14, 1175–1220 (2002)CrossRef Buyya, R., Murshed, M.: GridSim: a toolkit for the modeling and simulation of distributed resource management and scheduling for grid computing. J. Concurr. Comput. Pract. Exp. 14, 1175–1220 (2002)CrossRef
77.
go back to reference Klusáček, D., Rudová, H.: A metaheuristic for optimizing the performance and the fairness in job scheduling systems. In: Laalaoui, Y., Bouguila, N. (eds.) Artificial Intelligence Applications in Information and Communication Technologies, vol. 607, pp. 3–29. Springer, Heidelberg (2015)CrossRef Klusáček, D., Rudová, H.: A metaheuristic for optimizing the performance and the fairness in job scheduling systems. In: Laalaoui, Y., Bouguila, N. (eds.) Artificial Intelligence Applications in Information and Communication Technologies, vol. 607, pp. 3–29. Springer, Heidelberg (2015)CrossRef
82.
go back to reference Sulistio, A., Poduval, G., Buyya, R., Tham, C.K.: Constructing a grid simulation with differentiated network service using GridSim. In: Proceedings of the 6th International Conference on Internet Computing (2005) Sulistio, A., Poduval, G., Buyya, R., Tham, C.K.: Constructing a grid simulation with differentiated network service using GridSim. In: Proceedings of the 6th International Conference on Internet Computing (2005)
83.
go back to reference Hanushevsky, A., Trunov, A., Cottrell, L.: Peer to peer computing for secure high performance data copying. In: Proceedings of the 2001 International Conference on Computing in High Energy and Nuclear Physics, Beijing (2001) Hanushevsky, A., Trunov, A., Cottrell, L.: Peer to peer computing for secure high performance data copying. In: Proceedings of the 2001 International Conference on Computing in High Energy and Nuclear Physics, Beijing (2001)
84.
go back to reference Allcock, W., Bresnahan, J., Kettimuthu, R., Link, M., Dumitrescu, C., Raicu, I., Foster, I.: The Globus striped GridFTP framework and server. In: Proceedings of the 2005 ACM/IEEE Conference on Supercomputing. IEEE Computer Society (2005) Allcock, W., Bresnahan, J., Kettimuthu, R., Link, M., Dumitrescu, C., Raicu, I., Foster, I.: The Globus striped GridFTP framework and server. In: Proceedings of the 2005 ACM/IEEE Conference on Supercomputing. IEEE Computer Society (2005)
85.
go back to reference Faloutsos, M., Faloutsos, P., Faloutsos, C.: On power-law relationships of the internet topology. ACM SIGCOMM Comput. Commun. Rev. 29, 251–262 (1999)CrossRef Faloutsos, M., Faloutsos, P., Faloutsos, C.: On power-law relationships of the internet topology. ACM SIGCOMM Comput. Commun. Rev. 29, 251–262 (1999)CrossRef
Metadata
Title
Planning of distributed data production for High Energy and Nuclear Physics
Authors
Dzmitry Makatun
Jérôme Lauret
Hana Rudová
Publication date
25-08-2018
Publisher
Springer US
Published in
Cluster Computing / Issue 4/2018
Print ISSN: 1386-7857
Electronic ISSN: 1573-7543
DOI
https://doi.org/10.1007/s10586-018-2834-3

Other articles of this Issue 4/2018

Cluster Computing 4/2018 Go to the issue

Premium Partner