
2019 | OriginalPaper | Book chapter

Learning Concave-Convex Profiles of Data Transport over Dedicated Connections

Authors: Nageswara S. V. Rao, Satyabrata Sen, Zhengchun Liu, Rajkumar Kettimuthu, Ian Foster

Published in: Machine Learning for Networking

Publisher: Springer International Publishing


Abstract

Dedicated data transport infrastructures are increasingly being deployed to support distributed big-data and high-performance computing scenarios. These infrastructures employ data transfer nodes that use sophisticated software stacks to support network transport among sites, which often house distributed file and storage systems. Throughput measurements collected over such infrastructures for a range of round trip times (RTTs) reflect the underlying complex end-to-end connections, and have revealed dichotomous throughput profiles as functions of RTT. In particular, concave regions of throughput profiles at lower RTTs indicate near-optimal performance, and convex regions at higher RTTs indicate bottlenecks due to factors such as buffer or credit limits. We present a machine learning method that explicitly infers these concave and convex regions and transitions between them using sigmoid functions. We also provide distribution-free confidence estimates for the generalization error of these concave-convex profile estimates. Throughput profiles for data transfers over 10 Gbps connections with 0–366 ms RTT provide important performance insights, including the near optimality of transfers performed with the XDD tool between XFS filesystems, and the performance limits of wide-area Lustre extensions using LNet routers. A direct application of generic machine learning packages does not adequately highlight these critical performance regions or provide as precise confidence estimates.
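
To make the concave-convex shape concrete, the following is a minimal sketch, not the authors' estimator. It assumes a single decreasing ("flipped") logistic curve as a stand-in for a throughput profile, and locates the concave-to-convex transition from discrete second differences. The function names and the parameter values (10 Gbps peak, slope 0.05 per ms, transition at 151 ms) are hypothetical and chosen only for illustration; no measured data is used.

```python
import math


def flipped_sigmoid(rtt_ms, peak_gbps, slope, transition_ms):
    """Decreasing logistic curve (illustrative assumption, not fitted data).

    It is concave for rtt_ms < transition_ms and convex for rtt_ms > transition_ms.
    """
    return peak_gbps / (1.0 + math.exp(slope * (rtt_ms - transition_ms)))


def second_differences(values):
    """Discrete curvature on a uniform grid: negative = concave, positive = convex."""
    return [
        values[i - 1] - 2.0 * values[i] + values[i + 1]
        for i in range(1, len(values) - 1)
    ]


def find_transition(rtts, throughputs):
    """Return the first interior RTT whose discrete curvature is non-negative.

    Returns None if the sampled profile is concave everywhere.
    """
    curvature = second_differences(throughputs)
    for rtt, c in zip(rtts[1:-1], curvature):
        if c >= 0.0:
            return rtt
    return None


# Hypothetical RTT grid spanning the 0-366 ms range mentioned in the abstract.
rtts = [float(r) for r in range(0, 367, 6)]
profile = [flipped_sigmoid(r, 10.0, 0.05, 151.0) for r in rtts]
transition = find_transition(rtts, profile)
print("estimated concave-to-convex transition near", transition, "ms")
```

On noisy measurements the raw second differences would fluctuate in sign, which is one reason to fit a smooth parametric form such as a sigmoid before reading off the transition.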

Metadata
Title
Learning Concave-Convex Profiles of Data Transport over Dedicated Connections
Authors
Nageswara S. V. Rao
Satyabrata Sen
Zhengchun Liu
Rajkumar Kettimuthu
Ian Foster
Copyright year
2019
DOI
https://doi.org/10.1007/978-3-030-19945-6_1