2019 | OriginalPaper | Chapter

Learning Concave-Convex Profiles of Data Transport over Dedicated Connections

Authors: Nageswara S. V. Rao, Satyabrata Sen, Zhengchun Liu, Rajkumar Kettimuthu, Ian Foster

Published in: Machine Learning for Networking

Publisher: Springer International Publishing

Abstract

Dedicated data transport infrastructures are increasingly being deployed to support distributed big-data and high-performance computing scenarios. These infrastructures employ data transfer nodes that use sophisticated software stacks to support network transport among sites, which often house distributed file and storage systems. Throughput measurements collected over such infrastructures for a range of round trip times (RTTs) reflect the underlying complex end-to-end connections, and have revealed dichotomous throughput profiles as functions of RTT. In particular, concave regions of throughput profiles at lower RTTs indicate near-optimal performance, and convex regions at higher RTTs indicate bottlenecks due to factors such as buffer or credit limits. We present a machine learning method that explicitly infers these concave and convex regions and transitions between them using sigmoid functions. We also provide distribution-free confidence estimates for the generalization error of these concave-convex profile estimates. Throughput profiles for data transfers over 10 Gbps connections with 0–366 ms RTT provide important performance insights, including the near optimality of transfers performed with the XDD tool between XFS filesystems, and the performance limits of wide-area Lustre extensions using LNet routers. A direct application of generic machine learning packages does not adequately highlight these critical performance regions or provide as precise confidence estimates.
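
As a rough illustration of why a sigmoid can capture both regions described above, the following sketch uses a single decreasing ("flipped") sigmoid of RTT. It is not the chapter's estimator: the function names, the parameter values (10 Gbps peak, slope 0.05, transition at 100 ms), and the sample RTTs are hypothetical choices made only for this example. Such a curve is concave below its transition RTT and convex above it, which a discrete second difference can confirm.

```python
import math


def flipped_sigmoid(rtt_ms, peak_gbps, slope, transition_ms):
    """Decreasing sigmoid in RTT: concave below transition_ms, convex above it."""
    return peak_gbps / (1.0 + math.exp(slope * (rtt_ms - transition_ms)))


def second_difference(f, x, h=1.0):
    """Central second difference: negative where f is locally concave, positive where convex."""
    return f(x + h) - 2.0 * f(x) + f(x - h)


def classify_regions(f, rtts_ms):
    """Label each RTT as lying in a concave or a convex part of the profile f."""
    return ["concave" if second_difference(f, r) < 0 else "convex" for r in rtts_ms]


def profile(rtt_ms):
    # Hypothetical parameters, not fitted to any measurements.
    return flipped_sigmoid(rtt_ms, peak_gbps=10.0, slope=0.05, transition_ms=100.0)


# Hypothetical sample RTTs (ms) inside the 0-366 ms range mentioned in the abstract.
rtts = [10.0, 50.0, 90.0, 150.0, 250.0, 366.0]
labels = classify_regions(profile, rtts)

for rtt, label in zip(rtts, labels):
    print(f"RTT {rtt:6.1f} ms  throughput {profile(rtt):8.4f} Gbps  {label}")
```

In this toy profile the throughput stays near the peak through the concave region and decays toward zero in the convex region. Estimating the transition point and the sigmoid parameters from measured throughput, and bounding the generalization error of the result, is the subject of the chapter itself and is not attempted here.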

Metadata
Title
Learning Concave-Convex Profiles of Data Transport over Dedicated Connections
Authors
Nageswara S. V. Rao
Satyabrata Sen
Zhengchun Liu
Rajkumar Kettimuthu
Ian Foster
Copyright Year
2019
DOI
https://doi.org/10.1007/978-3-030-19945-6_1
