
2018 | Original Paper | Book Chapter

19. Cyber-Surveillance Analysis for Supercomputing Environments

Authors: A. D. Clark, J. M. Absher

Published in: Surveillance in Action

Publisher: Springer International Publishing

Abstract

High-performance computers (HPCs) have contributed to rapid scientific discovery, global economic prosperity, and defense-related applications. However, their complexity makes them difficult to troubleshoot, which calls their reliability into question. As a result, these supercomputing systems are susceptible to malicious behavior and cyber attacks. Similar investigations have addressed malicious objects in computer networks; however, large-scale parallel systems have received limited attention. In this chapter, we present a process that characterizes observed failures in supercomputing infrastructures caused by variations of consistent intentional attacks. First, we present a data network extrapolation (DNE) process that automatically performs failure accounting and error checking while considering an HPC tree-like reliability infrastructure. Next, we perform dynamic and static characterizations of failures. By introducing a normalization metric, we observe that the complete spectrum of failure observations is deterministic in nature and depends on the total number of failed jobs, the time between processed jobs, and the total number of processed jobs per node. Our simulations using the Structural Simulation Toolkit (SST) show that our approach is highly effective for dynamically and statically representing observed failures. Furthermore, our results can be applied to improving job-based scheduling in supercomputing environments.
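
As a rough illustration of failure accounting over a tree-like hierarchy, the following minimal sketch aggregates per-node job counts up a tree and applies basic error checks. It is not the chapter's DNE process: the `Node` class, the `account` and `failure_ratio` functions, and the example counts are hypothetical, and the simple failed-to-processed ratio is only a stand-in, since the abstract does not state the actual normalization metric (which also depends on the time between processed jobs, omitted here).

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class Node:
    """Hypothetical element of a tree-like HPC hierarchy (system, rack, compute node)."""
    name: str
    processed: int = 0   # jobs processed directly on this element
    failed: int = 0      # jobs that failed directly on this element
    children: List["Node"] = field(default_factory=list)


def account(node: Node) -> Tuple[int, int]:
    """Return (processed, failed) totals for the subtree rooted at `node`.

    Error checking: counts must be non-negative, and failures cannot
    exceed processed jobs on any single element.
    """
    if node.processed < 0 or node.failed < 0:
        raise ValueError(f"{node.name}: negative job count")
    if node.failed > node.processed:
        raise ValueError(f"{node.name}: more failed than processed jobs")
    processed, failed = node.processed, node.failed
    for child in node.children:
        p, f = account(child)
        processed += p
        failed += f
    return processed, failed


def failure_ratio(node: Node) -> float:
    """Illustrative normalization (an assumption): failed jobs / processed jobs."""
    processed, failed = account(node)
    return failed / processed if processed else 0.0


# Example data, made up for illustration only.
rack_a = Node("rack-a", children=[Node("n0", 100, 5), Node("n1", 50, 0)])
rack_b = Node("rack-b", children=[Node("n2", 50, 15)])
system = Node("system", children=[rack_a, rack_b])

totals = account(system)        # aggregated (processed, failed) for the whole tree
ratio = failure_ratio(system)   # system-wide share of failed jobs
```

Normalizing by processed jobs makes subtrees of different sizes comparable; a per-rack call such as `failure_ratio(rack_b)` would highlight where failures concentrate.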

Footnotes
1
The spline estimates are third-degree polynomials.

Metadata
Title
Cyber-Surveillance Analysis for Supercomputing Environments
Authors
A. D. Clark
J. M. Absher
Copyright Year
2018
DOI
https://doi.org/10.1007/978-3-319-68533-5_19