Skip to main content
Top
Published in: The Journal of Supercomputing 5/2016

01-05-2016

Experimental analysis of operating system jitter caused by page reclaim

Authors: Yoshihiro Oyama, Shun Ishiguro, Jun Murakami, Shin Sasaki, Ryo Matsumiya, Osamu Tatebe

Published in: The Journal of Supercomputing | Issue 5/2016

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Operating system jitter is one of the causes of runtime overhead in high-performance computing applications. Many high-performance computing applications perform burst accesses to I/O, and such accesses consume a large amount of memory. When the Linux kernel runs out of memory, it awakens special kernel threads to reclaim memory pages. If the kernel threads are frequently awakened, application performance is degraded because of the threads’ resource consumption as well as the increase in the application’s page faults and migration between CPU cores. In this study, we empirically analyze the impact of jitter caused by reclaiming memory pages, and we propose a method for reducing it. The proposed method reclaims memory pages in advance of the kernel thread. It reclaims more pages at one time than the kernel threads, thus reducing the frequency of page reclaim and the impact of jitter. We conducted experiments using practical weather forecast software, the results of which showed that the proposed method minimized performance degradation caused by jitter.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Akkan H, Lang M, Liebrock LM (2012) Stepping towards noiseless Linux environment. In: Proceedings of the 2nd international workshop on runtime and operating systems for supercomputers Akkan H, Lang M, Liebrock LM (2012) Stepping towards noiseless Linux environment. In: Proceedings of the 2nd international workshop on runtime and operating systems for supercomputers
3.
go back to reference Beckman P, Iskra K, Yoshii K, Coghlan S (2006) The influence of operating systems on the performance of collective operations at extreme scale. In: Proceedings of the 2006 IEEE international conference on cluster computing Beckman P, Iskra K, Yoshii K, Coghlan S (2006) The influence of operating systems on the performance of collective operations at extreme scale. In: Proceedings of the 2006 IEEE international conference on cluster computing
4.
go back to reference Betti E, Cesati M, Gioiosa R, Piermaria F (2009) A global operating system for HPC clusters. In: Proceedings of the 2009 IEEE international conference on cluster computing Betti E, Cesati M, Gioiosa R, Piermaria F (2009) A global operating system for HPC clusters. In: Proceedings of the 2009 IEEE international conference on cluster computing
5.
go back to reference Chinner D, Higdon J (2006) Exploring high bandwidth filesystems on large systems. Proc Ott Linux Symp 2006:177–191 Chinner D, Higdon J (2006) Exploring high bandwidth filesystems on large systems. Proc Ott Linux Symp 2006:177–191
6.
go back to reference De P, Kothari R, Mann V (2007) Identifying sources of operating system jitter through fine-grained kernel instrumentation. In: Proceedings of the 2007 IEEE international conference on cluster computing, pp 331–340 De P, Kothari R, Mann V (2007) Identifying sources of operating system jitter through fine-grained kernel instrumentation. In: Proceedings of the 2007 IEEE international conference on cluster computing, pp 331–340
7.
go back to reference De P, Mann V, Mittaly U (2009) Handling OS jitter on multicore multithreaded systems. In: Proceedings of the 23rd IEEE international symposium on parallel and distributed processing De P, Mann V, Mittaly U (2009) Handling OS jitter on multicore multithreaded systems. In: Proceedings of the 23rd IEEE international symposium on parallel and distributed processing
8.
go back to reference Dunigan TH (1994) Early experiences and performance of the Intel Paragon. Tech. Rep. ORNL/TM-12194, Oak Ridge National Laboratory Dunigan TH (1994) Early experiences and performance of the Intel Paragon. Tech. Rep. ORNL/TM-12194, Oak Ridge National Laboratory
9.
go back to reference Ferreira KB, Bridges P, Brightwell R (2008) Characterizing application sensitivity to os interference using kernel-level noise injection. In: Proceedings of the 2008 ACM/IEEE conference on supercomputing Ferreira KB, Bridges P, Brightwell R (2008) Characterizing application sensitivity to os interference using kernel-level noise injection. In: Proceedings of the 2008 ACM/IEEE conference on supercomputing
10.
go back to reference Giampapa M, Gooding T, Inglett T, Wisniewski RW (2010) Experiences with a lightweight supercomputer kernel: lessons learned from Blue Gene’s CNK. In: Proceedings of SC10 Giampapa M, Gooding T, Inglett T, Wisniewski RW (2010) Experiences with a lightweight supercomputer kernel: lessons learned from Blue Gene’s CNK. In: Proceedings of SC10
11.
go back to reference Gioiosa R, Petrini F, Davis K, Lebaillif-Delamare F (2004) Analysis of system overhead on parallel computers. In: Proceedings of the 4th IEEE international symposium on signal processing and information technology, pp 387–390 Gioiosa R, Petrini F, Davis K, Lebaillif-Delamare F (2004) Analysis of system overhead on parallel computers. In: Proceedings of the 4th IEEE international symposium on signal processing and information technology, pp 387–390
13.
go back to reference Hoefler T, Schneider T, Lumsdaine A (2010) Characterizing the influence of system noise on large-scale applications by simulation. In: Proceedings of SC10 Hoefler T, Schneider T, Lumsdaine A (2010) Characterizing the influence of system noise on large-scale applications by simulation. In: Proceedings of SC10
14.
go back to reference Isaila F, Balaprakash P, Wild SM, Kimpe D, Latham R, Ross R, Hovland P (2015) Collective I/O tuning using analytical and machine learning models. In: Proceedings of the 2015 IEEE international conference on cluster computing, pp 128–137 Isaila F, Balaprakash P, Wild SM, Kimpe D, Latham R, Ross R, Hovland P (2015) Collective I/O tuning using analytical and machine learning models. In: Proceedings of the 2015 IEEE international conference on cluster computing, pp 128–137
15.
go back to reference Jones T (2011) Linux kernel co-scheduling for bulk synchronous parallel applications. In: Proceedings of the 1st international workshop on runtime and operating systems for supercomputers, pp 57–64 Jones T (2011) Linux kernel co-scheduling for bulk synchronous parallel applications. In: Proceedings of the 1st international workshop on runtime and operating systems for supercomputers, pp 57–64
16.
go back to reference Kuo CS, Shah A, Nomura A, Matsuoka S, Wolf F (2014) How file access patterns influence interference among cluster applications. In: Proceedings of 2014 IEEE international conference on cluster computing, pp 185–193 Kuo CS, Shah A, Nomura A, Matsuoka S, Wolf F (2014) How file access patterns influence interference among cluster applications. In: Proceedings of 2014 IEEE international conference on cluster computing, pp 185–193
17.
go back to reference Morari A, Gioiosa R, Wisniewski RW, Cazorla FJ, Valero M (2011) A quantitative analysis of OS noise. In: Proceedings of the 2011 IEEE international parallel and distributed processing symposium, pp 852–863 Morari A, Gioiosa R, Wisniewski RW, Cazorla FJ, Valero M (2011) A quantitative analysis of OS noise. In: Proceedings of the 2011 IEEE international parallel and distributed processing symposium, pp 852–863
19.
go back to reference Nataraj A, Morris A, Malony AD, Sottile M, Beckman P (2007) The ghost in the machine: observing the effects of kernel operation on parallel application performance. In: Proceedings of SC07 Nataraj A, Morris A, Malony AD, Sottile M, Beckman P (2007) The ghost in the machine: observing the effects of kernel operation on parallel application performance. In: Proceedings of SC07
20.
go back to reference Oral S, Wang F, Shipman GM, Dillow D, Miller R, Maxwell D, Becklehimer J, Larkin J, Henseler D (2010) Reducing application runtime variability on Jaguar XT5. Cray User Group (CUG) Meeting Oral S, Wang F, Shipman GM, Dillow D, Miller R, Maxwell D, Becklehimer J, Larkin J, Henseler D (2010) Reducing application runtime variability on Jaguar XT5. Cray User Group (CUG) Meeting
21.
go back to reference Oyama Y, Ishiguro S, Murakami J, Sasaki S, Matsumiya R, Tatebe O (2014) Reduction of operating system jitter caused by page reclaim. In: Proceedings of the 4th international workshop on runtime and operating systems for supercomputers (ROSS’14) Oyama Y, Ishiguro S, Murakami J, Sasaki S, Matsumiya R, Tatebe O (2014) Reduction of operating system jitter caused by page reclaim. In: Proceedings of the 4th international workshop on runtime and operating systems for supercomputers (ROSS’14)
22.
go back to reference Park Y, Hensbergen EV, Hillenbrand M, Inglett T, Rosenburg B, Ryu KD, Wisniewski RW (2012) FusedOS: fusing LWK performance with FWK functionality in a heterogeneous environment. In: Proceedings of the 24th international symposium on computer architecture and high performance computing, pp 211–218 Park Y, Hensbergen EV, Hillenbrand M, Inglett T, Rosenburg B, Ryu KD, Wisniewski RW (2012) FusedOS: fusing LWK performance with FWK functionality in a heterogeneous environment. In: Proceedings of the 24th international symposium on computer architecture and high performance computing, pp 211–218
23.
go back to reference Rosenthal E, León EA, Moody AT (2013) Mitigating system noise with simultaneous multi-threading. In: Proceedings of SC13, poster session Rosenthal E, León EA, Moody AT (2013) Mitigating system noise with simultaneous multi-threading. In: Proceedings of SC13, poster session
24.
go back to reference Schwan P (2003) Lustre: building a file system for 1000-node clusters. In: Proceedings of the 2003 Linux symposium Schwan P (2003) Lustre: building a file system for 1000-node clusters. In: Proceedings of the 2003 Linux symposium
25.
go back to reference Seelam S, Fong L, Lewars J, Divirgilio J, Veale BF, Gildea K (2011) Characterization of system services and their performance impact in multi-core nodes. In: Proceedings of the 25th IEEE international parallel and distributed processing symposium, pp 104–117 Seelam S, Fong L, Lewars J, Divirgilio J, Veale BF, Gildea K (2011) Characterization of system services and their performance impact in multi-core nodes. In: Proceedings of the 25th IEEE international parallel and distributed processing symposium, pp 104–117
26.
go back to reference Shvachko K, Kuang H, Radia S, Chansler R (2010) The Hadoop distributed file system. In: Proceedings of the 26th IEEE symposium on massive storage systems and technologies Shvachko K, Kuang H, Radia S, Chansler R (2010) The Hadoop distributed file system. In: Proceedings of the 26th IEEE symposium on massive storage systems and technologies
28.
29.
go back to reference Tsafrir D, Etsion Y, Feitelson DG, Kirkpatrick S (2005) System noise, OS clock ticks, and fine-grained parallel applications. In: Proceedings of the 19th ACM international conference on supercomputing, pp 303–312 Tsafrir D, Etsion Y, Feitelson DG, Kirkpatrick S (2005) System noise, OS clock ticks, and fine-grained parallel applications. In: Proceedings of the 19th ACM international conference on supercomputing, pp 303–312
31.
go back to reference Vicente E Jr, Matias R (2012) Exploratory study on the Linux OS jitter. In: Proceedings of the 2012 Brazilian symposium on computing system engineering, pp 19–24 Vicente E Jr, Matias R (2012) Exploratory study on the Linux OS jitter. In: Proceedings of the 2012 Brazilian symposium on computing system engineering, pp 19–24
33.
go back to reference Yuan Q, Zhao J, Chen M, Sun N (2010) GenerOS: an asymmetric operating system kernel for multi-core systems. In: Proceedings of the 24th IEEE international parallel and distributed processing symposium Yuan Q, Zhao J, Chen M, Sun N (2010) GenerOS: an asymmetric operating system kernel for multi-core systems. In: Proceedings of the 24th IEEE international parallel and distributed processing symposium
Metadata
Title
Experimental analysis of operating system jitter caused by page reclaim
Authors
Yoshihiro Oyama
Shun Ishiguro
Jun Murakami
Shin Sasaki
Ryo Matsumiya
Osamu Tatebe
Publication date
01-05-2016
Publisher
Springer US
Published in
The Journal of Supercomputing / Issue 5/2016
Print ISSN: 0920-8542
Electronic ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-016-1703-1

Other articles of this Issue 5/2016

The Journal of Supercomputing 5/2016 Go to the issue

Premium Partner