Skip to main content
Erschienen in: The Journal of Supercomputing 1/2013

01.04.2013

Two-level Hash/Table approach for metadata management in distributed file systems

verfasst von: Antonio F. Díaz, Mancia Anguita, Hugo E. Camacho, Erik Nieto, Julio Ortega

Erschienen in: The Journal of Supercomputing | Ausgabe 1/2013

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

AbFS is a distributed file system that makes it possible to efficiently share the inexpensive devices attached to the commodity computers of a cluster. The implementation of AbFS offers high-performance metadata management by combining hashing and tables in several levels, hierarchical structures and caches, and by combining the attributes and the namespace in the same structure. No additional layers are needed to implement caches because AbFS uses the Linux metadata caches, inode and dentry, to implement them. Along with the description of the proposed implementation for metadata management and the comparison with other implementations, this work provides experimental results to evaluate its performance obtained with a prototype made from scratch at kernel level. AbFS experimental results show that the implementation proposed is capable to manage files and directories with high performance.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Carns PH, Ligon WB, Ross RB, Thakur R (2000) PVFS: a parallel file system for Linux clusters. In: Proc 4th annual Linux showc and conf, pp 317–327 Carns PH, Ligon WB, Ross RB, Thakur R (2000) PVFS: a parallel file system for Linux clusters. In: Proc 4th annual Linux showc and conf, pp 317–327
2.
Zurück zum Zitat Braam PJ (2002) The Lustre storage architecture Braam PJ (2002) The Lustre storage architecture
3.
Zurück zum Zitat Weil SA et al (2006) Ceph: a scalable, high-performance distributed file system. In: Proc 7th symp on oper syst des and implement (OSDI’06), pp 307–320 Weil SA et al (2006) Ceph: a scalable, high-performance distributed file system. In: Proc 7th symp on oper syst des and implement (OSDI’06), pp 307–320
4.
Zurück zum Zitat Schmuck F, Haskin R (2002) GPFS: a shared-disk file system for large computing clusters. In: Proc 1st USENIX conf on file and storage technol, Berkeley, pp 19–23 Schmuck F, Haskin R (2002) GPFS: a shared-disk file system for large computing clusters. In: Proc 1st USENIX conf on file and storage technol, Berkeley, pp 19–23
5.
Zurück zum Zitat Soltis SR, Ruwart TM, O’Keefe MT (1996) The global file system. In: Proc 5th NASA Goddard conf on mass storage syst and technol. IEEE Comput. Soc., Los Alamitos, pp 319–342 Soltis SR, Ruwart TM, O’Keefe MT (1996) The global file system. In: Proc 5th NASA Goddard conf on mass storage syst and technol. IEEE Comput. Soc., Los Alamitos, pp 319–342
6.
Zurück zum Zitat Ousterhout JK et al (1985) A trace-driven analysis of the UNIX 4.2 BSD file system. In: Proc 10th ACM symp on oper syst princ, pp 15–24 Ousterhout JK et al (1985) A trace-driven analysis of the UNIX 4.2 BSD file system. In: Proc 10th ACM symp on oper syst princ, pp 15–24
7.
Zurück zum Zitat Mummert L, Satyanarayanan M, (1996) Long term distributed file reference tracing: implementation and experience. Softw Pract Exp 26(6):705–736 CrossRef Mummert L, Satyanarayanan M, (1996) Long term distributed file reference tracing: implementation and experience. Softw Pract Exp 26(6):705–736 CrossRef
8.
Zurück zum Zitat Roselli D, Lorch JR, Anderson TE (2000) A comparison of file system workloads. In: Proc annual conf on USENIX annual tech conf, p 4 Roselli D, Lorch JR, Anderson TE (2000) A comparison of file system workloads. In: Proc annual conf on USENIX annual tech conf, p 4
9.
Zurück zum Zitat SPECsfs2008 User’s guide v. 1.0. Standard performance evaluation corporation (SPEC) SPECsfs2008 User’s guide v. 1.0. Standard performance evaluation corporation (SPEC)
10.
Zurück zum Zitat Zhu Y, Jiang H, Wang J, Xian F (2008) HBA: distributed metadata management for large cluster-based storage systems. IEEE Trans Parallel Distrib Syst 19(6):750–763 CrossRef Zhu Y, Jiang H, Wang J, Xian F (2008) HBA: distributed metadata management for large cluster-based storage systems. IEEE Trans Parallel Distrib Syst 19(6):750–763 CrossRef
11.
Zurück zum Zitat Xing J, Xiong J, Sun N, Ma J (2009) Adaptive and scalable metadata management to support a trillion files. In: Proc conf. on high perform comput netw, storage and anal. ACM, New York, pp 1–11 Xing J, Xiong J, Sun N, Ma J (2009) Adaptive and scalable metadata management to support a trillion files. In: Proc conf. on high perform comput netw, storage and anal. ACM, New York, pp 1–11
12.
Zurück zum Zitat Floyd RA, Ellis CS (1989) Directory reference patterns in hierarchical file systems. IEEE Trans Knowl Data Eng 1(2):238 CrossRef Floyd RA, Ellis CS (1989) Directory reference patterns in hierarchical file systems. IEEE Trans Knowl Data Eng 1(2):238 CrossRef
13.
Zurück zum Zitat Wang F et al (2004) File system workload analysis for large scale scientific computing applications. In: Proc 21st IEEE/12th NASA Goddard conf on mass storage syst and technol Wang F et al (2004) File system workload analysis for large scale scientific computing applications. In: Proc 21st IEEE/12th NASA Goddard conf on mass storage syst and technol
14.
Zurück zum Zitat Hua Y et al (2011) Supporting scalable and adaptive metadata management in ultralarge-scale file systems. IEEE Trans Parallel Distrib Syst 22(4):580–593 CrossRef Hua Y et al (2011) Supporting scalable and adaptive metadata management in ultralarge-scale file systems. IEEE Trans Parallel Distrib Syst 22(4):580–593 CrossRef
15.
Zurück zum Zitat Ghemawat S, Gobioff H, Leung ST (2003) The Google file system. In: Proc 9th ACM symp on oper syst princ, pp 96–108 Ghemawat S, Gobioff H, Leung ST (2003) The Google file system. In: Proc 9th ACM symp on oper syst princ, pp 96–108
16.
Zurück zum Zitat Sun M (2009) Clustered metadata design. Sun and Cray Confidential Sun M (2009) Clustered metadata design. Sun and Cray Confidential
17.
Zurück zum Zitat Anderson TE et al (2001) Serverless network file systems. In: Jin H, Cortes T, Buyya R (eds) High perform mass storage and parallel {I/O}: technol and appl. IEEE Comput Soc/Wiley, New York, pp 364–385 Anderson TE et al (2001) Serverless network file systems. In: Jin H, Cortes T, Buyya R (eds) High perform mass storage and parallel {I/O}: technol and appl. IEEE Comput Soc/Wiley, New York, pp 364–385
18.
Zurück zum Zitat Corbett PF, Feitelson DG (2001) The Vesta parallel file system. In: Jin H, Cortes T, Buyya R (eds) High perform mass storage and parallel {I/O}: technol and appl. IEEE Computer Society/Wiley, New York, pp 285–308 Corbett PF, Feitelson DG (2001) The Vesta parallel file system. In: Jin H, Cortes T, Buyya R (eds) High perform mass storage and parallel {I/O}: technol and appl. IEEE Computer Society/Wiley, New York, pp 285–308
19.
Zurück zum Zitat Weil SA, Pollack KT, Brandt SA, Miller EL (2004) Dynamic metadata management for petabyte-scale file systems. In: Proc ACM/IEEE conf supercomput, p 4 CrossRef Weil SA, Pollack KT, Brandt SA, Miller EL (2004) Dynamic metadata management for petabyte-scale file systems. In: Proc ACM/IEEE conf supercomput, p 4 CrossRef
20.
Zurück zum Zitat Brandt SA et al (2003) Efficient metadata management in large distributed storage systems. In: Proc 20th IEEE/11th NASA Goddard conf on mass storage syst and technol, pp 290–298 CrossRef Brandt SA et al (2003) Efficient metadata management in large distributed storage systems. In: Proc 20th IEEE/11th NASA Goddard conf on mass storage syst and technol, pp 290–298 CrossRef
21.
Zurück zum Zitat Xiong J, Hu Y, Li G, Tang R, Fan Z (2011) Metadata distribution and consistency techniques for large-scale cluster file systems. IEEE Trans Parallel Distrib Syst 22(5):803–816 CrossRef Xiong J, Hu Y, Li G, Tang R, Fan Z (2011) Metadata distribution and consistency techniques for large-scale cluster file systems. IEEE Trans Parallel Distrib Syst 22(5):803–816 CrossRef
22.
Zurück zum Zitat Fagin R, Nievergelt J, Pippenger N, Strong HR (1979) Extendible hashing: a fast access method for dynamic files. ACM Trans Database Syst 4(3):315–344 CrossRef Fagin R, Nievergelt J, Pippenger N, Strong HR (1979) Extendible hashing: a fast access method for dynamic files. ACM Trans Database Syst 4(3):315–344 CrossRef
23.
Zurück zum Zitat Hwang I, Maeng S, Cho J (2006) Home-based cooperative cache for parallel I/O applications. Future Gener Comput Syst 22(5):633–642 CrossRef Hwang I, Maeng S, Cho J (2006) Home-based cooperative cache for parallel I/O applications. Future Gener Comput Syst 22(5):633–642 CrossRef
24.
Zurück zum Zitat Turek W, Calleja P (2010) High performance, open source, Dell Lustre storage system. White paper, University of Cambridge, Dell Turek W, Calleja P (2010) High performance, open source, Dell Lustre storage system. White paper, University of Cambridge, Dell
25.
Zurück zum Zitat Kondekar P (2009) MDS performance analysis. Sun microsystems Kondekar P (2009) MDS performance analysis. Sun microsystems
26.
Zurück zum Zitat Kunkel JM, Ludwig T (2007) Performance evaluation of the PVFS2 architecture. In: Proc of the 15th EUROMICRO int conf on parallel, distrib and netw-based process, pp 509–516 Kunkel JM, Ludwig T (2007) Performance evaluation of the PVFS2 architecture. In: Proc of the 15th EUROMICRO int conf on parallel, distrib and netw-based process, pp 509–516
Metadaten
Titel
Two-level Hash/Table approach for metadata management in distributed file systems
verfasst von
Antonio F. Díaz
Mancia Anguita
Hugo E. Camacho
Erik Nieto
Julio Ortega
Publikationsdatum
01.04.2013
Verlag
Springer US
Erschienen in
The Journal of Supercomputing / Ausgabe 1/2013
Print ISSN: 0920-8542
Elektronische ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-012-0801-y

Weitere Artikel der Ausgabe 1/2013

The Journal of Supercomputing 1/2013 Zur Ausgabe

Premium Partner