2016 | OriginalPaper | Book chapter

Persistence Management in Digital Document Repository

Written by: Piotr Pałka, Tomasz Śliwiński, Tomasz Traczyk, Włodzimierz Ogryczak

Published in: Beyond Databases, Architectures and Structures. Advanced Technologies for Data Mining and Knowledge Discovery

Publisher: Springer International Publishing

Abstract

The CREDO Digital Document Repository enables short- and long-term archiving of large volumes of digital resources, ensuring bitstream preservation and providing most of the technical means needed for content preservation. The goal of the paper is to describe the design and implementation of an innovative component of the CREDO Repository: the Persistence Management Subsystem (PMS). This subsystem gives the file management system guidelines on replica placement and data relocation. The module responsible for scheduling access to the archive provides energy efficiency by setting suboptimal schedules. The module responsible for the diagnosis and exchange of data carriers calculates their probabilities of failure; the scheduling module uses this information to select appropriate storage areas for reading or writing data and to mark areas as obsolete. Finally, the power management module is responsible for starting up storage areas only when necessary.
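
The interplay between failure estimation, area selection, and power management can be illustrated with a minimal Python sketch. It is not the CREDO implementation: the abstract does not say how failure probabilities are computed, so the sketch assumes a constant failure rate (exponential model) derived from the producer-stated MTBF. The names `StorageArea`, `failure_probability`, `select_area`, and the `obsolete_threshold` parameter are hypothetical, and preferring areas that are already powered on is only one plausible reading of "starting up storage areas only when necessary".

```python
import math
from dataclasses import dataclass


@dataclass
class StorageArea:
    """Hypothetical description of one storage area in the archive."""
    name: str
    mtbf_hours: float          # MTBF as stated by the media producer
    powered_on: bool = False
    obsolete: bool = False


def failure_probability(area: StorageArea, horizon_hours: float) -> float:
    """Probability of failure within the horizon.

    Assumption: constant failure rate, so P = 1 - exp(-t / MTBF).
    """
    return 1.0 - math.exp(-horizon_hours / area.mtbf_hours)


def select_area(areas, horizon_hours, obsolete_threshold):
    """Mark risky areas obsolete and pick one of the remaining areas.

    Areas that are already running are preferred (to avoid a start-up);
    ties are broken by the lower failure probability.
    """
    candidates = []
    for area in areas:
        p = failure_probability(area, horizon_hours)
        if p > obsolete_threshold:
            area.obsolete = True   # too risky: exclude from reads and writes
            continue
        candidates.append((not area.powered_on, p, area))
    if not candidates:
        return None
    chosen = min(candidates, key=lambda c: (c[0], c[1]))[2]
    if not chosen.powered_on:
        chosen.powered_on = True   # start the area only when it is needed
    return chosen


areas = [
    StorageArea("A", mtbf_hours=1_000_000),
    StorageArea("B", mtbf_hours=500_000, powered_on=True),
    StorageArea("C", mtbf_hours=10_000),
]
chosen = select_area(areas, horizon_hours=8760, obsolete_threshold=0.5)
print(chosen.name, [a.name for a in areas if a.obsolete])
```

In this example, area C exceeds the threshold and is marked obsolete, and the running area B is chosen over the more reliable but powered-down area A. A real scheduler would weigh start-up energy against failure risk rather than apply a fixed preference.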

Footnotes
1
CREDO is an acronym of the Polish name Cyfrowe REpozytorium DOkumentów, which means ‘Digital Document Repository’. In Latin, credo means ‘I believe’, which seems to be quite a good watchword for a trustworthy digital repository.

2
Mean Time Between Failures, a parameter given by the media producers.
 
Metadata
Title
Persistence Management in Digital Document Repository
Written by
Piotr Pałka
Tomasz Śliwiński
Tomasz Traczyk
Włodzimierz Ogryczak
Copyright year
2016
DOI
https://doi.org/10.1007/978-3-319-34099-9_52
