Skip to main content
Erschienen in: Journal of Intelligent Manufacturing 1/2019

18.06.2016

Optimal preventive maintenance policy based on reinforcement learning of a fleet of military trucks

verfasst von: Stephane R. A. Barde, Soumaya Yacout, Hayong Shin

Erschienen in: Journal of Intelligent Manufacturing | Ausgabe 1/2019

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this paper, we model preventive maintenance strategies for equipment composed of multi-non-identical components which have different time-to-failure probability distribution, by using a Markov decision process (MDP). The originality of this paper resides in the fact that a Monte Carlo reinforcement learning (MCRL) approach is used to find the optimal policy for each different strategy. The approach is applied to an already existing published application which deals with a fleet of military trucks. The fleet consists of a group of similar trucks that are composed of non-identical components. The problem is formulated as a MDP and solved by a MCRL technique. The advantage of this modeling technique when compared to the published one is that there is no need to estimate the main parameters of the model, for example the estimation of the transition probabilities. These parameters are treated as variables and they are found by the modeling technique, while searching for the optimal solution. Moreover, the technique is not bounded by any explicit mathematical formula, and it converges to the optimal solution whereas the previous model optimizes the replacement policy of each component separately, which leads to a local optimization. The results show that by using the reinforcement learning approach, we are able of getting a 36.44 % better solution that is less downtime.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Abdel Haleem, B., & Yacout, S. (1998). Simulation of components replacement policies for a fleet of military trucks. Quality Engineering, 11(2), 303–308.CrossRef Abdel Haleem, B., & Yacout, S. (1998). Simulation of components replacement policies for a fleet of military trucks. Quality Engineering, 11(2), 303–308.CrossRef
Zurück zum Zitat Das, T. K., & Sarkar, S. (1999). Optimal preventive maintenance in a production inventory system. IIE Transactions, 31(6), 537–551. Das, T. K., & Sarkar, S. (1999). Optimal preventive maintenance in a production inventory system. IIE Transactions, 31(6), 537–551.
Zurück zum Zitat Gelly, S., Kocsis, L., Schoenauer, M., Sebag, M., Silver, D., Szepesvári, C., et al. (2012). The grand challenge of computer Go: Monte Carlo tree search and extensions. Communications of the ACM, 55(3), 106–113.CrossRef Gelly, S., Kocsis, L., Schoenauer, M., Sebag, M., Silver, D., Szepesvári, C., et al. (2012). The grand challenge of computer Go: Monte Carlo tree search and extensions. Communications of the ACM, 55(3), 106–113.CrossRef
Zurück zum Zitat Gosavi, A. (2004). Reinforcement learning for long-run average cost. European Journal of Operational Research, 155(3), 654–674.CrossRef Gosavi, A. (2004). Reinforcement learning for long-run average cost. European Journal of Operational Research, 155(3), 654–674.CrossRef
Zurück zum Zitat Jardine, A. K., & Tsang, A. H. (2013). Maintenance, replacement, and reliability: Theory and applications. Boca Raton: CRC Press.CrossRef Jardine, A. K., & Tsang, A. H. (2013). Maintenance, replacement, and reliability: Theory and applications. Boca Raton: CRC Press.CrossRef
Zurück zum Zitat Jia, Q.-S. (2010). A structural property of optimal policies for multi-component maintenance problems. IEEE Transactions on Automation Science and Engineering, 7(3), 677–680.CrossRef Jia, Q.-S. (2010). A structural property of optimal policies for multi-component maintenance problems. IEEE Transactions on Automation Science and Engineering, 7(3), 677–680.CrossRef
Zurück zum Zitat Powell, W. B. (2007). Approximate dynamic programming: Solving the curses of dimensionality (Vol. 703). New York: Wiley.CrossRef Powell, W. B. (2007). Approximate dynamic programming: Solving the curses of dimensionality (Vol. 703). New York: Wiley.CrossRef
Zurück zum Zitat Steven, B. (2001). J. D. Campbell, A. K. Jardine, & W. M. Dekker (Eds.), Maintenance excellence, optimizing equipment life-cycle decisions, pp. 43–44. Steven, B. (2001). J. D. Campbell, A. K. Jardine, & W. M. Dekker (Eds.), Maintenance excellence, optimizing equipment life-cycle decisions, pp. 43–44.
Zurück zum Zitat Sutton, R. S., & Andrew, G. B. (1998). Reinforcement learning: An introduction (Vol. 1, No. 1). Cambridge: MIT press. Sutton, R. S., & Andrew, G. B. (1998). Reinforcement learning: An introduction (Vol. 1, No. 1). Cambridge: MIT press.
Zurück zum Zitat Szepesvári, C. (2010). Algorithms for reinforcement learning. Synthesis Lectures on Artificial Intelligence and Machine Learning, 4(1), 1–103.CrossRef Szepesvári, C. (2010). Algorithms for reinforcement learning. Synthesis Lectures on Artificial Intelligence and Machine Learning, 4(1), 1–103.CrossRef
Zurück zum Zitat Tsitsiklis, J. N. (2003). On the convergence of optimistic policy iteration. The Journal of Machine Learning Research, 3, 59–72. Tsitsiklis, J. N. (2003). On the convergence of optimistic policy iteration. The Journal of Machine Learning Research, 3, 59–72.
Zurück zum Zitat Tuncel, E., Zeid, A., & Kamarthi, S. (2014). Solving large scale disassembly line balancing problem with uncertainty using reinforcement learning. Journal of Intelligent Manufacturing, 25(4), 647–659. Tuncel, E., Zeid, A., & Kamarthi, S. (2014). Solving large scale disassembly line balancing problem with uncertainty using reinforcement learning. Journal of Intelligent Manufacturing, 25(4), 647–659.
Zurück zum Zitat Wang, X., Wang, H., & Qi, C. (2014). Multi-agent reinforcement learning based maintenance policy for a resource constrained flow line system. Journal of Intelligent Manufacturing, 27(2), 325–333. Wang, X., Wang, H., & Qi, C. (2014). Multi-agent reinforcement learning based maintenance policy for a resource constrained flow line system. Journal of Intelligent Manufacturing, 27(2), 325–333.
Zurück zum Zitat Wang, J. W., Wang, H., Ip, W. H., Furuta, K., & Zhang, W. J. (2013). Predatory search strategy based on swarm intelligence for continuous optimization problems. Mathematical Problems in Engineering. 11 pp. doi:10.1155/2013/749256 Wang, J. W., Wang, H., Ip, W. H., Furuta, K., & Zhang, W. J. (2013). Predatory search strategy based on swarm intelligence for continuous optimization problems. Mathematical Problems in Engineering. 11 pp. doi:10.​1155/​2013/​749256
Zurück zum Zitat Zhang, W. J., & Van Luttervelt, C. A. (2011). Toward a resilient manufacturing system. CIRP Annals-Manufacturing Technology, 60(1), 469–472.CrossRef Zhang, W. J., & Van Luttervelt, C. A. (2011). Toward a resilient manufacturing system. CIRP Annals-Manufacturing Technology, 60(1), 469–472.CrossRef
Metadaten
Titel
Optimal preventive maintenance policy based on reinforcement learning of a fleet of military trucks
verfasst von
Stephane R. A. Barde
Soumaya Yacout
Hayong Shin
Publikationsdatum
18.06.2016
Verlag
Springer US
Erschienen in
Journal of Intelligent Manufacturing / Ausgabe 1/2019
Print ISSN: 0956-5515
Elektronische ISSN: 1572-8145
DOI
https://doi.org/10.1007/s10845-016-1237-7

Weitere Artikel der Ausgabe 1/2019

Journal of Intelligent Manufacturing 1/2019 Zur Ausgabe

    Marktübersichten

    Die im Laufe eines Jahres in der „adhäsion“ veröffentlichten Marktübersichten helfen Anwendern verschiedenster Branchen, sich einen gezielten Überblick über Lieferantenangebote zu verschaffen.