Skip to main content
Erschienen in: EURO Journal on Transportation and Logistics 3/2012

01.09.2012 | Tutorial

Approximate dynamic programming in transportation and logistics: a unified framework

verfasst von: Warren B. Powell, Hugo P. Simao, Belgacem Bouzaiene-Ayari

Erschienen in: EURO Journal on Transportation and Logistics | Ausgabe 3/2012

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Deterministic optimization has enjoyed a rich place in transportation and logistics, where it represents a mature field with established modeling and algorithmic strategies. By contrast, sequential stochastic optimization models (dynamic programs) have been plagued by the lack of a common modeling framework, and by algorithmic strategies that just do not seem to scale to real-world problems in transportation. This paper is designed as a tutorial of the modeling and algorithmic framework of approximate dynamic programming; however, our perspective on approximate dynamic programming is relatively new, and the approach is new to the transportation research community. We present a simple yet precise modeling framework that makes it possible to integrate most algorithmic strategies into four fundamental classes of policies, the design of which represents approximate solutions to these dynamic programs. The paper then uses problems in transportation and logistics to indicate settings in which each of the four classes of policies represents a natural solution strategy, highlighting the fact that the design of effective policies for these complex problems will remain an exciting area of research for many years. Along the way, we provide a link between dynamic programming, stochastic programming and stochastic search.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Barto AG, Bradtke SJ, Singh SP (1995) Learning to act using real-time dynamic programming. Artificial Intelligence 72(1–2):81–138CrossRef Barto AG, Bradtke SJ, Singh SP (1995) Learning to act using real-time dynamic programming. Artificial Intelligence 72(1–2):81–138CrossRef
Zurück zum Zitat Bellman RE (1957) Dynamic programming. Princeton University Press, Princeton Bellman RE (1957) Dynamic programming. Princeton University Press, Princeton
Zurück zum Zitat Bent RW, Van Hentenryck P (2004) Scenario-based planning for partially dynamic vehicle routing with stochastic customers. Oper Res 52:977–987CrossRef Bent RW, Van Hentenryck P (2004) Scenario-based planning for partially dynamic vehicle routing with stochastic customers. Oper Res 52:977–987CrossRef
Zurück zum Zitat Bertsekas DP, Castanon DA (1999) Rollout algorithms for stochastic scheduling problems. J Heuristics 5:89–108CrossRef Bertsekas DP, Castanon DA (1999) Rollout algorithms for stochastic scheduling problems. J Heuristics 5:89–108CrossRef
Zurück zum Zitat Bertsekas DP, Tsitsiklis JN (1996) Neuro-dynamic programming. Athena Scientific, Belmont Bertsekas DP, Tsitsiklis JN (1996) Neuro-dynamic programming. Athena Scientific, Belmont
Zurück zum Zitat Birge JR, Louveaux F (1997) Introduction to stochastic programming. Springer, New York Birge JR, Louveaux F (1997) Introduction to stochastic programming. Springer, New York
Zurück zum Zitat Braysy O, Gendreau M (2005) Vehicle routing problem with time windows, Part I: route construction and local search algorithms. Transp Sci 39(1):104–118CrossRef Braysy O, Gendreau M (2005) Vehicle routing problem with time windows, Part I: route construction and local search algorithms. Transp Sci 39(1):104–118CrossRef
Zurück zum Zitat Crainic T, Gendreau M (1993) Dynamic and stochastic models for the allocation of empty containers. Oper Res 41(1):102–126CrossRef Crainic T, Gendreau M (1993) Dynamic and stochastic models for the allocation of empty containers. Oper Res 41(1):102–126CrossRef
Zurück zum Zitat Crainic TG, Laporte G (1997) Planning models for freight transportation. Eur J Oper Res 97:409–438 Crainic TG, Laporte G (1997) Planning models for freight transportation. Eur J Oper Res 97:409–438
Zurück zum Zitat Dantzig GB (1951) Application of the simplex method to a transportation problem. In: Koopmans T (ed) Activity analysis of production and allocation. Wiley, New York, pp 359–373 Dantzig GB (1951) Application of the simplex method to a transportation problem. In: Koopmans T (ed) Activity analysis of production and allocation. Wiley, New York, pp 359–373
Zurück zum Zitat Dantzig GB, Ferguson A (1956) The allocation of aircrafts to routes: an example of linear programming under uncertain demand. Manag Sci 3:45–73CrossRef Dantzig GB, Ferguson A (1956) The allocation of aircrafts to routes: an example of linear programming under uncertain demand. Manag Sci 3:45–73CrossRef
Zurück zum Zitat Ferguson AR, Dantzig GB (1955) The problem of routing aircraft: a mathematical solution. Aeronaut Eng Rev 14:51–55 Ferguson AR, Dantzig GB (1955) The problem of routing aircraft: a mathematical solution. Aeronaut Eng Rev 14:51–55
Zurück zum Zitat Gendreau M, Guertin F, Potvin JY, Taillard E (1999) Parallel tabu search for real-time vehicle routing and dispatching. Transp Sci 33:381–190CrossRef Gendreau M, Guertin F, Potvin JY, Taillard E (1999) Parallel tabu search for real-time vehicle routing and dispatching. Transp Sci 33:381–190CrossRef
Zurück zum Zitat Gendreau M, Laporte G, Seguin R (1995) An exact algorithm for the vehicle routing problem with stochastic demands and customers. Transp Sci 29:143–155CrossRef Gendreau M, Laporte G, Seguin R (1995) An exact algorithm for the vehicle routing problem with stochastic demands and customers. Transp Sci 29:143–155CrossRef
Zurück zum Zitat George A, Powell WB, Kulkarni S (2008) Value function approximation using multiple aggregation for multiattribute resource management. J Mach Learn Res 9:2079–2111 George A, Powell WB, Kulkarni S (2008) Value function approximation using multiple aggregation for multiattribute resource management. J Mach Learn Res 9:2079–2111
Zurück zum Zitat Godfrey G, Powell WB (2002) An adaptive dynamic programming algorithm for dynamic fleet management. II: Multiperiod travel times. Transp Sci 36(1): 40–54 Godfrey G, Powell WB (2002) An adaptive dynamic programming algorithm for dynamic fleet management. II: Multiperiod travel times. Transp Sci 36(1): 40–54
Zurück zum Zitat Gorman M, Crook K, Sellers D (2011) North American freight rail industry real-time optimized equipment distribution systems: state of the practice. Transp Res Part C 19(1):103–114CrossRef Gorman M, Crook K, Sellers D (2011) North American freight rail industry real-time optimized equipment distribution systems: state of the practice. Transp Res Part C 19(1):103–114CrossRef
Zurück zum Zitat Hastie T, Tibshirani R, Friedman J (2009) The elements of statistical learning: data mining, inference and prediction. Springer, New York Hastie T, Tibshirani R, Friedman J (2009) The elements of statistical learning: data mining, inference and prediction. Springer, New York
Zurück zum Zitat Haykin S (1999) Neural networks: a comprehensive foundation. Prentice Hall, USA Haykin S (1999) Neural networks: a comprehensive foundation. Prentice Hall, USA
Zurück zum Zitat Higle J, Sen S (1991) Stochastic decomposition: an algorithm for two-stage linear programs with recourse. Math Oper Res 16(3):650–669CrossRef Higle J, Sen S (1991) Stochastic decomposition: an algorithm for two-stage linear programs with recourse. Math Oper Res 16(3):650–669CrossRef
Zurück zum Zitat Higle J, Sen S (1996) Stochastic decomposition: a statistical method for large scale stochastic linear programming. Kluwer Academic Publishers, New York Higle J, Sen S (1996) Stochastic decomposition: a statistical method for large scale stochastic linear programming. Kluwer Academic Publishers, New York
Zurück zum Zitat Hvattum LM, Løkketangen A, Laporte G (2006) Solving a dynamic and stochastic vehicle routing problem with a sample scenario hedging heuristic. Transp Sci 40(4):421–438CrossRef Hvattum LM, Løkketangen A, Laporte G (2006) Solving a dynamic and stochastic vehicle routing problem with a sample scenario hedging heuristic. Transp Sci 40(4):421–438CrossRef
Zurück zum Zitat Ichoua S, Gendreau M, Potvin J-Y (2006) Exploiting knowledge about future demands for real-time vehicle dispatching. Transp Sci 40:211–225CrossRef Ichoua S, Gendreau M, Potvin J-Y (2006) Exploiting knowledge about future demands for real-time vehicle dispatching. Transp Sci 40:211–225CrossRef
Zurück zum Zitat Joborn M, Crainic TG, Gendreau M, Holmberg K, Lundgren JT (2004) Economies of scale in empty freight car distribution in scheduled railways. Transp Sci 38:121–134CrossRef Joborn M, Crainic TG, Gendreau M, Holmberg K, Lundgren JT (2004) Economies of scale in empty freight car distribution in scheduled railways. Transp Sci 38:121–134CrossRef
Zurück zum Zitat Jordan WC, Turnquist MA (1983) A stochastic dynamic network model for railroad car distribution. Transp Sci 17:123–145CrossRef Jordan WC, Turnquist MA (1983) A stochastic dynamic network model for railroad car distribution. Transp Sci 17:123–145CrossRef
Zurück zum Zitat Kall P, Wallace S (1994) Stochastic programming. Wiley, New York Kall P, Wallace S (1994) Stochastic programming. Wiley, New York
Zurück zum Zitat Lam S-w, Lee L-h, Tang L-c (2007) An approximate dynamic programming approach for the empty container allocation problem. Transp Res 15:265–277CrossRef Lam S-w, Lee L-h, Tang L-c (2007) An approximate dynamic programming approach for the empty container allocation problem. Transp Res 15:265–277CrossRef
Zurück zum Zitat Laporte G, Hamme LV, Louveaux FV (2002) An integer L-shaped algorithm for the capacitated vehicle routing problem with stochastic demands. Oper Res 50:415–423CrossRef Laporte G, Hamme LV, Louveaux FV (2002) An integer L-shaped algorithm for the capacitated vehicle routing problem with stochastic demands. Oper Res 50:415–423CrossRef
Zurück zum Zitat Larsen A (2000) The dynamic vehicle routing problem. PhD thesis, Technical University of Denmark. Larsen A (2000) The dynamic vehicle routing problem. PhD thesis, Technical University of Denmark.
Zurück zum Zitat Le Bouthillier A, Crainic TG (2005) A cooperative parallel meta-heuristic for the vehicle routing problem with time windows. Comp Oper Res 32:1685–1708CrossRef Le Bouthillier A, Crainic TG (2005) A cooperative parallel meta-heuristic for the vehicle routing problem with time windows. Comp Oper Res 32:1685–1708CrossRef
Zurück zum Zitat List GF, Wood B, Nozick LK, Turnquist MA, Jones DA, Kjeldgaard EA, Lawton CR (2003) Robust optimization for fleet planning under uncertainty. Transp Res 39:209–227CrossRef List GF, Wood B, Nozick LK, Turnquist MA, Jones DA, Kjeldgaard EA, Lawton CR (2003) Robust optimization for fleet planning under uncertainty. Transp Res 39:209–227CrossRef
Zurück zum Zitat Mercier L, Van Hentenryck P (2011) Amsaa: a multistep anticipatory algorithm for online stochastic combinatorial optimization. Ann Oper Res 184:233–271CrossRef Mercier L, Van Hentenryck P (2011) Amsaa: a multistep anticipatory algorithm for online stochastic combinatorial optimization. Ann Oper Res 184:233–271CrossRef
Zurück zum Zitat Mes MRK, Powell WB, Frazier PI (2011) Hierarchical knowledge gradient for sequential sampling. J Mach Learn Res 12:2931–2974 Mes MRK, Powell WB, Frazier PI (2011) Hierarchical knowledge gradient for sequential sampling. J Mach Learn Res 12:2931–2974
Zurück zum Zitat Pearl J (1984) Heuristics: intelligent search strategies for computer problem solving. Addison-Wesley, Boston Pearl J (1984) Heuristics: intelligent search strategies for computer problem solving. Addison-Wesley, Boston
Zurück zum Zitat Pillac V, Gendreau M, Guéret C, Medaglia A et al (2011) A review of dynamic vehicle routing problems. Working paper, CIRRELT, University of Montreal Pillac V, Gendreau M, Guéret C, Medaglia A et al (2011) A review of dynamic vehicle routing problems. Working paper, CIRRELT, University of Montreal
Zurück zum Zitat Potvin J, Xu Y, Benyahia I (2006) Vehicle routing and scheduling with dynamic travel times. Comp Oper Res 33(4):1129–1137CrossRef Potvin J, Xu Y, Benyahia I (2006) Vehicle routing and scheduling with dynamic travel times. Comp Oper Res 33(4):1129–1137CrossRef
Zurück zum Zitat Powell WB (2011) Approximate dynamic programming: solving the curses of dimensionality, 2nd edn. Wiley, HobokenCrossRef Powell WB (2011) Approximate dynamic programming: solving the curses of dimensionality, 2nd edn. Wiley, HobokenCrossRef
Zurück zum Zitat Powell WB, Godfrey G (2001) An adaptive, distribution-free approximation for the newsvendor problem with censored demands, with applications to inventory and distribution problems. Manag Sci 47(8):1101–1112CrossRef Powell WB, Godfrey G (2001) An adaptive, distribution-free approximation for the newsvendor problem with censored demands, with applications to inventory and distribution problems. Manag Sci 47(8):1101–1112CrossRef
Zurück zum Zitat Powell WB, Simao HP (2009) Approximate dynamic programming for management of high value spare parts. J Manuf Technol Manag 20(2):147–160CrossRef Powell WB, Simao HP (2009) Approximate dynamic programming for management of high value spare parts. J Manuf Technol Manag 20(2):147–160CrossRef
Zurück zum Zitat Powell WB, Topaloglu H (2005) Fleet management. In: Wallace S, Ziemba W (eds) Applications of stochastic programming. Math Programming Society: SIAM Series in Optimization, Philadelphia, pp 185–216CrossRef Powell WB, Topaloglu H (2005) Fleet management. In: Wallace S, Ziemba W (eds) Applications of stochastic programming. Math Programming Society: SIAM Series in Optimization, Philadelphia, pp 185–216CrossRef
Zurück zum Zitat Powell WB, Topaloglu H (2006) Dynamic-programming approximations for stochastic time-staged integer multicommodity-flow problems. Informs J Comput 18(1):31CrossRef Powell WB, Topaloglu H (2006) Dynamic-programming approximations for stochastic time-staged integer multicommodity-flow problems. Informs J Comput 18(1):31CrossRef
Zurück zum Zitat Powell WB, Bouzaiene-Ayari B, Cheng C, Fiorillo R, Das S, Lawrence C (2012a) Strategic, tactical and real-time planning of locomotives at Nortfolk Southern using approximate dynamic programming. In: ASME Joint Rail Conference, ASME, Philadelphia Powell WB, Bouzaiene-Ayari B, Cheng C, Fiorillo R, Das S, Lawrence C (2012a) Strategic, tactical and real-time planning of locomotives at Nortfolk Southern using approximate dynamic programming. In: ASME Joint Rail Conference, ASME, Philadelphia
Zurück zum Zitat Powell WB, Bouzaiene-Ayari B, Lawrence C, Cheng C, Das S, Fiorillo R (2012b) JRC2012-74187. In: ASME Joint Rail Conference, Philadelphia, pp 1–10 Powell WB, Bouzaiene-Ayari B, Lawrence C, Cheng C, Das S, Fiorillo R (2012b) JRC2012-74187. In: ASME Joint Rail Conference, Philadelphia, pp 1–10
Zurück zum Zitat Powell WB, George A, Lamont A, Stewart J (2011) SMART: a stochastic multiscale model for the analysis of energy resources, technology and policy. Informs J Comput Powell WB, George A, Lamont A, Stewart J (2011) SMART: a stochastic multiscale model for the analysis of energy resources, technology and policy. Informs J Comput
Zurück zum Zitat Powell WB, Jaillet P, Odoni A (1995) Stochastic and dynamic networks and routing. In: Ball MO, Magnanti TL, Monma CL, Nemhauser GL (eds) Network routing: handbooks in operations research and management science, vol 8. North-Holland, Amsterdam, The Netherlands, pp 141–295 Powell WB, Jaillet P, Odoni A (1995) Stochastic and dynamic networks and routing. In: Ball MO, Magnanti TL, Monma CL, Nemhauser GL (eds) Network routing: handbooks in operations research and management science, vol 8. North-Holland, Amsterdam, The Netherlands, pp 141–295
Zurück zum Zitat Powell WB, Ruszczynski A, Topaloglu H (2004) Learning algorithms for separable approximations of discrete stochastic optimization problems. Math Oper Res 29(4):814–836CrossRef Powell WB, Ruszczynski A, Topaloglu H (2004) Learning algorithms for separable approximations of discrete stochastic optimization problems. Math Oper Res 29(4):814–836CrossRef
Zurück zum Zitat Powell WB, Simao HP,Shapiro JA (2001) A representational paradigm for dynamic resource transformation problems. in R. In: Coullard FC, Owens JH (eds) Annals of operations research, J. C. Baltzer AG, The Netherlands, pp 231–279 Powell WB, Simao HP,Shapiro JA (2001) A representational paradigm for dynamic resource transformation problems. in R. In: Coullard FC, Owens JH (eds) Annals of operations research, J. C. Baltzer AG, The Netherlands, pp 231–279
Zurück zum Zitat Puterman ML (2005) Markov decision processes, 2nd edn. Wiley, Hoboken Puterman ML (2005) Markov decision processes, 2nd edn. Wiley, Hoboken
Zurück zum Zitat Rockafellar RT, Wets R (1991) Scenarios and policy aggregation in optimization under uncertainty. Math Oper Res 16(1):119–147CrossRef Rockafellar RT, Wets R (1991) Scenarios and policy aggregation in optimization under uncertainty. Math Oper Res 16(1):119–147CrossRef
Zurück zum Zitat Romisch W, Heitsch H (2009) Scenario tree modeling for multistage stochastic programs. Math Program 118:371–406CrossRef Romisch W, Heitsch H (2009) Scenario tree modeling for multistage stochastic programs. Math Program 118:371–406CrossRef
Zurück zum Zitat Schilde M, Doerner KF, Hartl RF (2011) Metaheuristics for the dynamic stochastic dial-a-ride problem with expected return transports. Comp Oper Res 38(12):1719–1730CrossRef Schilde M, Doerner KF, Hartl RF (2011) Metaheuristics for the dynamic stochastic dial-a-ride problem with expected return transports. Comp Oper Res 38(12):1719–1730CrossRef
Zurück zum Zitat Shapiro A, Dentcheva D, Ruszczynski A (2009) Lectures on stochastic programming: modeling and theory. SIAM, PhiladelphiaCrossRef Shapiro A, Dentcheva D, Ruszczynski A (2009) Lectures on stochastic programming: modeling and theory. SIAM, PhiladelphiaCrossRef
Zurück zum Zitat Simao HP, Day J, George A, Gifford T, Powell WB, Nienow J (2009) An approximate dynamic programming algorithm for large-scale fleet management: a case application. Transp Sci 43(2):178–197CrossRef Simao HP, Day J, George A, Gifford T, Powell WB, Nienow J (2009) An approximate dynamic programming algorithm for large-scale fleet management: a case application. Transp Sci 43(2):178–197CrossRef
Zurück zum Zitat Simao HP, George A, Powell WB, Gifford T, Nienow J, Day J (2010) Approximate dynamic programming captures fleet operations for schneider national. Interfaces 40(5):342–352CrossRef Simao HP, George A, Powell WB, Gifford T, Nienow J, Day J (2010) Approximate dynamic programming captures fleet operations for schneider national. Interfaces 40(5):342–352CrossRef
Zurück zum Zitat Spall JC (2003) Introduction to stochastic search and optimization: estimation, smulation and control. Wiley, HobokenCrossRef Spall JC (2003) Introduction to stochastic search and optimization: estimation, smulation and control. Wiley, HobokenCrossRef
Zurück zum Zitat Taillard E, Badeau P, Gendreau M, Guertin F, Potfin J-Y (1997) A Tabu search heuristic for the vehicle routing problem with soft time windows. Transp Sci 31(2):170–186CrossRef Taillard E, Badeau P, Gendreau M, Guertin F, Potfin J-Y (1997) A Tabu search heuristic for the vehicle routing problem with soft time windows. Transp Sci 31(2):170–186CrossRef
Zurück zum Zitat Topaloglu H, Powell WB (2006) Dynamic programming approximations for stochastic. Time-staged integer multicommodity flow problems. Informs J Comput 18:31–42CrossRef Topaloglu H, Powell WB (2006) Dynamic programming approximations for stochastic. Time-staged integer multicommodity flow problems. Informs J Comput 18:31–42CrossRef
Zurück zum Zitat Toth P, Vigo D (eds) (2002) The vehicle routing problem, vol 9, SIAM Monographs on Discrete Mathematics and Applications Toth P, Vigo D (eds) (2002) The vehicle routing problem, vol 9, SIAM Monographs on Discrete Mathematics and Applications
Zurück zum Zitat Tsitsiklis JN (1994) Asynchronous stochastic approximation and Q-learning. Mach Learn 16:185–202 Tsitsiklis JN (1994) Asynchronous stochastic approximation and Q-learning. Mach Learn 16:185–202
Zurück zum Zitat Tsitsiklis JN, Roy B (1996) Feature-based methods for large scale dynamic programming. Mach Learn 22(1):59–94 Tsitsiklis JN, Roy B (1996) Feature-based methods for large scale dynamic programming. Mach Learn 22(1):59–94
Zurück zum Zitat Van Hentenryck P, Bent RW (2009) Onlinestochastic combinatorial optimization. MIT Press, Cambridge Van Hentenryck P, Bent RW (2009) Onlinestochastic combinatorial optimization. MIT Press, Cambridge
Zurück zum Zitat Van Hentenryck P, Bent RW, Upfal E (2009) Online stochastic optimization under time constraints. Ann Oper Res 177(1):151–183CrossRef Van Hentenryck P, Bent RW, Upfal E (2009) Online stochastic optimization under time constraints. Ann Oper Res 177(1):151–183CrossRef
Metadaten
Titel
Approximate dynamic programming in transportation and logistics: a unified framework
verfasst von
Warren B. Powell
Hugo P. Simao
Belgacem Bouzaiene-Ayari
Publikationsdatum
01.09.2012
Verlag
Springer-Verlag
Erschienen in
EURO Journal on Transportation and Logistics / Ausgabe 3/2012
Print ISSN: 2192-4376
Elektronische ISSN: 2192-4384
DOI
https://doi.org/10.1007/s13676-012-0015-8

Premium Partner