Skip to main content

2024 | OriginalPaper | Buchkapitel

2. Multiobjective Control

verfasst von : Julio B. Clempner, Alexander Poznyak

Erschienen in: Optimization and Games for Controllable Markov Chains

Verlag: Springer Nature Switzerland

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

A multi-objective Pareto front solution is presented in this chapter for a particular type of discrete-time ergodic controllable Markov chains. We offer a technique that, given specific boundaries, chooses the best multi-objective option for the Pareto frontier as a decision support system. We only consider a class of finite, ergodic, and controllable Markov chains while addressing this issue. The regularized penalty method utilizes a projection-gradient strategy to identify the Pareto policies along the Pareto frontier and is based on Tikhonov’s regularization method. The goal is to make the parameters as efficient as possible while still maintaining the original form of the functional. After setting the initial value, we gradually reduce it until each policy closely resembles the Pareto policy. In this sense, we specify the precise direction of the parameter tendencies toward zero and establish the convergence of the gradient regularized penalty algorithm. The matching picture in the objective space receives a Pareto frontier of only Pareto policies thanks to our policy-gradient multi-objective algorithms, which, on the other hand, use a gradient-based strategy. In order to improve security when transporting cash and valuables, we empirically validate the technique by providing a numerical example of a genuine alternative solution to the vehicle routing planning problem. In addition, we describe a portfolio optimization and represent the Pareto frontier. The decision-making techniques investigated in this paper are consistent with the most widely used computational intelligent models in the Artificial Intelligence research field.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Fußnoten
1
By far most of the methods for the computation of single Pareto points or the entire Pareto set are based on a “scalarization” of the MOP (see e.g. [9, 10, 27, 29]).
 
Literatur
1.
Zurück zum Zitat Beltrami, E., Katehakis, M., Durinovic, S.: Multiobjective markov decisions in urban modelling. Math. Model. 6(4), 333–338 (1995)CrossRefMATH Beltrami, E., Katehakis, M., Durinovic, S.: Multiobjective markov decisions in urban modelling. Math. Model. 6(4), 333–338 (1995)CrossRefMATH
2.
Zurück zum Zitat Benson, H.P., et al.: Matthias ehrgott, multicriteria optimization. Springer (2005) ISBN 3-540-21398-8. 323 p. Eur. J. Oper. Res. 176(3), 1961–1964 (2007) Benson, H.P., et al.: Matthias ehrgott, multicriteria optimization. Springer (2005) ISBN 3-540-21398-8. 323 p. Eur. J. Oper. Res. 176(3), 1961–1964 (2007)
3.
Zurück zum Zitat Clempner, J.B.: Necessary and sufficient karush-kuhn-tucker conditions for multiobjective markov chains optimality. Automatica 71, 135–142 (2016)MathSciNetCrossRefMATH Clempner, J.B.: Necessary and sufficient karush-kuhn-tucker conditions for multiobjective markov chains optimality. Automatica 71, 135–142 (2016)MathSciNetCrossRefMATH
4.
Zurück zum Zitat Clempner, J.B.: Computing multiobjective markov chains handled by the extraproximal method. Ann. Oper. Res. 271, 469–486 (2018)MathSciNetCrossRefMATH Clempner, J.B.: Computing multiobjective markov chains handled by the extraproximal method. Ann. Oper. Res. 271, 469–486 (2018)MathSciNetCrossRefMATH
5.
Zurück zum Zitat Clempner, J.B.: A team formation method based on a markov chains games approach. Cybern. Syst. 50(5), 417–443 (2019)CrossRef Clempner, J.B.: A team formation method based on a markov chains games approach. Cybern. Syst. 50(5), 417–443 (2019)CrossRef
6.
Zurück zum Zitat Clempner, J.B., Poznyak, A.S.: Using the manhattan distance for computing the multiobjective markov chains problem. Int. J. Comput. Math. 95(11), 2269–2286 (2017)MathSciNetCrossRefMATH Clempner, J.B., Poznyak, A.S.: Using the manhattan distance for computing the multiobjective markov chains problem. Int. J. Comput. Math. 95(11), 2269–2286 (2017)MathSciNetCrossRefMATH
7.
Zurück zum Zitat Clempner, J.B., Poznyak, A.S.: A tikhonov regularization parameter approach for solving lagrange constrained optimization problems. Eng. Optim. 50(11), 1996–2012 (2018)MathSciNetCrossRefMATH Clempner, J.B., Poznyak, A.S.: A tikhonov regularization parameter approach for solving lagrange constrained optimization problems. Eng. Optim. 50(11), 1996–2012 (2018)MathSciNetCrossRefMATH
8.
Zurück zum Zitat Clempner, J.B., Poznyak, A.S.: A tikhonov regularized penalty function approach for solving polylinear programming problems. J. Comput. Appl. Math. 328, 267–286 (2018)MathSciNetCrossRefMATH Clempner, J.B., Poznyak, A.S.: A tikhonov regularized penalty function approach for solving polylinear programming problems. J. Comput. Appl. Math. 328, 267–286 (2018)MathSciNetCrossRefMATH
9.
Zurück zum Zitat Das, I., Dennis, J.E.: Normal-boundary intersection: an alternate approach for generating pareto-optimal points in multicriteria optimization problems. SIAM J. Optim. 8, 631–657 (1998)MathSciNetCrossRefMATH Das, I., Dennis, J.E.: Normal-boundary intersection: an alternate approach for generating pareto-optimal points in multicriteria optimization problems. SIAM J. Optim. 8, 631–657 (1998)MathSciNetCrossRefMATH
10.
Zurück zum Zitat Fliege, J., Heseler, A.: Constructing approximations to the efficient set of convex quadratic multi-objective problems. University of Dortmund, Germany, Technical report (2003) Fliege, J., Heseler, A.: Constructing approximations to the efficient set of convex quadratic multi-objective problems. University of Dortmund, Germany, Technical report (2003)
11.
Zurück zum Zitat Garcia, C.B., Zangwill, W.I.: Pathways to Solutions, Fixed Points and Equilibria. Prentice-Hall, Englewood Cliffs (1981) Garcia, C.B., Zangwill, W.I.: Pathways to Solutions, Fixed Points and Equilibria. Prentice-Hall, Englewood Cliffs (1981)
12.
Zurück zum Zitat Garcia-Galicia, M., Carsteanu, A.A., Clempner, J.: Continuous-time learning method for customer portfolio with time penalization. Expert Syst. Appl. 129, 27–36 (2019)CrossRef Garcia-Galicia, M., Carsteanu, A.A., Clempner, J.: Continuous-time learning method for customer portfolio with time penalization. Expert Syst. Appl. 129, 27–36 (2019)CrossRef
13.
Zurück zum Zitat Garcia-Galicia, M., Carsteanu, A.A., Clempner, J.: Continuous-time mean variance portfolio with transaction costs: a proximal approach involving time penalization. Int. J. Gen Syst 48(2), 91–111 (2019)MathSciNetCrossRef Garcia-Galicia, M., Carsteanu, A.A., Clempner, J.: Continuous-time mean variance portfolio with transaction costs: a proximal approach involving time penalization. Int. J. Gen Syst 48(2), 91–111 (2019)MathSciNetCrossRef
14.
Zurück zum Zitat Germeyer, Y.: Introduction to the Theory of Operations Research. Nauka, Moscow (1971) Germeyer, Y.: Introduction to the Theory of Operations Research. Nauka, Moscow (1971)
15.
Zurück zum Zitat Kiefer, J., Wolfowitz, J.: Stochastic estimation of the maximum of a regression function. Ann. Math. Stat. 23(2), 462–466 (1952) Kiefer, J., Wolfowitz, J.: Stochastic estimation of the maximum of a regression function. Ann. Math. Stat. 23(2), 462–466 (1952)
16.
Zurück zum Zitat Markowitz, H.: Portfolio selection. J. Finance 7, 77–98 (1952) Markowitz, H.: Portfolio selection. J. Finance 7, 77–98 (1952)
17.
Zurück zum Zitat Markowitz, H.: The optimization of a quadratic function subject to linear constraints. Nav. Res. Logist. Q. 3, 111–133 (1956)MathSciNetCrossRef Markowitz, H.: The optimization of a quadratic function subject to linear constraints. Nav. Res. Logist. Q. 3, 111–133 (1956)MathSciNetCrossRef
18.
Zurück zum Zitat Markowitz, H.M.: Mean-variance analysis. In: Finance, pp. 194–198. Springer (1989) Markowitz, H.M.: Mean-variance analysis. In: Finance, pp. 194–198. Springer (1989)
19.
Zurück zum Zitat Miettinen, K.: Nonlinear multiobjective optimization, vol. 12. Springer Science & Business Media (2012) Miettinen, K.: Nonlinear multiobjective optimization, vol. 12. Springer Science & Business Media (2012)
20.
Zurück zum Zitat Novák, J.: Linear programming in tector criterion markov and semi-markov decision processes. Optim. 20(5), 651–670 (1989)CrossRefMATH Novák, J.: Linear programming in tector criterion markov and semi-markov decision processes. Optim. 20(5), 651–670 (1989)CrossRefMATH
21.
Zurück zum Zitat Ortiz-Cerezo, L., Carsteanu, A., Clempner, J.B.: Optimal constrained portfolio analysis for incomplete information and transaction costs. Econ. Comput. Econ. Cybern. Stud. Res. 4(56), 107–121 (2022) Ortiz-Cerezo, L., Carsteanu, A., Clempner, J.B.: Optimal constrained portfolio analysis for incomplete information and transaction costs. Econ. Comput. Econ. Cybern. Stud. Res. 4(56), 107–121 (2022)
22.
Zurück zum Zitat Ortiz-Cerezo, L., Carsteanu, A., Clempner, J.B.: Sharpe-ratio portfolio in controllable markov chains: analytic and algorithmic approach for second order cone programming. Mathematics 10(18), 3221 (2022)CrossRef Ortiz-Cerezo, L., Carsteanu, A., Clempner, J.B.: Sharpe-ratio portfolio in controllable markov chains: analytic and algorithmic approach for second order cone programming. Mathematics 10(18), 3221 (2022)CrossRef
23.
Zurück zum Zitat Poznyak, A.S.: Advanced Mathematical Tools for Automatic Control Engineers. Deterministic Technique, vol. 1. Elsevier, Amsterdam, Oxford (2008) Poznyak, A.S.: Advanced Mathematical Tools for Automatic Control Engineers. Deterministic Technique, vol. 1. Elsevier, Amsterdam, Oxford (2008)
24.
Zurück zum Zitat Poznyak, A.S., Najim, K., Gómez-Ramírez, E.: Self-learning Control of Finite Markov Chains. Marcel Dekker, Inc. (2000) Poznyak, A.S., Najim, K., Gómez-Ramírez, E.: Self-learning Control of Finite Markov Chains. Marcel Dekker, Inc. (2000)
25.
Zurück zum Zitat Sánchez, E.M., Clempner, J.B., Poznyak, A.S.: A priori-knowledge/actor-critic reinforcement learning architecture for computing the mean-variance customer portfolio: the case of bank marketing campaigns. Eng. Appl. Artif. Intell. 46, Part A, 82–92 (2015) Sánchez, E.M., Clempner, J.B., Poznyak, A.S.: A priori-knowledge/actor-critic reinforcement learning architecture for computing the mean-variance customer portfolio: the case of bank marketing campaigns. Eng. Appl. Artif. Intell. 46, Part A, 82–92 (2015)
26.
Zurück zum Zitat Sánchez, E.M., Clempner, J.B., Poznyak, A.S.: Solving the mean-variance customer portfolio in markov chains using iterated quadratic/lagrange programming: a credit-card customer-credit limits approach. Expert Syst. Appl. 42(12), 5315–5327 (2015)CrossRef Sánchez, E.M., Clempner, J.B., Poznyak, A.S.: Solving the mean-variance customer portfolio in markov chains using iterated quadratic/lagrange programming: a credit-card customer-credit limits approach. Expert Syst. Appl. 42(12), 5315–5327 (2015)CrossRef
27.
Zurück zum Zitat Schittkowski, K.: Easy-opt: an interactive optimization system with automatic differentiation - user’s guide. Department of Mathematics, University of Bayreuth, Technical report (1999) Schittkowski, K.: Easy-opt: an interactive optimization system with automatic differentiation - user’s guide. Department of Mathematics, University of Bayreuth, Technical report (1999)
28.
Zurück zum Zitat Sen, C.: A new approach for multi-objective rural development planning. Indian Econ. J. 30(4), 91–96 (1983) Sen, C.: A new approach for multi-objective rural development planning. Indian Econ. J. 30(4), 91–96 (1983)
29.
Zurück zum Zitat Steuer, R.E.: The Tchebycheff procedure of interactive multiple objective programming. In: Multiple Criteria Decision Making and Risk Analysis Using Microcomputers, pp. 235–249. Springer, Berlin (1989) Steuer, R.E.: The Tchebycheff procedure of interactive multiple objective programming. In: Multiple Criteria Decision Making and Risk Analysis Using Microcomputers, pp. 235–249. Springer, Berlin (1989)
30.
Zurück zum Zitat Tikhonov, A., Goncharsky, A., Stepanov, V., Yagola, A.G.: Numerical Methods for the Solution of Ill-Posed Problems. Kluwer Academic Publishers (1995) Tikhonov, A., Goncharsky, A., Stepanov, V., Yagola, A.G.: Numerical Methods for the Solution of Ill-Posed Problems. Kluwer Academic Publishers (1995)
31.
Zurück zum Zitat Tikhonov, A.N., Arsenin, V.Y.: Solution of Ill-posed Problems. Winston & Sons, Washington (1977)MATH Tikhonov, A.N., Arsenin, V.Y.: Solution of Ill-posed Problems. Winston & Sons, Washington (1977)MATH
32.
Zurück zum Zitat Vazquez, E., Clempner, J.B.: Customer portfolio model driven by continuous-time markov chains: an l2 lagrangian regularization method. Econ. Comput. Econ. Cybern. Stud. Res. 2, 23–40 (2020) Vazquez, E., Clempner, J.B.: Customer portfolio model driven by continuous-time markov chains: an l2 lagrangian regularization method. Econ. Comput. Econ. Cybern. Stud. Res. 2, 23–40 (2020)
33.
Zurück zum Zitat Wang, Y.M.: On lexicographic goal programming method for generating weights from inconsistent interval comparison matrices. Appl. Math. Comput. 173(2), 985–991 (2006)MathSciNetMATH Wang, Y.M.: On lexicographic goal programming method for generating weights from inconsistent interval comparison matrices. Appl. Math. Comput. 173(2), 985–991 (2006)MathSciNetMATH
35.
Zurück zum Zitat Zangwill, W.I.: Nonlinear Programming: A Unified Approach. Prentice-Halt, Englewood Cliffs (1969)MATH Zangwill, W.I.: Nonlinear Programming: A Unified Approach. Prentice-Halt, Englewood Cliffs (1969)MATH
36.
Zurück zum Zitat Zhang, R., Golovin, D.: Random hypervolume scalarizations for provable multi-objective black box optimization. In: International Conference on Machine Learning, pp. 11096–11105. PMLR (2020) Zhang, R., Golovin, D.: Random hypervolume scalarizations for provable multi-objective black box optimization. In: International Conference on Machine Learning, pp. 11096–11105. PMLR (2020)
Metadaten
Titel
Multiobjective Control
verfasst von
Julio B. Clempner
Alexander Poznyak
Copyright-Jahr
2024
DOI
https://doi.org/10.1007/978-3-031-43575-1_2

    Marktübersichten

    Die im Laufe eines Jahres in der „adhäsion“ veröffentlichten Marktübersichten helfen Anwendern verschiedenster Branchen, sich einen gezielten Überblick über Lieferantenangebote zu verschaffen.