nach oben

Erschienen in:

2024 | OriginalPaper | Buchkapitel

2. Multiobjective Control

verfasst von : Julio B. Clempner, Alexander Poznyak

Erschienen in: Optimization and Games for Controllable Markov Chains

Verlag: Springer Nature Switzerland

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

A multi-objective Pareto front solution is presented in this chapter for a particular type of discrete-time ergodic controllable Markov chains. We offer a technique that, given specific boundaries, chooses the best multi-objective option for the Pareto frontier as a decision support system. We only consider a class of finite, ergodic, and controllable Markov chains while addressing this issue. The regularized penalty method utilizes a projection-gradient strategy to identify the Pareto policies along the Pareto frontier and is based on Tikhonov’s regularization method. The goal is to make the parameters as efficient as possible while still maintaining the original form of the functional. After setting the initial value, we gradually reduce it until each policy closely resembles the Pareto policy. In this sense, we specify the precise direction of the parameter tendencies toward zero and establish the convergence of the gradient regularized penalty algorithm. The matching picture in the objective space receives a Pareto frontier of only Pareto policies thanks to our policy-gradient multi-objective algorithms, which, on the other hand, use a gradient-based strategy. In order to improve security when transporting cash and valuables, we empirically validate the technique by providing a numerical example of a genuine alternative solution to the vehicle routing planning problem. In addition, we describe a portfolio optimization and represent the Pareto frontier. The decision-making techniques investigated in this paper are consistent with the most widely used computational intelligent models in the Artificial Intelligence research field.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Vorheriges Kapitel Controllable Markov Chains

Nächstes Kapitel Partially Observable Markov Chains

By far most of the methods for the computation of single Pareto points or the entire Pareto set are based on a “scalarization” of the MOP (see e.g. [9, 10, 27, 29]).

Beltrami, E., Katehakis, M., Durinovic, S.: Multiobjective markov decisions in urban modelling. Math. Model. 6(4), 333–338 (1995)CrossRefMATH

Benson, H.P., et al.: Matthias ehrgott, multicriteria optimization. Springer (2005) ISBN 3-540-21398-8. 323 p. Eur. J. Oper. Res. 176(3), 1961–1964 (2007)

Clempner, J.B.: Necessary and sufficient karush-kuhn-tucker conditions for multiobjective markov chains optimality. Automatica 71, 135–142 (2016)MathSciNetCrossRefMATH

Clempner, J.B.: Computing multiobjective markov chains handled by the extraproximal method. Ann. Oper. Res. 271, 469–486 (2018)MathSciNetCrossRefMATH

Clempner, J.B.: A team formation method based on a markov chains games approach. Cybern. Syst. 50(5), 417–443 (2019)CrossRef

Clempner, J.B., Poznyak, A.S.: Using the manhattan distance for computing the multiobjective markov chains problem. Int. J. Comput. Math. 95(11), 2269–2286 (2017)MathSciNetCrossRefMATH

Clempner, J.B., Poznyak, A.S.: A tikhonov regularization parameter approach for solving lagrange constrained optimization problems. Eng. Optim. 50(11), 1996–2012 (2018)MathSciNetCrossRefMATH

Clempner, J.B., Poznyak, A.S.: A tikhonov regularized penalty function approach for solving polylinear programming problems. J. Comput. Appl. Math. 328, 267–286 (2018)MathSciNetCrossRefMATH

Das, I., Dennis, J.E.: Normal-boundary intersection: an alternate approach for generating pareto-optimal points in multicriteria optimization problems. SIAM J. Optim. 8, 631–657 (1998)MathSciNetCrossRefMATH

10.

Fliege, J., Heseler, A.: Constructing approximations to the efficient set of convex quadratic multi-objective problems. University of Dortmund, Germany, Technical report (2003)

11.

Garcia, C.B., Zangwill, W.I.: Pathways to Solutions, Fixed Points and Equilibria. Prentice-Hall, Englewood Cliffs (1981)

12.

Garcia-Galicia, M., Carsteanu, A.A., Clempner, J.: Continuous-time learning method for customer portfolio with time penalization. Expert Syst. Appl. 129, 27–36 (2019)CrossRef

13.

Garcia-Galicia, M., Carsteanu, A.A., Clempner, J.: Continuous-time mean variance portfolio with transaction costs: a proximal approach involving time penalization. Int. J. Gen Syst 48(2), 91–111 (2019)MathSciNetCrossRef

14.

Germeyer, Y.: Introduction to the Theory of Operations Research. Nauka, Moscow (1971)

15.

Kiefer, J., Wolfowitz, J.: Stochastic estimation of the maximum of a regression function. Ann. Math. Stat. 23(2), 462–466 (1952)

16.

Markowitz, H.: Portfolio selection. J. Finance 7, 77–98 (1952)

17.

Markowitz, H.: The optimization of a quadratic function subject to linear constraints. Nav. Res. Logist. Q. 3, 111–133 (1956)MathSciNetCrossRef

18.

Markowitz, H.M.: Mean-variance analysis. In: Finance, pp. 194–198. Springer (1989)

19.

Miettinen, K.: Nonlinear multiobjective optimization, vol. 12. Springer Science & Business Media (2012)

20.

Novák, J.: Linear programming in tector criterion markov and semi-markov decision processes. Optim. 20(5), 651–670 (1989)CrossRefMATH

21.

Ortiz-Cerezo, L., Carsteanu, A., Clempner, J.B.: Optimal constrained portfolio analysis for incomplete information and transaction costs. Econ. Comput. Econ. Cybern. Stud. Res. 4(56), 107–121 (2022)

22.

Ortiz-Cerezo, L., Carsteanu, A., Clempner, J.B.: Sharpe-ratio portfolio in controllable markov chains: analytic and algorithmic approach for second order cone programming. Mathematics 10(18), 3221 (2022)CrossRef

23.

Poznyak, A.S.: Advanced Mathematical Tools for Automatic Control Engineers. Deterministic Technique, vol. 1. Elsevier, Amsterdam, Oxford (2008)

24.

Poznyak, A.S., Najim, K., Gómez-Ramírez, E.: Self-learning Control of Finite Markov Chains. Marcel Dekker, Inc. (2000)

25.

Sánchez, E.M., Clempner, J.B., Poznyak, A.S.: A priori-knowledge/actor-critic reinforcement learning architecture for computing the mean-variance customer portfolio: the case of bank marketing campaigns. Eng. Appl. Artif. Intell. 46, Part A, 82–92 (2015)

26.

Sánchez, E.M., Clempner, J.B., Poznyak, A.S.: Solving the mean-variance customer portfolio in markov chains using iterated quadratic/lagrange programming: a credit-card customer-credit limits approach. Expert Syst. Appl. 42(12), 5315–5327 (2015)CrossRef

27.

Schittkowski, K.: Easy-opt: an interactive optimization system with automatic differentiation - user’s guide. Department of Mathematics, University of Bayreuth, Technical report (1999)

28.

Sen, C.: A new approach for multi-objective rural development planning. Indian Econ. J. 30(4), 91–96 (1983)

29.

Steuer, R.E.: The Tchebycheff procedure of interactive multiple objective programming. In: Multiple Criteria Decision Making and Risk Analysis Using Microcomputers, pp. 235–249. Springer, Berlin (1989)

30.

Tikhonov, A., Goncharsky, A., Stepanov, V., Yagola, A.G.: Numerical Methods for the Solution of Ill-Posed Problems. Kluwer Academic Publishers (1995)

31.

Tikhonov, A.N., Arsenin, V.Y.: Solution of Ill-posed Problems. Winston & Sons, Washington (1977)MATH

32.

Vazquez, E., Clempner, J.B.: Customer portfolio model driven by continuous-time markov chains: an l2 lagrangian regularization method. Econ. Comput. Econ. Cybern. Stud. Res. 2, 23–40 (2020)

33.

Wang, Y.M.: On lexicographic goal programming method for generating weights from inconsistent interval comparison matrices. Appl. Math. Comput. 173(2), 985–991 (2006)MathSciNetMATH

34.

Wierzbicki, A.P.: A mathematical basis for satisficing decision making. Math. Model. 3(5), 391–405 (1982)MathSciNetCrossRefMATH

35.

Zangwill, W.I.: Nonlinear Programming: A Unified Approach. Prentice-Halt, Englewood Cliffs (1969)MATH

36.

Zhang, R., Golovin, D.: Random hypervolume scalarizations for provable multi-objective black box optimization. In: International Conference on Machine Learning, pp. 11096–11105. PMLR (2020)

Titel: Multiobjective Control
verfasst von: Julio B. Clempner
Alexander Poznyak
Verlag: Springer Nature Switzerland
Buch: Optimization and Games for Controllable Markov Chains
Print ISBN: 978-3-031-43574-4

Electronic ISBN: 978-3-031-43575-1

Copyright-Jahr: 2024
DOI: https://doi.org/10.1007/978-3-031-43575-1_2

Premium Partner

Marktübersichten

Die im Laufe eines Jahres in der „adhäsion“ veröffentlichten Marktübersichten helfen Anwendern verschiedenster Branchen, sich einen gezielten Überblick über Lieferantenangebote zu verschaffen.

Zur Marktübersicht