Abstract
A general partially observed control model with discrete time parameter is investigated. Our main interest concerns monotonicity results and bounds for the value functions and for optimal policies. In particular, we show how the value functions depend on the observation kernels and we present conditions for a lower bound of an optimal policy. Our approach is based on two multivariate stochastic orderings: theTP 2 ordering and the Blackwell ordering.
Similar content being viewed by others
References
Albright SC (1979) Structural Results for Partially Observable Markovian Systems. Operat Res 27:1041–1053
Bertsekas DP (1987) Dynamic Programming: Deterministic and Stochastic Models. Prentice Hall, Englewood Cliffs
DeGroot MH (1970) Optimal Statistical Decisions. McGraw-Hill, New York
Hinderer K (1970) Foundations of Non-Stationary Dynamic Programming with Discrete Time Parameter. Springer, Berlin
Karlin S, Rinott Y (1980) Classes of Orderings of Measures and Related Correlation Inequalities. I. Multivariate Totally Positive Distributions. J Multivariate Analysis 10:467–498
Lovejoy WS (1987) Some Monotonicity Results for Partially Observed Markov Decision Processes. Operat Res 35:736–743
Monahan GE (1982) A Survey of Partially Observable Markov Decision Processes: Theory, Models, and Algorithms. Management Sci 28:1–16
Nakai T (1985) The Problem of Optimal Stopping in a Partially Observable Markov Chain. J Optim Theory Appl 45:425–442
Nakai T (1986) A Sequential Stochastic Assignment Problem in a Partially Observable Markov Chain. Math Oper Res 11:230–240
Ohnishi M, Kawai H, Mine H (1986) An Optimal Inspection and Replacement Policy under Incomplete State Information. Eur J Oper Res 27:117–128
Rieder U (1988) Bayessche Kontrollmodelle. Skript, Universität Ulm
Rieder U, Wagner H (1991) Structured Policies in the Sequential Design of Experiments. Ann Oper Res 32:165–188
White CC (1976) Application of Two Inequality Results for Concave Functions to a Stochastic Optimization Problem. J Math Anal Appl 55:347–350
White CC (1979) Optimal Control-limit Strategies for a Partially Observed Replacement Problem. Internat J System Sci 10:321–331
White CC, Harrington D (1980) Application of Jensen's Inequality for Adaptive Suboptimal Design. J Optim Theory Appl 32:89–100
Whitt W (1982) Multivariate Monotone Likelihood Ratio and Uniform Conditional Stochastic Order. J Appl Prob 19:695–701
Author information
Authors and Affiliations
Additional information
Dedicated to Prof. Dr. K. Hinderer on the occassion of his 60th birthday
Rights and permissions
About this article
Cite this article
Rieder, U. Structural results for partially observed control models. ZOR - Methods and Models of Operations Research 35, 473–490 (1991). https://doi.org/10.1007/BF01415990
Received:
Issue Date:
DOI: https://doi.org/10.1007/BF01415990