Skip to main content
Log in

Structural results for partially observed control models

  • Published:
Zeitschrift für Operations Research Aims and scope Submit manuscript

Abstract

A general partially observed control model with discrete time parameter is investigated. Our main interest concerns monotonicity results and bounds for the value functions and for optimal policies. In particular, we show how the value functions depend on the observation kernels and we present conditions for a lower bound of an optimal policy. Our approach is based on two multivariate stochastic orderings: theTP 2 ordering and the Blackwell ordering.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Albright SC (1979) Structural Results for Partially Observable Markovian Systems. Operat Res 27:1041–1053

    Google Scholar 

  • Bertsekas DP (1987) Dynamic Programming: Deterministic and Stochastic Models. Prentice Hall, Englewood Cliffs

    Google Scholar 

  • DeGroot MH (1970) Optimal Statistical Decisions. McGraw-Hill, New York

    Google Scholar 

  • Hinderer K (1970) Foundations of Non-Stationary Dynamic Programming with Discrete Time Parameter. Springer, Berlin

    Google Scholar 

  • Karlin S, Rinott Y (1980) Classes of Orderings of Measures and Related Correlation Inequalities. I. Multivariate Totally Positive Distributions. J Multivariate Analysis 10:467–498

    Google Scholar 

  • Lovejoy WS (1987) Some Monotonicity Results for Partially Observed Markov Decision Processes. Operat Res 35:736–743

    Google Scholar 

  • Monahan GE (1982) A Survey of Partially Observable Markov Decision Processes: Theory, Models, and Algorithms. Management Sci 28:1–16

    Google Scholar 

  • Nakai T (1985) The Problem of Optimal Stopping in a Partially Observable Markov Chain. J Optim Theory Appl 45:425–442

    Google Scholar 

  • Nakai T (1986) A Sequential Stochastic Assignment Problem in a Partially Observable Markov Chain. Math Oper Res 11:230–240

    Google Scholar 

  • Ohnishi M, Kawai H, Mine H (1986) An Optimal Inspection and Replacement Policy under Incomplete State Information. Eur J Oper Res 27:117–128

    Google Scholar 

  • Rieder U (1988) Bayessche Kontrollmodelle. Skript, Universität Ulm

  • Rieder U, Wagner H (1991) Structured Policies in the Sequential Design of Experiments. Ann Oper Res 32:165–188

    Google Scholar 

  • White CC (1976) Application of Two Inequality Results for Concave Functions to a Stochastic Optimization Problem. J Math Anal Appl 55:347–350

    Google Scholar 

  • White CC (1979) Optimal Control-limit Strategies for a Partially Observed Replacement Problem. Internat J System Sci 10:321–331

    Google Scholar 

  • White CC, Harrington D (1980) Application of Jensen's Inequality for Adaptive Suboptimal Design. J Optim Theory Appl 32:89–100

    Google Scholar 

  • Whitt W (1982) Multivariate Monotone Likelihood Ratio and Uniform Conditional Stochastic Order. J Appl Prob 19:695–701

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Additional information

Dedicated to Prof. Dr. K. Hinderer on the occassion of his 60th birthday

Rights and permissions

Reprints and permissions

About this article

Cite this article

Rieder, U. Structural results for partially observed control models. ZOR - Methods and Models of Operations Research 35, 473–490 (1991). https://doi.org/10.1007/BF01415990

Download citation

  • Received:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF01415990

Key words

Navigation