Skip to main content
Log in

Optimal control of Markovian jump processes with partial information and applications to a parallel queueing model

  • Original Article
  • Published:
Mathematical Methods of Operations Research Aims and scope Submit manuscript

Abstract

We consider a stochastic control problem over an infinite horizon where the state process is influenced by an unobservable environment process. In particular, the Hidden-Markov-model and the Bayesian model are included. This model under partial information is transformed into an equivalent one with complete information by using the well-known filter technique. In particular, the optimal controls and the value functions of the original and the transformed problem are the same. An explicit representation of the filter process which is a piecewise-deterministic process, is also given. Then we propose two solution techniques for the transformed model. First, a generalized verification technique (with a generalized Hamilton–Jacobi–Bellman equation) is formulated where the strict differentiability of the value function is weaken to local Lipschitz continuity. Second, we present a discrete-time Markovian decision model by which we are able to compute an optimal control of our given problem. In this context we are also able to state a general existence result for optimal controls. The power of both solution techniques is finally demonstrated for a parallel queueing model with unknown service rates. In particular, the filter process is discussed in detail, the value function is explicitly computed and the optimal control is completely characterized in the symmetric case.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Altman E, Jimenez T, Nunez-Queija R, Yechiali U (2003) Optimal routing among ./M/1 queues with partial information. INRIA Research Report No. 4985

  • Asmussen S (2003) Applied probabilities and queues. Springer, Berlin

    Google Scholar 

  • Bäuerle N, Rieder U (2007) Portfolio optimization with jumps and unobservable intensity process. Math Finance 17(2): 205–224

    Article  MATH  MathSciNet  Google Scholar 

  • Bensoussan A, Cakanyildirim M, Sethi S (2003) Partially observed inventory systems. In: Proceedings of the 44th IEEE conference on decision and control, pp 1023–1028

  • Bertsekas D, Shreve S (1978) Stochastic optimal control: the discrete time case. Academic Press, Dublin

    MATH  Google Scholar 

  • Brémaud P (1981) Point processes and queues. Springer, Berlin

    MATH  Google Scholar 

  • Clarke F (1983) Optimization and nonsmooth analysis. Wiley, New York

    MATH  Google Scholar 

  • Davis D (1993) Markov models and optimization. Chapman & Hall, London

    MATH  Google Scholar 

  • Donchev D (1998) On the two-armed bandit problem with non-observed Poissonian switching of arms. Math Methods Oper Res 47: 401–422

    Article  MATH  MathSciNet  Google Scholar 

  • Donchev D (1999) Exact solution of the Bellman equation for a β-discounted reward in a two-armed bandit with switching arms. J Appl Math Stoch Anal 12(2): 151–160

    Article  MATH  MathSciNet  Google Scholar 

  • Elliott R, Aggoun R, Moore J (1997) Hidden Markov models: estimation and control. Springer, Berlin

    Google Scholar 

  • Honhon D, Seshadri S (2007) Admission control with incomplete information to a finite buffer queue. Probab Eng Inform Sci 21(1): 19–46

    MATH  MathSciNet  Google Scholar 

  • Lin K, Ross S (2003) Admission control with incomplete information of a queueing system. INFORMS Oper Res 51: 645–654

    Article  MATH  MathSciNet  Google Scholar 

  • Liptser R, Shiryayev A (2004) Statistics of random processes. Springer, Berlin

    Google Scholar 

  • Rogers L, Williams D (2003) Diffusions, Markov processes and martingales. Cambridge University Press, Cambridge

    Google Scholar 

  • Winter J (2008) Optimal control of Markovian jump processes with different information structures. PhD-Thesis, Universität Ulm

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ulrich Rieder.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Rieder, U., Winter, J. Optimal control of Markovian jump processes with partial information and applications to a parallel queueing model. Math Meth Oper Res 70, 567–596 (2009). https://doi.org/10.1007/s00186-009-0284-7

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00186-009-0284-7

Keywords

Navigation