Markov decision processes with a stopping time constraint

Horiguchi, Masayuki

doi:10.1007/PL00003996

Markov decision processes with a stopping time constraint

Published: June 2001

Volume 53, pages 279–295, (2001)
Cite this article

Mathematical Methods of Operations Research Aims and scope Submit manuscript

Masayuki Horiguchi¹

185 Accesses
15 Citations
Explore all metrics

Abstract.

In this paper, the optimization problem for a stopped Markov decision process with finite states and actions is considered over stopping times τ constrained so that ?τ≦α for some fixed α>0. The problem is solved through randomization of stopping times and mathematical programming formulation by occupation measures. Another representation, called F-representation, of randomized stopping times is given, by which the concept of Markov or stationary randomized stopping times is introduced. We treat two types of occupation measures, running and stopped, but stopped occupation measure is shown to be expressed by running one. We study the properties of the set of running occupation measures achieved by different classes of pairs of policies and randomized stopping times. Analyzing the equivalent mathematical programming problem formulated by running occupation measures corresponding with stationary policies and stationary randomized stopping times, we prove the existence of an optimal constrained pair of stationary policy and stopping time requiring randomization in at most one state.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Author information

Authors and Affiliations

Division of Mathematical Sciences and Physics, Graduate School of Science and Technology, Chiba University, 33 Yayoi-cho 1-chome, Inage-ku, Chiba 263-8522, Japan (e-mail: horiguti@math.e.chiba-u.ac.jp), , , , , , JP
Masayuki Horiguchi

Authors

Masayuki Horiguchi
View author publications
You can also search for this author in PubMed Google Scholar

Additional information

Manuscript received: September 2000/Final version received: November 2000

Rights and permissions

Reprints and permissions

About this article

Cite this article

Horiguchi, M. Markov decision processes with a stopping time constraint. Mathematical Methods of OR 53, 279–295 (2001). https://doi.org/10.1007/PL00003996

Download citation

Issue Date: June 2001
DOI: https://doi.org/10.1007/PL00003996

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Markov decision processes with a stopping time constraint

Abstract.

Access this article

Similar content being viewed by others

Optimal Stopping Time on Semi-Markov Processes with Finite Horizon

Control-limit policies for a class of stopping time problems with termination restrictions

Average cost criterion induced by the regular utility function for continuous-time Markov decision processes

Author information

Authors and Affiliations

Additional information

Rights and permissions

About this article

Cite this article

Navigation

Markov decision processes with a stopping time constraint

Abstract.

Access this article

Similar content being viewed by others

Optimal Stopping Time on Semi-Markov Processes with Finite Horizon

Control-limit policies for a class of stopping time problems with termination restrictions

Average cost criterion induced by the regular utility function for continuous-time Markov decision processes

Author information

Authors and Affiliations

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation