UNIFORMIZATION FOR SEMI-MARKOV DECISION PROCESSES UNDER STATIONARY POLICIES.

Frederick J. Beutler, Keith Ross

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

    Abstract

    Summary form only given. Uniformization permits the replacement of a semi-Markov decision process (SMDP) by a Markov chain exhibiting the same average rewards for simple (nonrandomized) policies. In its standard form, however, uniformization is valid only for such simple policies. Here uniformization is generalized so that it also yields consistent results for stationary (randomized) policies. These results are applied to the constrained optimization of SMDPs, in which stationary (randomized) policies arise naturally.
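
As an illustration of the standard case mentioned in the abstract (a fixed simple policy), the following is a minimal sketch of textbook uniformization. It assumes exponentially distributed sojourn times, so that under the fixed policy the process is a continuous-time Markov chain. The function names, the two-state rates, and the reward rates are hypothetical and are not taken from the paper. The sketch does not attempt the generalization to randomized policies that the abstract describes.

```python
# Textbook uniformization for a fixed simple policy (illustrative sketch only).
# Assumption: sojourn times are exponential, so the controlled process is a
# continuous-time Markov chain described by transition rates.

def uniformize(rates, uniform_rate=None):
    """Return (P, uniform_rate) for the uniformized discrete-time chain.

    rates[i][j] is the transition rate from state i to state j (i != j);
    diagonal entries are ignored.
    """
    n = len(rates)
    exit_rates = [sum(rates[i][j] for j in range(n) if j != i) for i in range(n)]
    if uniform_rate is None:
        # Any value >= the largest exit rate works; doubling it adds
        # self-loops in every state, which keeps the chain aperiodic.
        uniform_rate = 2.0 * max(exit_rates)
    if uniform_rate < max(exit_rates):
        raise ValueError("uniform_rate must be at least the largest exit rate")
    P = [[rates[i][j] / uniform_rate if j != i
          else 1.0 - exit_rates[i] / uniform_rate
          for j in range(n)]
         for i in range(n)]
    return P, uniform_rate


def stationary_distribution(P, iterations=200):
    """Approximate the stationary distribution of P by power iteration."""
    n = len(P)
    dist = [1.0 / n] * n
    for _ in range(iterations):
        dist = [sum(dist[i] * P[i][j] for i in range(n)) for j in range(n)]
    return dist


# Hypothetical two-state example: rate 2 from state 0 to 1, rate 1 back.
rates = [[0.0, 2.0],
         [1.0, 0.0]]
reward_rates = [3.0, 0.0]  # reward earned per unit time in each state

P, uniform_rate = uniformize(rates)
dist = stationary_distribution(P)

# Each step of the uniformized chain lasts 1 / uniform_rate on average, so the
# per-step reward is reward_rate / uniform_rate; multiplying the per-step
# average by uniform_rate recovers the average reward per unit time.
gain = uniform_rate * sum(dist[i] * reward_rates[i] / uniform_rate
                          for i in range(len(dist)))
print(gain)  # approximately 1.0
```

For this two-state chain the continuous-time average reward can be computed directly as (1 * 3 + 2 * 0) / (2 + 1) = 1, which agrees with the value obtained from the uniformized chain.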

    Original language: English (US)
    Title of host publication: Unknown Host Publication Title
    Publisher: IEEE
    Pages: 86
    Number of pages: 1
    State: Published - 1986

    ASJC Scopus subject areas

    • General Engineering
