Abstract
The point of view is adopted that there are many natural situations in which the controller is interested in finding a policy that will achieve a sufficiently high long-run average reward, that is, a target level with a sufficiently high probability, that is, a percentile. The key conceptual difference between this approach and the classical problem is that the present controller is not searching for an optimal policy but rather for a policy that is 'good enough,' knowing that such a policy will typically fail to exist if the target level and the percentile are set too high. Conceptually, the approach is somewhat analogous to that often adopted by statisticians in testing of hypotheses where it is desirable (but usually not possible) to minimize simultaneously the so-called 'type 1' and the 'type 2' errors.
Original language | English (US) |
---|---|
Pages (from-to) | 1273-1276 |
Number of pages | 4 |
Journal | Proceedings of the IEEE Conference on Decision and Control |
Volume | 2 |
State | Published - 1989 |
Event | Proceedings of the 28th IEEE Conference on Decision and Control. Part 2 (of 3) - Tampa, FL, USA Duration: Dec 13 1989 → Dec 15 1989 |
ASJC Scopus subject areas
- Control and Systems Engineering
- Modeling and Simulation
- Control and Optimization