Abstract:
For semi-Markov decision models, we study stationary strategies that, for some final reward, are optimal over any time interval. These strategies are then applied to maximizing the average reward per unit time over an infinite horizon.