TY - GEN
T1 - How can ignorant but patient cognitive terminals learn their strategy and utility?
AU - Perlaza, S. M.
AU - Tembine, H.
AU - Lasaulce, S.
PY - 2010
Y1 - 2010
N2 - This paper aims to contribute to bridge the gap between existing theoretical results in distributed radio resource allocation policies based on equilibria in games (assuming complete information and rational players) and practical design of signal processing algorithms for self-configuring wireless networks. For this purpose, the framework of learning theory m games is exploited. Here, a new learning algorithm based on mild information assumptions at the transmitters is presented. This algorithm possesses attractive convergence properties not available for standard reinforcement learning algorithms and in addition, it allows each transmitter to learn both its optimal strategy and the values of its expected utility for all its actions. A detailed convergence analysis is conducted. In particular, a framework for studying heterogeneous wireless networks where transmitters do not learn at the same rate is provided. The proposed algorithm, which can be applied to any wireless network verifying the information assumptions stated, is applied to the case of multiple access channels in order to provide some numerical results.
AB - This paper aims to contribute to bridge the gap between existing theoretical results in distributed radio resource allocation policies based on equilibria in games (assuming complete information and rational players) and practical design of signal processing algorithms for self-configuring wireless networks. For this purpose, the framework of learning theory m games is exploited. Here, a new learning algorithm based on mild information assumptions at the transmitters is presented. This algorithm possesses attractive convergence properties not available for standard reinforcement learning algorithms and in addition, it allows each transmitter to learn both its optimal strategy and the values of its expected utility for all its actions. A detailed convergence analysis is conducted. In particular, a framework for studying heterogeneous wireless networks where transmitters do not learn at the same rate is provided. The proposed algorithm, which can be applied to any wireless network verifying the information assumptions stated, is applied to the case of multiple access channels in order to provide some numerical results.
UR - http://www.scopus.com/inward/record.url?scp=78751491532&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=78751491532&partnerID=8YFLogxK
U2 - 10.1109/SPAWC.2010.5670983
DO - 10.1109/SPAWC.2010.5670983
M3 - Conference contribution
AN - SCOPUS:78751491532
SN - 9781424469901
T3 - IEEE Workshop on Signal Processing Advances in Wireless Communications, SPAWC
BT - 2010 IEEE 11th International Workshop on Signal Processing Advances in Wireless Communications, SPAWC 2010
T2 - 2010 IEEE 11th International Workshop on Signal Processing Advances in Wireless Communications, SPAWC 2010
Y2 - 20 June 2010 through 23 June 2010
ER -