TY - GEN
T1 - Maximum Likelihood Estimation of information measures
AU - Jiao, Jiantao
AU - Venkat, Kartik
AU - Han, Yanjun
AU - Weissman, Tsachy
N1 - Publisher Copyright:
© 2015 IEEE.
PY - 2015/9/28
Y1 - 2015/9/28
N2 - The Maximum Likelihood Estimator (MLE) is widely used in estimating information measures, and involves 'plugging-in' the empirical distribution of the data to estimate a given functional of the unknown distribution. In this work we propose a general framework and procedure to analyze the nonasymptotic performance of the MLE in estimating functionals of discrete distributions, under the worst-case mean squared error criterion. We show that existing theory is insufficient for analyzing the bias of the MLE, and propose to apply the theory of approximation using positive linear operators to study this bias. The variance is controlled using the well-known tools from the literature on concentration inequalities. Our techniques completely characterize the maximum L2 risk incurred by the MLE in estimating the Shannon entropy H(P) = σi=1S -piln pi, and Fα(P) = σi=1Spiα up to a multiplicative constant. As a corollary, for Shannon entropy estimation, we show that it is necessary and sufficient to have n ≪ S observations for the MLE to be consistent, where S represents the support size. In addition, we obtain that it is necessary and sufficient to consider n ≪ S1/α samples for the MLE to consistently estimate Fα(P); 0 <α < 1. The minimax rate-optimal estimators for both problems require S/ln S and S1/α / ln S samples, which implies that the MLE is strictly sub-optimal. When 1 < α < 3/2, we show that the maximum L2 rate of convergence for the MLE is n-2(α-1) for infinite support size, while the minimax L2 rate is (n ln n)-2(α-1). When α ≥ 3/2, the MLE achieves the minimax optimal L2 convergence rate n-1 regardless of the support size.
AB - The Maximum Likelihood Estimator (MLE) is widely used in estimating information measures, and involves 'plugging-in' the empirical distribution of the data to estimate a given functional of the unknown distribution. In this work we propose a general framework and procedure to analyze the nonasymptotic performance of the MLE in estimating functionals of discrete distributions, under the worst-case mean squared error criterion. We show that existing theory is insufficient for analyzing the bias of the MLE, and propose to apply the theory of approximation using positive linear operators to study this bias. The variance is controlled using the well-known tools from the literature on concentration inequalities. Our techniques completely characterize the maximum L2 risk incurred by the MLE in estimating the Shannon entropy H(P) = σi=1S -piln pi, and Fα(P) = σi=1Spiα up to a multiplicative constant. As a corollary, for Shannon entropy estimation, we show that it is necessary and sufficient to have n ≪ S observations for the MLE to be consistent, where S represents the support size. In addition, we obtain that it is necessary and sufficient to consider n ≪ S1/α samples for the MLE to consistently estimate Fα(P); 0 <α < 1. The minimax rate-optimal estimators for both problems require S/ln S and S1/α / ln S samples, which implies that the MLE is strictly sub-optimal. When 1 < α < 3/2, we show that the maximum L2 rate of convergence for the MLE is n-2(α-1) for infinite support size, while the minimax L2 rate is (n ln n)-2(α-1). When α ≥ 3/2, the MLE achieves the minimax optimal L2 convergence rate n-1 regardless of the support size.
UR - http://www.scopus.com/inward/record.url?scp=84969850826&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84969850826&partnerID=8YFLogxK
U2 - 10.1109/ISIT.2015.7282573
DO - 10.1109/ISIT.2015.7282573
M3 - Conference contribution
AN - SCOPUS:84969850826
T3 - IEEE International Symposium on Information Theory - Proceedings
SP - 839
EP - 843
BT - Proceedings - 2015 IEEE International Symposium on Information Theory, ISIT 2015
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - IEEE International Symposium on Information Theory, ISIT 2015
Y2 - 14 June 2015 through 19 June 2015
ER -