Minimax rate-optimal estimation of KL divergence between discrete distributions

Yanjun Han, Jiantao Jiao, Tsachy Weissman

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We refine the general methodology in [1] for the construction and analysis of essentially minimax estimators for a wide class of functionals of finite dimensional parameters, and elaborate on the case of discrete distributions with support size S comparable with the number of observations n. Specifically, we determine the 'smooth' and 'non-smooth' regimes based on the confidence set and the smoothness of the functional. In the 'non-smooth' regime, we apply an unbiased estimator for a 'suitable' polynomial approximation of the functional. In the 'smooth' regime, we construct a bias corrected version of the Maximum Likelihood Estimator (MLE) based on Taylor expansion. We apply the general methodology to the problem of estimating the KL divergence between two discrete distributions from empirical data. We construct a minimax rate-optimal estimator which is adaptive in the sense that it does not require the knowledge of the support size nor the upper bound on the likelihood ratio. Moreover, the performance of the optimal estimator with n samples is essentially that of the MLE with n ln n samples, i.e., the effective sample size enlargement phenomenon holds.

Original languageEnglish (US)
Title of host publicationProceedings of 2016 International Symposium on Information Theory and Its Applications, ISITA 2016
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages256-260
Number of pages5
ISBN (Electronic)9784885523090
StatePublished - Feb 2 2017
Event3rd International Symposium on Information Theory and Its Applications, ISITA 2016 - Monterey, United States
Duration: Oct 30 2016Nov 2 2016

Publication series

NameProceedings of 2016 International Symposium on Information Theory and Its Applications, ISITA 2016

Conference

Conference3rd International Symposium on Information Theory and Its Applications, ISITA 2016
Country/TerritoryUnited States
CityMonterey
Period10/30/1611/2/16

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Hardware and Architecture
  • Information Systems
  • Signal Processing
  • Library and Information Sciences

Fingerprint

Dive into the research topics of 'Minimax rate-optimal estimation of KL divergence between discrete distributions'. Together they form a unique fingerprint.

Cite this