TY - JOUR
T1 - Assouad, Fano, and Le Cam with Interaction
T2 - 38th Conference on Neural Information Processing Systems, NeurIPS 2024
AU - Chen, Fan
AU - Foster, Dylan J.
AU - Han, Yanjun
AU - Qian, Jian
AU - Rakhlin, Alexander
AU - Xu, Yunbei
N1 - Publisher Copyright:
© 2024 Neural information processing systems foundation. All rights reserved.
PY - 2024
Y1 - 2024
N2 - We develop a unifying framework for information-theoretic lower bound in statistical estimation and interactive decision making. Classical lower bound techniques-such as Fano's method, Le Cam's method, and Assouad's lemma-are central to the study of minimax risk in statistical estimation, yet are insufficient to provide tight lower bounds for interactive decision making algorithms that collect data interactively (e.g., algorithms for bandits and reinforcement learning). Recent work of Foster et al. [40, 42] provides minimax lower bounds for interactive decision making using seemingly different analysis techniques from the classical methods. These results-which are proven using a complexity measure known as the Decision-Estimation Coefficient (DEC)-capture difficulties unique to interactive learning, yet do not recover the tightest known lower bounds for passive estimation. We propose a unified view of these distinct methodologies through a new lower bound approach called interactive Fano method. As an application, we introduce a novel complexity measure, the Fractional Covering Number, which facilitates the new lower bounds for interactive decision making that extend the DEC methodology by incorporating the complexity of estimation. Using the fractional covering number, we (i) provide a unified characterization of learnability for any stochastic bandit problem, (ii) close the remaining gap between the upper and lower bounds in Foster et al. [40, 42] (up to polynomial factors) for any interactive decision making problem in which the underlying model class is convex.
AB - We develop a unifying framework for information-theoretic lower bound in statistical estimation and interactive decision making. Classical lower bound techniques-such as Fano's method, Le Cam's method, and Assouad's lemma-are central to the study of minimax risk in statistical estimation, yet are insufficient to provide tight lower bounds for interactive decision making algorithms that collect data interactively (e.g., algorithms for bandits and reinforcement learning). Recent work of Foster et al. [40, 42] provides minimax lower bounds for interactive decision making using seemingly different analysis techniques from the classical methods. These results-which are proven using a complexity measure known as the Decision-Estimation Coefficient (DEC)-capture difficulties unique to interactive learning, yet do not recover the tightest known lower bounds for passive estimation. We propose a unified view of these distinct methodologies through a new lower bound approach called interactive Fano method. As an application, we introduce a novel complexity measure, the Fractional Covering Number, which facilitates the new lower bounds for interactive decision making that extend the DEC methodology by incorporating the complexity of estimation. Using the fractional covering number, we (i) provide a unified characterization of learnability for any stochastic bandit problem, (ii) close the remaining gap between the upper and lower bounds in Foster et al. [40, 42] (up to polynomial factors) for any interactive decision making problem in which the underlying model class is convex.
UR - http://www.scopus.com/inward/record.url?scp=105000504148&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=105000504148&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:105000504148
SN - 1049-5258
VL - 37
JO - Advances in Neural Information Processing Systems
JF - Advances in Neural Information Processing Systems
Y2 - 9 December 2024 through 15 December 2024
ER -