Institute of Statistical Science Academia Sinica

Seminars

Multi-Armed Bandit with Covariates

2013-05-27 (Mon.), 10:00 AM
Recreation Hall, 2F, Institute of Statistical Science
Prof. Yuhong Yang
School of Statistics, Univ. of Minnesota, USA

Abstract

The multi-armed bandit problem is an important optimization game that requires balancing exploration and exploitation to achieve the optimal total reward. Motivated by industrial applications such as online advertising and adaptive clinical trial design, we consider a setting where the rewards of the bandit machines are associated with covariates, and accurate estimation of the corresponding mean reward functions plays an important role in the performance of the allocation rules. We establish strong consistency of nonparametric methods and derive their rates of convergence. Model selection and combination results are also presented. This is joint work with Wei Qian.
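
To make the setting concrete, the following is a minimal sketch of a bandit with covariates, not the allocation rule or the estimators analyzed in the talk. It assumes a scalar covariate drawn uniformly from [0, 1], Gaussian reward noise, an epsilon-greedy allocation rule, and a histogram (binned average) estimate of each arm's mean reward function. The function name run_covariate_bandit and all of its parameters are hypothetical and chosen only for illustration.

```python
import random


def run_covariate_bandit(mean_fns, horizon=5000, n_bins=10, eps=0.1, seed=0):
    """Epsilon-greedy allocation with histogram (binned) reward estimates.

    mean_fns: list of functions, one per arm, mapping x in [0, 1] to a mean reward.
    Returns (total_reward, fraction of rounds in which the best arm was pulled).
    """
    rng = random.Random(seed)
    n_arms = len(mean_fns)
    # Per-arm, per-bin running sums and counts give a piecewise-constant
    # nonparametric estimate of each mean reward function.
    sums = [[0.0] * n_bins for _ in range(n_arms)]
    counts = [[0] * n_bins for _ in range(n_arms)]
    total_reward, best_pulls = 0.0, 0

    for _ in range(horizon):
        x = rng.random()                      # covariate observed before choosing
        b = min(int(x * n_bins), n_bins - 1)  # histogram bin containing x
        untried = [a for a in range(n_arms) if counts[a][b] == 0]
        if untried:
            arm = untried[0]                  # sample every arm once in every bin
        elif rng.random() < eps:
            arm = rng.randrange(n_arms)       # explore: pick an arm at random
        else:
            # exploit: pick the arm with the highest estimated mean in this bin
            arm = max(range(n_arms), key=lambda a: sums[a][b] / counts[a][b])

        # Assumed reward model: mean function plus Gaussian noise (sd 0.1).
        reward = mean_fns[arm](x) + rng.gauss(0.0, 0.1)
        sums[arm][b] += reward
        counts[arm][b] += 1
        total_reward += reward

        best = max(range(n_arms), key=lambda a: mean_fns[a](x))
        best_pulls += int(arm == best)

    return total_reward, best_pulls / horizon


if __name__ == "__main__":
    # Two hypothetical arms whose mean rewards cross at x = 0.5.
    arms = [lambda x: x, lambda x: 1.0 - x]
    total, frac_best = run_covariate_bandit(arms)
    print(f"total reward: {total:.1f}, best-arm fraction: {frac_best:.3f}")
```

In this sketch the better arm depends on the covariate, so an allocation rule that ignores the covariate cannot do well on both halves of the interval. The quality of the binned estimates near the crossing point is what drives the performance of the rule, which is the role the abstract assigns to estimation of the mean reward functions.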

Updated: 2025-07-16 06:15
