中央研究院統計科學研究所

演講公告

演講公告演講公告

:::

Multi-Armed Bandit with Covariates

2013-05-27 (Mon.), 10:00 AM
中研院-統計所 2F 交誼廳
茶會：上午9：40統計所二樓交誼廳
Prof. Yuhong Yang（楊宇泓　教授）
School of Statistics, Univ. of Minnesota, USA

Abstract

Multi-armed bandit problem is an important optimization game that requires an exploration-exploitation tradeoff to achieve optimal total reward. Motivated from industrial applications such as online advertising and clinical trial adaptive design, we consider a setting where the rewards of bandit machines are associated with covariates, and the accurate estimation of the corresponding mean reward functions plays an important role in the performance of the allocation rules. We establish strong consistency of nonparametric methods and derive their rates of convergence. In addition, model selection and combination results are presented as well. The work is joint with Wei Qian.

最後更新日期：2026-06-27 12:16

回列表頁