Some theoretical results on Thompson sampling for Multi-armed Bandits

Apr 19 (Wednesday) at 2pm GHC-8102

Speaker: Kirthevasan Kandasamy

Abstract: I will discuss some recent theoretical results on Thompson sampling for Multi-armed Bandits. The discussion will be based on the following line of work,

  1. http://www.jmlr.org/papers/volume17/14-087/14-087.pdf
  2. http://djrusso.github.io/docs/Learning_to_Optimize.pdf
  3. https://arxiv.org/abs/1403.5556