Online Learning-1 | bandit
bandit凸优化,之后有空再写。
references
T. Lattimore and C. Szepesvári, Bandit Algorithms. Cambridge: Cambridge University Press, 2020.
Elad Hazan, Introduction to Online Convex Optimization , now, 2016, doi: 10.1561/2400000013.
bandit凸优化,之后有空再写。
T. Lattimore and C. Szepesvári, Bandit Algorithms. Cambridge: Cambridge University Press, 2020.
Elad Hazan, Introduction to Online Convex Optimization , now, 2016, doi: 10.1561/2400000013.