Online Learning-1 | bandit

bandit凸优化,之后有空再写。

references

CSE599s: Online Learning

T. Lattimore and C. Szepesvári, Bandit Algorithms. Cambridge: Cambridge University Press, 2020.

Elad Hazan, Introduction to Online Convex Optimization , now, 2016, doi: 10.1561/2400000013.

posted @ 2025-07-10 20:22  Theophania  阅读(4)  评论(0)    收藏  举报