Odalric-Ambrym Maillard.
PhD thesis, Université de Lille 1, October 2011.
[AFIA PhD Prize 2012]
[Download]
Abstract: |
This thesis studies the following topics in Machine Learning: Bandit theory, Statistical learning and Reinforcement learning. The common underlying thread is the non-asymptotic study of various notions of adaptation: to an environment or an opponent in part I about bandit theory, to the structure of a signal in part II about statistical theory, to the structure of states and rewards or to some state-model of the world in part III about reinforcement learning. |
You can dowload my Ph.D. manuscript from the University website (here).
Bibtex: |
@phdthesis{maillard2011apprentissage, title={APPRENTISSAGE S{\’E}QUENTIEL: Bandits, Statistique et Renforcement.}, author={Maillard, Odalric-Ambrym}, year={2011}, school={Universit{\’e} des Sciences et Technologie de Lille — Lille I} } |