\(\def\ < -{ \leftarrow }\) \(\def\- > { \rightarrow}\) \(\def\ > ={ \geq }\) \(\def\ < ={ \leq }\)

Intoduction

I have read the instructions on your website

I am very interested in your research on sample efficiency, complexity, and accuracy for RL algorithms under intersting conditions (like linearity of value function). I am also open and flexible to researching variety of theoretical RL topics which I may not be aware. In particular, I enjoyed reading through the paper "A Few Expert Queries Suffices for Sample-Efficient RL with Resets and Linear Value Approximation". It was exciting to see how iterative searching of linear model parameters based around upper bounds of Hoeffding and Eluder dimension, could produce an algorithm with the properties: iteration complexity is small, requires fewer expert queries, and is accurate if consistent. I appreciated the clear and detailed proofs and hope to follow in similar footsteps.

Paper

RESULTS

Concentration Inequalities

Optimism

Iteration Complexity

Other INFO