diff --git a/rl/extra_reading.txt b/rl/extra_reading.txt index 64dd9812..fac79d64 100644 --- a/rl/extra_reading.txt +++ b/rl/extra_reading.txt @@ -1,6 +1,9 @@ Finite-time Analysis of the Multiarmed Bandit Problem https://homes.di.unimi.it/cesa-bianchi/Pubblicazioni/ml-02.pdf +A Nice Lecture for Students Who Claim "RL Doesn't Use Math" +https://www.youtube.com/watch?v=dhEF5pfYmvc + Hacking Google reCAPTCHA v3 using Reinforcement Learning https://arxiv.org/pdf/1903.01003.pdf