contextual::cmabs | Demo: Basic Synthetic cMAB Policies | HTML | source | R code | |
contextual::cmabsoffline | Demo: Offline cMAB LinUCB evaluation | HTML | source | R code | |
contextual::eckles_kaptein | Demo: MAB Replication Eckles & Kaptein (Bootstrap Thompson Sampling) | HTML | source | R code | |
contextual::epsilongreedy | Demo: Basic Epsilon Greed | HTML | source | R code | |
contextual::introduction | Getting started: running simulations | HTML | source | R code | |
contextual::mabs | Demo: MAB Policies Comparison | HTML | source | R code | |
contextual::ml10m | Demo: MovieLens 10M Dataset | HTML | source | R code | |
contextual::offline_depaul_movies | Demo: Offline cMAB: CarsKit DePaul Movie Dataset | HTML | source | R code | |
contextual::replication | Offline evaluation: Replication of Li et al 2010 | HTML | source | R code | |
contextual::simpsons | Demo: Bandits, Propensity Weighting & Simpson's Paradox in R | HTML | source | R code | |
contextual::sutton_barto | Demo: Replication Sutton & Barto, Reinforcement Learning: An Introduction, Chapter 2 | HTML | source | R code | |
contextual::website_optimization | Demo: Replication of John Myles White, Bandit Algorithms for Website Optimization | HTML | source | R code |