Reinforcement Learning examples

Epsilon greedy

epsilon_greedy.py

A simple example of the exploration-exploitation trade-off in a multi-armed bandit problem.

Credit: Steve Phelps

Results

Actions taken: (plot)

Rewards: (plot)
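For readers who want a feel for the epsilon-greedy idea named above without opening the script, here is a minimal sketch. It is not the contents of epsilon_greedy.py: the function name `run_epsilon_greedy`, the Gaussian reward model with unit variance, and the default values of `epsilon`, `steps`, and `seed` are all assumptions made for illustration. With probability `epsilon` the agent explores by pulling a random arm; otherwise it exploits by pulling the arm with the highest estimated mean reward.

```python
import random


def run_epsilon_greedy(true_means, epsilon=0.1, steps=1000, seed=0):
    """Play a multi-armed bandit with an epsilon-greedy policy.

    Hypothetical helper for illustration only. Rewards are assumed to be
    Gaussian with the given means and unit variance.
    Returns (actions, rewards, estimates).
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms        # number of pulls per arm
    estimates = [0.0] * n_arms   # running mean reward per arm
    actions, rewards = [], []

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: pick an arm uniformly at random.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm with the highest current estimate
            # (ties resolve to the lowest index).
            arm = max(range(n_arms), key=lambda a: estimates[a])

        reward = rng.gauss(true_means[arm], 1.0)

        # Incremental update of the sample mean for the chosen arm.
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]

        actions.append(arm)
        rewards.append(reward)

    return actions, rewards, estimates
```

Calling, for example, `run_epsilon_greedy([0.2, 0.5, 0.8])` returns the sequence of chosen arms and the rewards received, which could be plotted to give views comparable to the "Actions taken" and "Rewards" results. A larger `epsilon` spends more pulls on exploration; `epsilon=0` makes the agent purely greedy.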