Reinforcement Learning examples

Epsilon greedy

epsilon_greedy.py

A simple example of the exploration-exploitation trade-off in a multi-armed bandit problem.

Credit: Steve Phelps
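
For readers new to the idea, here is a minimal sketch of epsilon-greedy action selection on a bandit. It is not the contents of epsilon_greedy.py; the function name `run_epsilon_greedy`, the Gaussian reward model with unit variance, and the default parameters are all assumptions made for illustration. With probability epsilon the agent pulls a random arm (exploration), and otherwise it pulls the arm with the highest estimated mean reward so far (exploitation).

```python
import random


def run_epsilon_greedy(true_means, epsilon=0.1, steps=1000, seed=0):
    """Hypothetical helper: play a Gaussian bandit with epsilon-greedy."""
    rng = random.Random(seed)
    n_arms = len(true_means)
    estimates = [0.0] * n_arms  # running mean reward per arm
    counts = [0] * n_arms       # number of pulls per arm
    actions, rewards = [], []

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: pick any arm uniformly at random.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm with the best current estimate.
            arm = max(range(n_arms), key=lambda a: estimates[a])

        # Assumed reward model: Gaussian noise around the arm's true mean.
        reward = rng.gauss(true_means[arm], 1.0)

        # Incremental update of the sample mean for the chosen arm.
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]

        actions.append(arm)
        rewards.append(reward)

    return actions, rewards, estimates, counts


if __name__ == "__main__":
    actions, rewards, estimates, counts = run_epsilon_greedy([0.2, 0.5, 0.9])
    print("pulls per arm:", counts)
    print("estimated means:", [round(e, 2) for e in estimates])
```

The returned `actions` and `rewards` lists are the kind of per-step series one would plot to see how often each arm is chosen and how the reward evolves. Setting `epsilon=0` makes the agent purely greedy, while `epsilon=1` makes it purely random.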

Results

Actions taken:

Rewards: