Markov Decision Process: an agent has to navigate a grid in an optimal way (i.e., maximising rewards while minimising penalties)

gmarsich/Gridworld

Gridworld: value iteration VS linear programming

Main files:

  • Part 1 - Value Iteration/Part 1 - Value Iteration.ipynb: notebook with the discussion and results for the value iteration (VI) method
  • Part 2 - Linear Programmin/Part 2 - Linear Programming.ipynb: notebook with the discussion and results for optimisation and the linear programming (LP) method, plus a comparison between VI and LP
  • RL_project.pptx: PowerPoint presentation of the project
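
As a rough illustration of the value iteration method discussed in Part 1, here is a minimal sketch on a toy grid. It is not taken from the notebooks: the 3x4 grid, the terminal cells, the reward values, the discount factor and the deterministic moves are all assumptions chosen for the example, and the names `step` and `value_iteration` are hypothetical.

```python
# Hypothetical toy setup: none of these values come from the repository.
ROWS, COLS = 3, 4
GOAL, TRAP = (0, 3), (1, 3)            # assumed terminal cells
STEP_REWARD, GOAL_REWARD, TRAP_REWARD = -0.04, 1.0, -1.0
GAMMA = 0.9                            # assumed discount factor
ACTIONS = {"up": (-1, 0), "down": (1, 0), "left": (0, -1), "right": (0, 1)}


def step(state, action):
    """Deterministic move (an assumption); hitting a wall leaves the agent in place."""
    r, c = state
    dr, dc = ACTIONS[action]
    nr, nc = r + dr, c + dc
    if not (0 <= nr < ROWS and 0 <= nc < COLS):
        nr, nc = r, c
    nxt = (nr, nc)
    if nxt == GOAL:
        return nxt, GOAL_REWARD
    if nxt == TRAP:
        return nxt, TRAP_REWARD
    return nxt, STEP_REWARD


def value_iteration(tol=1e-8):
    """Apply the Bellman optimality update until the largest change is below tol."""
    states = [(r, c) for r in range(ROWS) for c in range(COLS)]
    V = {s: 0.0 for s in states}       # terminal cells keep value 0
    while True:
        delta = 0.0
        for s in states:
            if s in (GOAL, TRAP):
                continue
            best = max(
                reward + GAMMA * V[nxt]
                for nxt, reward in (step(s, a) for a in ACTIONS)
            )
            delta = max(delta, abs(best - V[s]))
            V[s] = best
        if delta < tol:
            break

    # Greedy policy with respect to the converged values.
    policy = {}
    for s in states:
        if s in (GOAL, TRAP):
            continue
        policy[s] = max(
            ACTIONS, key=lambda a: step(s, a)[1] + GAMMA * V[step(s, a)[0]]
        )
    return V, policy


if __name__ == "__main__":
    V, policy = value_iteration()
    for r in range(ROWS):
        print(" ".join(f"{V[(r, c)]:6.3f}" for c in range(COLS)))
    print(policy)
```

For comparison, the textbook LP formulation of the same problem minimises the sum of V(s) over all states, subject to V(s) ≥ r(s, a) + γ V(s') for every state-action pair; whether Part 2 uses exactly this form is not stated here.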
