Deep Learning Spring 2024 - Final Project

Commands for creating submission

python3 -m grader state_agent -v
python3 bundle.py state_agent group33
python3 -m grader group33.zip -v

Demo for creating Canvas compatible jit file

Training Command:

python3 -m imitation_agent.train -e 1 -v AI_L2x256_blue --time_steps 500000 --time_steps_infer 10 --nenv 2 --use_opponent --expert jurgen_agent --batch_size 512 --device cuda --md 90 --net_arch "256,256"

Command for creating jit compatible file:

python3 -m imitation_agent.canvas_jit -e 1 -v AI_L2x256_blue_error --time_steps 1 --time_steps_infer 10 --nenv 1 --use_opponent --expert jurgen_agent --batch_size 512 --device cuda --md 90 --net_arch "256,256" --resume_training "AI_L2x256_blue/AI_L2x256_blue.pt"

Reward Function for single team match

Offense Player (Player 1)

Minimize "kart_to_puck_dist"
- Rewarding the player (it increases exponentially): np.exp(-x)
- TODO: Check if normalizing the values helps or not
Aligning the player-puck-opponent goal post
- 1st vector: (puck - player)
- 2nd vector: (opponent goal post - player)
- Use cosine similarity b/w the two vectors
Reward for scoring the goal
- Minimize the distance b/w ball and goal post

Defense Player (Player 2)

Reward for not allowing the ball in a region

Reward Function for match against opponents

Reward based on current match state

Notes

PyStk State space

player_state

camera (https://pystk.readthedocs.io/en/latest/state.html#pystk.Camera)
Not needed????
Kart (https://pystk.readthedocs.io/en/latest/state.html#pystk.Kart)
attachment - types of attachment
front - Front direction of kart 1/2 kart length forward from location - float3
id - Kart id compatible with instance labels - int
jumping - Is the kart jumping? - bool (Not needed I think)
location - 3D world location of the kart - float3
max_steer_angle - Maximum steering angle - float
name - Player name - str
overall_distance - Overall distance traveled - float (Not needed I think)
player_id - Player id - int
powerup - Powerup collected - powerup
rotation - Quaternion rotation of the kart - Quaternion
velocity - Velocity of kart - float3

game_state

ball
id - Object id of the soccer ball - int
location - 3D world location of the item - float3 ( )
size - Size of the ball - float
goal_line (static) - Start and end of the goal line for each team - List[List[float3[2]][2]]

Features calculated by extract_features()

kart_direction - float2
kart_angle - float
kart_to_puck_direction - float2
kart_to_puck_angle - float
kart_to_puck_angle_difference - float
kart_to_opponent0 - float2
kart_to_opponent0_angle - float
kart_to_opponent0_angle_difference - float
goal_line_center - float2
puck_to_goal_line - float2
puck_to_goal_line_angle - float
kart_to_goal_line_angle_difference - float

Somethings that can help

ball velocity
ball acceleration?

Rewards (https://github.com/Rolv-Arild/Necto)

agents velocity towards ball
balls velocity towards goal
+ve reward on goal

-ve reward on opponent goal
+ve reward if shot on target
+ve on making a save
+ve on impeding opponent

psuedo code for necto rewards

game state reward = ball_pos closer to goal (continuos)

Questions

How does it know to reverse?
continous space
how can i train with some base agent
circiculum learning - ppo
action space
rewarding shaping

Name		Name	Last commit message	Last commit date
Latest commit History 152 Commits
.idea		.idea
geoffrey_agent		geoffrey_agent
grader		grader
image_agent		image_agent
image_jurgen_agent		image_jurgen_agent
imitation_agent		imitation_agent
imitation_local		imitation_local
jurgen_agent		jurgen_agent
saved_model		saved_model
stable_baselines3_local		stable_baselines3_local
state_agent		state_agent
tb_log_reference		tb_log_reference
test1_da		test1_da
tournament		tournament
yann_agent		yann_agent
yoshua_agent		yoshua_agent
.gitignore		.gitignore
README.md		README.md
Report.pdf		Report.pdf
Rewards.txt		Rewards.txt
bundle.py		bundle.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Deep Learning Spring 2024 - Final Project

Commands for creating submission

Demo for creating Canvas compatible jit file

Reward Function for single team match

Offense Player (Player 1)

Defense Player (Player 2)

Reward Function for match against opponents

Notes

PyStk State space

Features calculated by extract_features()

Rewards (https://github.com/Rolv-Arild/Necto)

Questions

About

Uh oh!

Releases

Packages

Languages

ckvermaAI/dl-project

Folders and files

Latest commit

History

Repository files navigation

Deep Learning Spring 2024 - Final Project

Commands for creating submission

Demo for creating Canvas compatible jit file

Reward Function for single team match

Offense Player (Player 1)

Defense Player (Player 2)

Reward Function for match against opponents

Notes

PyStk State space

Features calculated by extract_features()

Rewards (https://github.com/Rolv-Arild/Necto)

Questions

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages