SafeDRP Hybrid Extension

This repository is an extension of the repository SafeDRP from kaji-ou,
which is already an extension of DRPChallenge.

We extend the original environment with a hybrid action policy that combines rule-based navigation and reinforcement learning (RL).

How to run ?

While in epymarl:

Run normally

python3 src/main.py --config=qmix --env-config=gymma with env_args.time_limit=100 env_args.key="drp_env:drp-4agent_map_8x5-v2" env_args.state_repre_flag="onehot_fov"

Run with safe one

This uses safe wrapper "dep_env/SafeMarlEnv/env_wrapper"

python3 src/main.py --config=qmix --env-config=gymma with env_args.time_limit=100 env_args.key="drp_env:drp_safe-4agent_map_8x5-v2" env_args.state_repre_flag="onehot_fov"

Hybrid Action Policy Extension

Our approach employs a hybrid action policy: a combination of rule-based navigation and reinforcement learning (RL).

Rule-based algorithm: guides the agent toward the target node via the shortest path when possible.
RL exploration: enables adaptive behavior through trial-and-error.
Weighted combination:
- Early training: 90% rule-based, 10% RL.
- Over time: the rule-based influence decreases with episode index.

Implementation highlights

probability_rule_based() : returns probability of following rule-based policy (decreases with training).
shortest_path_action(joint_action) : computes valid shortest-path actions.
action_policy(joint_action) : mixes rule-based and RL policies.
action_policy_verifying(next_node, i) : ensures validity of chosen actions.
get_map_complexity() : calculates a numerical complexity score for the map.

Benefits:

Stabilizes early training with rule-based guidance. Encourages adaptive, generalized behavior as RL influence grows.

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
assets		assets
drp.egg-info		drp.egg-info
drp_env		drp_env
example		example
policy		policy
problem		problem
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
calculate_cost.py		calculate_cost.py
policy_tester.py		policy_tester.py
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

SafeDRP Hybrid Extension

How to run ?

Run normally

Run with safe one

Hybrid Action Policy Extension

Implementation highlights

Benefits:

About

Uh oh!

Releases

Packages

Languages

License

louannhintzy/safeHybridDRP

Folders and files

Latest commit

History

Repository files navigation

SafeDRP Hybrid Extension

How to run ?

Run normally

Run with safe one

Hybrid Action Policy Extension

Implementation highlights

Benefits:

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages