In `SAC.py`, line 120 (Deep-Reinforcement-Learning-Algorithms-with-PyTorch/agents/actor_critic_agents/SAC.py, commit b338c87):
`_, z, action = self.produce_action_and_action_info(state)`
However, `produce_action_and_action_info(state)` returns the following (Deep-Reinforcement-Learning-Algorithms-with-PyTorch/agents/actor_critic_agents/SAC.py, line 135, commit b338c87):
`return action, log_prob, torch.tanh(mean)`
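The call site therefore binds the second return value, `log_prob`, to the name `z`, and the third, `torch.tanh(mean)`, to the name `action`, while the sampled action in the first position is discarded. So, even though the SAC algorithm works in practice, is this a mistake?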
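To make the question concrete, here is a minimal sketch of the positional mismatch. It is a scalar, pure-Python stand-in and not the repository code: the function body, the `mean`/`std`/`rng` parameters, and the names `sampled_action` and `greedy_action` are assumptions made for illustration. Only the return order `(action, log_prob, tanh(mean))` and the unpacking pattern `_, z, action = ...` are taken from the two lines quoted above.

```python
import math
import random


def produce_action_and_action_info(mean, std, rng):
    """Hypothetical scalar stand-in for the quoted method.

    Returns (action, log_prob, tanh(mean)), the same order as the
    quoted return statement.
    """
    z = rng.gauss(mean, std)   # pre-squash Gaussian sample
    action = math.tanh(z)      # squashed action in (-1, 1)
    # Assumed log-density: Gaussian log-prob of z plus the usual
    # tanh change-of-variables correction.
    log_prob = (
        -0.5 * ((z - mean) / std) ** 2
        - math.log(std)
        - 0.5 * math.log(2 * math.pi)
        - math.log(1 - action ** 2 + 1e-6)
    )
    return action, log_prob, math.tanh(mean)


# Unpacking as at the quoted call site: names are bound by position only.
rng = random.Random(0)
_, z, action = produce_action_and_action_info(0.3, 0.5, rng)

# Same call with the same seed, unpacked with names matching the return order.
rng = random.Random(0)
sampled_action, log_prob, greedy_action = produce_action_and_action_info(0.3, 0.5, rng)

print(z == log_prob)            # the variable called `z` holds the log-probability
print(action == greedy_action)  # the variable called `action` holds tanh(mean)
```

In this sketch the variable named `z` ends up holding the log-probability, and `action` ends up holding `tanh(mean)` rather than the sampled action. Whether that is a bug or only a misleading name depends on how the call site uses `action`, which the two quoted lines alone do not show.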