Replies: 10 comments · 33 replies
-
Hi, we are trying to reproduce the find and avoid example in our own world, but we are not able to transmit the created message from the e-puck to the supervisor. Do you have a quick document on this program and on how to use the JSON scripts for the emitter/receiver?
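For reference, a rough sketch of JSON messaging between a robot controller and a supervisor using Webots Emitter/Receiver devices might look like the following (the device names, payload, and loop structure here are assumptions for illustration, not the find and avoid project's actual code). Robot side:

```python
import json
from controller import Robot

# Robot-side controller (e.g. on the e-puck): serialize a dict to JSON and emit it.
robot = Robot()
timestep = int(robot.getBasicTimeStep())
emitter = robot.getDevice("emitter")  # assumes an Emitter device named "emitter"

while robot.step(timestep) != -1:
    message = {"distance_sensors": [0.1, 0.2, 0.3]}  # illustrative payload
    emitter.send(json.dumps(message).encode("utf-8"))
```

Supervisor side:

```python
import json
from controller import Supervisor

# Supervisor-side controller: poll the receiver queue and decode the JSON payload.
supervisor = Supervisor()
timestep = int(supervisor.getBasicTimeStep())
receiver = supervisor.getDevice("receiver")  # assumes a Receiver device named "receiver"
receiver.enable(timestep)  # the receiver only queues packets after being enabled

while supervisor.step(timestep) != -1:
    while receiver.getQueueLength() > 0:
        message = json.loads(receiver.getString())  # on older Webots versions use receiver.getData()
        print(message)
        receiver.nextPacket()
```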
-
Hi, I am writing a paper with the goal of implementing the stable_baselines algorithms in a Webots environment (using the IRB4600). The first step for me now is this tutorial:
INFO: robotSupervisorController: Starting controller: python.exe -u robotSupervisorController.py
Another question: I cannot implement this at the same time as the RobotSupervisor. Is there another solution for this? Is there already an example project for it? Many thanks
-
Hi, great tutorial! Are there definitions for some of the methods, like …
-
Hi, thanks for the awesome tutorial! However, when I run the tutorial I get the error ModuleNotFoundError: No module named 'controller'; the error comes from supervisor_env.py. I would kindly request your assistance on this.
-
Great, crystal clear tutorial! Many thanks indeed for your work.
-
Awesome, thank you so much!
…On Thu, 21 Mar 2024 at 21:25, Kostas Tsampazis wrote:
Hello @ChrisSim01 <https://github.com/ChrisSim01>, thank you for your kind words, very happy to hear that the tutorial proved useful for you! 😄
As for your first question, you can find the agent save/load methods here <https://github.com/aidudezzz/deepbots-tutorials/blob/48f76ecc9791d6b9cfc415623b4ad30d208efc9c/robotSupervisorSchemeTutorial/full_project/controllers/robot_supervisor_controller/PPO_agent.py#L93-L113>, which basically save the actor/critic neural nets. They can be called whenever you wish: every episode, every few episodes, or at the end of training to save the agent.
As long as you have saved the models once, you can load them after the agent is initialized here <https://github.com/aidudezzz/deepbots-tutorials/blob/48f76ecc9791d6b9cfc415623b4ad30d208efc9c/robotSupervisorSchemeTutorial/full_project/controllers/robot_supervisor_controller/robot_supervisor_controller.py#L95>.
To continue training on a different task with the same agent, you can modify your environment, Webots worlds, etc. and just load the agent as discussed.
Hope this helps, let me know if you have other questions!
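For readers following along, a rough sketch of what such save/load methods could look like for PyTorch actor/critic networks (the class, attribute, and file names here are illustrative, not necessarily the tutorial's exact code):

```python
import torch

class PPOAgent:
    def __init__(self, actor_net, critic_net):
        self.actor_net = actor_net    # torch.nn.Module for the policy
        self.critic_net = critic_net  # torch.nn.Module for the value function

    def save(self, path):
        # Persist the actor/critic weights, e.g. every N episodes or at the end of training.
        torch.save(self.actor_net.state_dict(), path + "_actor.pt")
        torch.save(self.critic_net.state_dict(), path + "_critic.pt")

    def load(self, path):
        # Restore previously saved weights after the agent has been initialized.
        self.actor_net.load_state_dict(torch.load(path + "_actor.pt"))
        self.critic_net.load_state_dict(torch.load(path + "_critic.pt"))
```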
-
Hi again,
You could make an online course out of this tutorial via Udemy.com and make
money from it!
Just an idea,
Best regards,
Chris
-
Hi again Kostas,
I do have one further question please: is there any way to 'lock', or encapsulate, a neural network module once it is learned, so that it is used by, but not changed by, further neural network training? This would allow tasks to be broken down into subtasks and the subtasks to be chained together, I think.
I'm thinking in the style of
https://nn.cs.utexas.edu/downloads/papers/lessin.gecco13.pdf
Best regards,
Chris
-
Thanks again Kostas. I'm on a steep learning curve!
…On Fri, 22 Mar 2024 at 21:50, Kostas Tsampazis wrote:
Very interesting paper!
I think that what you are describing can be achieved by "freezing" specific layers of your networks, something along the lines of this <https://discuss.pytorch.org/t/how-the-pytorch-freeze-network-in-some-layers-only-the-rest-of-the-training/7088>.
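For concreteness, a minimal PyTorch sketch of freezing an already-trained module so that later training leaves it unchanged (the module shapes and names below are illustrative):

```python
import torch
import torch.nn as nn

# Illustrative pretrained "subtask" module to be reused but not retrained.
pretrained_skill = nn.Sequential(nn.Linear(8, 32), nn.ReLU(), nn.Linear(32, 4))

# Freeze it: no gradients are computed for its parameters during backprop.
for param in pretrained_skill.parameters():
    param.requires_grad = False

# Only the new, trainable head is handed to the optimizer.
new_head = nn.Linear(4, 2)
optimizer = torch.optim.Adam(new_head.parameters(), lr=1e-3)

# Forward passes still use the frozen module; only new_head gets updated.
x = torch.randn(1, 8)
loss = new_head(pretrained_skill(x)).sum()
loss.backward()
optimizer.step()
```

The general idea, as in the PyTorch forum thread linked above, is to set requires_grad to False on the frozen parameters and leave them out of the optimizer.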
-
Link to the tutorial.