feature(whl): add PC+MCTS code #603

kxzxvbk · 2023-03-05T08:59:25Z

Description

Related Issue

TODO

Check List

merge the latest version source branch/repo, and resolve all the conflicts
pass style check
pass all the tests

PaParaZz1 · 2023-03-09T06:32:28Z

dizoo/atari/config/serial/qbert/qbert_pc_mcts_config.py

+            learner=dict(hook=dict(save_ckpt_after_iter=1000)),
+            train_epoch=20,
+        ),
+        eval=dict(evaluator=dict(eval_freq=40, ))


increase eval_freq

PaParaZz1 · 2023-03-09T06:32:52Z

dizoo/atari/config/serial/pong/pong_pc_mcts_config.py

+qbert_pc_mcts_config = dict(
+    exp_name='pong_pc_mcts_seed0',
+    env=dict(
+        manager=dict(


remove unmodified default config

PaParaZz1 · 2023-03-09T06:33:38Z

ding/worker/collector/interaction_serial_evaluator.py

                                    self._env.enable_save_figure(env_id, self._cfg.figure_path)
                            self._policy.reset([env_id])
-                            reward = t.info['eval_episode_return']
+                            if 'final_eval_reward' in t.info.keys():


final_eval_reward has been renamed to eval_episode_return

PaParaZz1 · 2023-03-09T06:34:29Z

ding/policy/pc.py

+    def _forward_collect(self, data: Dict[int, Any], **kwargs) -> Dict[int, Any]:
+        pass
+
+    def _process_transition(self, obs: Any, model_output: dict, timestep: namedtuple) -> dict:


polish methods related to collect

PaParaZz1 · 2023-03-09T06:34:43Z

ding/policy/pc.py

+            output = {'action': output}
+        output = default_decollate(output)
+        # TODO why this bug?
+        output = [{'action': o['action'].item()} for o in output]


why add this

‘whl’ and others added 4 commits March 4, 2023 19:05

init commit

2ecbb3f

init commit

f090d31

bug fux

6dc65e1

Merge branch 'main' into pc-mcts

559bd87

PaParaZz1 added the algo Add new algorithm or improve old one label Mar 5, 2023

‘whl’ added 2 commits March 6, 2023 10:27

reformat

fdd5d34

Merge branch 'pc-mcts' of github.com:kxzxvbk/DI-engine into pc-mcts

4343ca1

PaParaZz1 requested changes Mar 9, 2023

View reviewed changes

PaParaZz1 mentioned this pull request Mar 9, 2023

Roadmap for DI-engine #548

Open

‘whl’ added 21 commits March 15, 2023 17:40

add visualization

12b9a02

feature(whl): update pc model

a459278

fix_bug

f983602

fix_bug

f2db38a

fix_bug

1403f73

fix_bug

cd2c871

fix_bug

1a8ea4f

fix_bug

4ffaaa4

fix_bug

1e15ee4

fix_bug

d486ee8

bug fix

c6662c4

add visualization for recurrent mode

d925897

debug visualization for recurrent mode

841e013

debug visualization for recurrent mode

ffb8154

debug forward eval

b3c72aa

debug forward eval

cfbd277

debug forward eval

b876347

reweight loss

70baa69

reweight loss

ab9eda7

add seq actions

b0080ac

polish loss

6f5ba7e

update metric monitor

e30c98d

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feature(whl): add PC+MCTS code #603

feature(whl): add PC+MCTS code #603

Uh oh!

kxzxvbk commented Mar 5, 2023

Uh oh!

PaParaZz1 Mar 9, 2023

Uh oh!

PaParaZz1 Mar 9, 2023

Uh oh!

PaParaZz1 Mar 9, 2023

Uh oh!

PaParaZz1 Mar 9, 2023

Uh oh!

PaParaZz1 Mar 9, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

feature(whl): add PC+MCTS code #603

Are you sure you want to change the base?

feature(whl): add PC+MCTS code #603

Uh oh!

Conversation

kxzxvbk commented Mar 5, 2023

Description

Related Issue

TODO

Check List

Uh oh!

PaParaZz1 Mar 9, 2023

Choose a reason for hiding this comment

Uh oh!

PaParaZz1 Mar 9, 2023

Choose a reason for hiding this comment

Uh oh!

PaParaZz1 Mar 9, 2023

Choose a reason for hiding this comment

Uh oh!

PaParaZz1 Mar 9, 2023

Choose a reason for hiding this comment

Uh oh!

PaParaZz1 Mar 9, 2023

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants