feature(wrh): add adaptive batch size for transition #256

ruiheng123 · 2024-07-31T12:52:11Z

Each env has a different number of steps per episode. This PR sets the sampling batch size of each env according to the number of steps collected in its buffer, so an env with more collected steps gets a larger batch size for training.
The batch sizes are computed with a softmax with temperature parameter T, which keeps the differences between batch sizes from becoming too large.
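
A minimal sketch of the idea, not the code from this PR: the function name `adaptive_batch_sizes`, its parameters, and the choice to normalize collected steps to fractions before the softmax are all assumptions made for illustration. It splits a fixed total batch size across envs with a temperature softmax and hands out the rounding remainder so that the sizes sum exactly to the total.

```python
import numpy as np


def adaptive_batch_sizes(collected_steps, total_batch_size, temperature=1.0):
    """Split total_batch_size across envs by softmax over collected steps.

    Hypothetical helper: names and the normalization step are assumptions,
    not taken from the PR.
    """
    steps = np.asarray(collected_steps, dtype=np.float64)
    total_steps = steps.sum()
    # Assumption: use each env's share of collected steps as the logit,
    # so logits stay in [0, 1] regardless of buffer size.
    fractions = steps / total_steps if total_steps > 0 else np.zeros_like(steps)

    # Softmax with temperature; larger T flattens the distribution.
    logits = fractions / temperature
    logits = logits - logits.max()  # subtract max for numerical stability
    weights = np.exp(logits)
    weights = weights / weights.sum()

    # Round down, then give leftover samples to the largest fractional parts.
    raw = weights * total_batch_size
    sizes = np.floor(raw).astype(int)
    remainder = int(total_batch_size - sizes.sum())
    order = np.argsort(-(raw - sizes))
    sizes[order[:remainder]] += 1
    return sizes.tolist()


if __name__ == "__main__":
    steps = [100, 200, 700]
    for t in (0.1, 1.0, 10.0):
        print(t, adaptive_batch_sizes(steps, 512, temperature=t))
```

In this sketch a small T lets the env with the most collected steps dominate the batch, while a large T pushes the sizes toward an even split; the right value of T depends on how unbalanced the buffers are.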

ruiheng123 added 4 commits July 31, 2024 12:50

feature(wrh): add adaptive batch size for transition

9e0b55d

feature(wrh): add adaptive batch size as softmax with temp

6670bbd

feature(wrh): add adaptive batch size as softmax with temp

75b54d8

feature(wrh): add adaptive batch size for transition

c274ba4

puyuan1996 added the enhancement (New feature or request) label Aug 5, 2024
