Open
Description
It seems to me that this line should be changed to if 'tm' in self.name
(
self.start_conversation
and self.end_conversation
to split the training and test sets (seehttps://github.com/rosewang2008/language_modeling_via_stochastic_processes/blob/main/language_modeling_via_stochastic_processes/transformers/src/transformers/data/datasets/language_modeling.py#L1182
) for the tm2 dataset. With the current code, it seems that the training and test sets would be the same for tm2.
Metadata
Metadata
Assignees
Labels
No labels