Issue in data loading

It seems to me that this line should be changed to `if 'tm' in self.name` (https://github.com/rosewang2008/language_modeling_via_stochastic_processes/blob/5cbc3eed581eba6444c471bfe716bd56db0f5253/language_modeling_via_stochastic_processes/transformers/src/transformers/data/datasets/language_modeling.py#L1201), since you were using `self.start_conversation` and `self.end_conversation` to split the training and test sets (see
https://github.com/rosewang2008/language_modeling_via_stochastic_processes/blob/main/language_modeling_via_stochastic_processes/transformers/src/transformers/data/datasets/language_modeling.py#L1182
) for the tm2 dataset. With the current code, it seems that the training and test sets would be the same for tm2.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Issue in data loading #10

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue in data loading #10

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions