Add post_dataloading_processing method to Trainer (#1985)
Stack from [ghstack](https://github.com/ezyang/ghstack/tree/0.12.0)
(oldest at bottom):
* #2002
* #2001
* #1995
* __->__ #1985
We are adding more steps that convert the raw inputs and labels after
they are loaded. Putting them in a dedicated method has two benefits:
1. The new CP can do the input/label/BlockMask sharding in this
method.
2. The experimental full DTensor model can simply override this method
without changing much Trainer code.
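
As a rough illustration of the override pattern described above, here is a minimal sketch in plain Python. It is not the actual Trainer code from this PR: the class bodies, the return shape of `post_dataloading_processing`, `forward_backward`, `ShardingTrainer`, and the `seq_offset` key are all assumptions made for the example, and lists stand in for tensors.

```python
from typing import Any


class Trainer:
    """Hypothetical, stripped-down trainer (not the real Trainer class)."""

    def post_dataloading_processing(
        self, inputs: list, labels: list
    ) -> tuple[list, list, dict[str, Any]]:
        # Default hook: pass the raw batch through unchanged.
        # The third element carries optional extra kwargs for the model
        # (an assumed convention for this sketch).
        return inputs, labels, {}

    def train_step(self, batch: tuple[list, list]) -> dict[str, Any]:
        inputs, labels = batch
        # Single, overridable place for all post-dataloading conversions.
        inputs, labels, extra = self.post_dataloading_processing(inputs, labels)
        return self.forward_backward(inputs, labels, **extra)

    def forward_backward(self, inputs: list, labels: list, **extra: Any) -> dict[str, Any]:
        # Placeholder for the real forward/backward pass: just echo what
        # the model would receive.
        return {"inputs": inputs, "labels": labels, "extra": extra}


class ShardingTrainer(Trainer):
    """Hypothetical subclass that shards the sequence across ranks."""

    def __init__(self, rank: int, world_size: int) -> None:
        self.rank = rank
        self.world_size = world_size

    def post_dataloading_processing(self, inputs, labels):
        # Keep only this rank's contiguous slice of the sequence.
        chunk = len(inputs) // self.world_size
        start = self.rank * chunk
        sl = slice(start, start + chunk)
        return inputs[sl], labels[sl], {"seq_offset": start}


base_out = Trainer().train_step(([1, 2, 3, 4], [2, 3, 4, 5]))
shard_out = ShardingTrainer(rank=1, world_size=2).train_step(
    ([1, 2, 3, 4], [2, 3, 4, 5])
)
```

In this sketch only the hook is overridden; `train_step` stays untouched, which is the property that lets the two projects evolve independently.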
This method is extracted from #1857.
Making this a standalone PR allows us to continue the two projects
above without one blocking the other.