@fegin (Contributor) commented Nov 4, 2025

Stack from ghstack (oldest at bottom):

We are adding more actions to the conversion of the raw inputs and labels, gathered in one method.

  1. The new CP can do the input/label/BlockMask sharding in this method.
  2. The experimental full DTensor model can simply override this method without changing much Trainer code.
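
A minimal sketch of the override pattern described in the two points above, under stated assumptions: `ToyTrainer`, `ShardingTrainer`, `preprocess_batch`, and `train_step` are hypothetical names chosen for illustration and are not taken from this PR, and plain Python lists stand in for tensors. The base class does the default conversion in one hook, and a subclass overrides only that hook to shard inputs and labels along the sequence dimension.

```python
from typing import Any


class ToyTrainer:
    """Toy stand-in for a trainer (hypothetical, not the real Trainer)."""

    def preprocess_batch(
        self, input_dict: dict[str, Any], labels: list[int]
    ) -> tuple[list[int], list[int], dict[str, Any]]:
        # Hypothetical hook: split the raw batch into model inputs, labels,
        # and any remaining keyword inputs. Subclasses override this.
        inputs = input_dict["input"]
        extra = {k: v for k, v in input_dict.items() if k != "input"}
        return inputs, labels, extra

    def train_step(self, input_dict: dict[str, Any], labels: list[int]):
        # The training step calls the hook and never needs to know
        # whether the batch was sharded or otherwise transformed.
        return self.preprocess_batch(input_dict, labels)


class ShardingTrainer(ToyTrainer):
    """Overrides only the hook to keep this rank's slice of the sequence."""

    def __init__(self, rank: int, world_size: int) -> None:
        self.rank = rank
        self.world_size = world_size

    def preprocess_batch(self, input_dict, labels):
        inputs, labels, extra = super().preprocess_batch(input_dict, labels)
        # Assumes the sequence length divides evenly by world_size.
        chunk = len(inputs) // self.world_size
        lo, hi = self.rank * chunk, (self.rank + 1) * chunk
        return inputs[lo:hi], labels[lo:hi], extra


batch = {"input": [1, 2, 3, 4]}
targets = [2, 3, 4, 5]
base_out = ToyTrainer().train_step(batch, targets)
shard_out = ShardingTrainer(rank=1, world_size=2).train_step(batch, targets)
```

Because `train_step` only calls the hook, swapping the conversion logic does not require touching the rest of the trainer, which is the property both projects above rely on.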

This method is extracted from #1857

Making this a standalone PR allows us to continue the two projects above without one blocking the other.

fegin added a commit that referenced this pull request Nov 4, 2025
ghstack-source-id: d1882a7
Pull-Request: #1985
@meta-cla meta-cla bot added the CLA Signed label Nov 4, 2025