Skip to content

Conversation

@terminator123
Copy link

Overview

Adds suppport for packing-with-padding format in gpt dataset. The new truncationless-packing mode addresses this by padding the remaining space in the current sequence instead of truncating the sample, thus preserving the integrity of every training example.

Key changes

  • Added -use-truncateless-packing flag to enable packing samples with padding
  • Fix loss_mask cache initialization when the sample has padding

@copy-pr-bot
Copy link

copy-pr-bot bot commented Sep 3, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants