-
Couldn't load subscription status.
- Fork 68
IdModel loop graph fix #5426
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
IdModel loop graph fix #5426
Conversation
|
!test |
Description
Changes walkthrough 📝
PR Reviewer Guide 🔍Here are some key observations to aid the review process:
|
|
To run @crcrpar 's example with nvfp4, i needed a few small fixes. So we would want to use thunder branch in Lightning-AI/lightning-thunder#2691 I used nvfuser code in this PR, as well as the small fix for nvfp4 in #5428 (already approved, will update comment after merge). In order to run the benchmark, the command to use (run this in Thunder's directory). The program did run to completion, but I haven't got around to verify the model: Note: we need to add |
Fixes: #5391
The issue is root cause from having un-connected IDs in allocation domain, triggering loop promotion assert on ID not covered by loop domain. However, since loop domain should only check coverage on logical sizes, we shouldn't included allocation domain in loop graph in the first place.
Changes in this PR:
TensorDomain::allIdsmethod to exclude allocation domain from the pairwise path traversal.