-
Notifications
You must be signed in to change notification settings - Fork 596
[ET] enabling half dtype output for dequantization and making logic consistent #11552
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
…onsistent Enabling half dtype output and making dequantization logic consistent between per_tensor and per_token as it is currently prone to integer overflows on one over the other Differential Revision: [D76289181](https://our.internmc.facebook.com/intern/diff/D76289181/) [ghstack-poisoned]
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/11552
Note: Links to docs will display an error until the docs builds have been completed. ❌ 4 New Failures, 2 Cancelled Jobs, 18 Unrelated FailuresAs of commit 5cfc153 with merge base 8cfa858 ( NEW FAILURES - The following jobs have failed:
CANCELLED JOBS - The following jobs were cancelled. Please retry:
FLAKY - The following jobs failed but were likely due to flakiness present on trunk:
This comment was automatically generated by Dr. CI and updates every 15 minutes. |
This pull request was exported from Phabricator. Differential Revision: D76289181 |
…ing logic consistent" Enabling half dtype output and making dequantization logic consistent between per_tensor and per_token as it is currently prone to integer overflows on one over the other Differential Revision: [D76289181](https://our.internmc.facebook.com/intern/diff/D76289181/) [ghstack-poisoned]
This pull request was exported from Phabricator. Differential Revision: D76289181 |
…ing logic consistent" Enabling half dtype output and making dequantization logic consistent between per_tensor and per_token as it is currently prone to integer overflows on one over the other Differential Revision: [D76289181](https://our.internmc.facebook.com/intern/diff/D76289181/) [ghstack-poisoned]
This pull request was exported from Phabricator. Differential Revision: D76289181 |
…ing logic consistent" Enabling half dtype output and making dequantization logic consistent between per_tensor and per_token as it is currently prone to integer overflows on one over the other Differential Revision: [D76289181](https://our.internmc.facebook.com/intern/diff/D76289181/) [ghstack-poisoned]
This pull request was exported from Phabricator. Differential Revision: D76289181 |
…ing logic consistent" Enabling half dtype output and making dequantization logic consistent between per_tensor and per_token as it is currently prone to integer overflows on one over the other Differential Revision: [D76289181](https://our.internmc.facebook.com/intern/diff/D76289181/) [ghstack-poisoned]
This pull request was exported from Phabricator. Differential Revision: D76289181 |
…ing logic consistent" Enabling half dtype output and making dequantization logic consistent between per_tensor and per_token as it is currently prone to integer overflows on one over the other Differential Revision: [D76289181](https://our.internmc.facebook.com/intern/diff/D76289181/) [ghstack-poisoned]
This pull request was exported from Phabricator. Differential Revision: D76289181 |
…ing logic consistent" Enabling half dtype output and making dequantization logic consistent between per_tensor and per_token as it is currently prone to integer overflows on one over the other Differential Revision: [D76289181](https://our.internmc.facebook.com/intern/diff/D76289181/) [ghstack-poisoned]
This pull request was exported from Phabricator. Differential Revision: D76289181 |
…ing logic consistent" Enabling half dtype output and making dequantization logic consistent between per_tensor and per_token as it is currently prone to integer overflows on one over the other Differential Revision: [D76289181](https://our.internmc.facebook.com/intern/diff/D76289181/) [ghstack-poisoned]
This pull request was exported from Phabricator. Differential Revision: D76289181 |
…ing logic consistent" Enabling half dtype output and making dequantization logic consistent between per_tensor and per_token as it is currently prone to integer overflows on one over the other Differential Revision: [D76289181](https://our.internmc.facebook.com/intern/diff/D76289181/) [ghstack-poisoned]
This pull request was exported from Phabricator. Differential Revision: D76289181 |
…ing logic consistent" Enabling half dtype output and making dequantization logic consistent between per_tensor and per_token as it is currently prone to integer overflows on one over the other Differential Revision: [D76289181](https://our.internmc.facebook.com/intern/diff/D76289181/) [ghstack-poisoned]
This pull request was exported from Phabricator. Differential Revision: D76289181 |
64d1f4d
into
gh/ahmtox/17/base
Stack from ghstack (oldest at bottom):
Enabling half dtype output and making dequantization logic consistent between per_tensor and per_token as it is currently prone to integer overflows on one over the other
Differential Revision: D76289181