loscrossos
Contributor

The current GitHub wheel build process uses TORCH_CUDA_ARCH_LIST = 7.5 8.0+PTX 9.0a.
This should be forward compatible, but in some cases it isn't for Blackwell (see #1251).
The compilation code already supports Blackwell: building with TORCH_CUDA_ARCH_LIST=12.0 solves the issue in #1251, but the pre-built wheels are not compiled with that flag. Since the build process already uses CUDA Toolkit 12.8, I added compute capabilities 100 and 120 (10.0 and 12.0) to the workflow. I have already submitted a PR for the FA2 code path (#1254).
This PR adds Blackwell support to the main build workflow for the PyPI builds.
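
For reviewers who want to reason about which GPUs an arch list covers, here is a minimal sketch of the nominal CUDA compatibility rules, not code from this repository. The helper names `parse_arch_list` and `coverage` are hypothetical, and the extended list in the usage lines is only an illustration of adding 10.0 and 12.0, not the exact string in the workflow. It assumes the usual rules: a binary built for X.y runs on devices with the same major version and an equal or higher minor version, `+PTX` entries can be JIT-compiled on newer devices, and `a`-suffixed targets such as 9.0a run only on that exact architecture.

```python
def parse_arch_list(arch_list):
    """Parse a TORCH_CUDA_ARCH_LIST-style string.

    Returns a list of (major, minor, arch_specific, has_ptx) tuples.
    Hypothetical helper for illustration only.
    """
    entries = []
    for token in arch_list.replace(";", " ").split():
        has_ptx = token.endswith("+PTX")
        if has_ptx:
            token = token[: -len("+PTX")]
        # An "a" suffix (e.g. 9.0a) marks an architecture-specific target.
        arch_specific = token.endswith("a")
        if arch_specific:
            token = token[:-1]
        major, minor = token.split(".")
        entries.append((int(major), int(minor), arch_specific, has_ptx))
    return entries


def coverage(arch_list, device):
    """Classify how a device capability (major, minor) is covered.

    Returns "native" if a compiled binary in the list can run on the device,
    "ptx-jit" if only embedded PTX can be JIT-compiled for it, else "none".
    This models the nominal rules only; real kernels may still fail on the
    PTX path, which is what #1251 reports for Blackwell.
    """
    entries = parse_arch_list(arch_list)
    for major, minor, arch_specific, _ in entries:
        if arch_specific:
            # Architecture-specific binaries run only on the exact target.
            if (major, minor) == tuple(device):
                return "native"
        elif major == device[0] and minor <= device[1]:
            # Binaries are compatible within the same major version.
            return "native"
    for major, minor, arch_specific, has_ptx in entries:
        # PTX is forward compatible with any equal or newer capability.
        if has_ptx and not arch_specific and (major, minor) <= tuple(device):
            return "ptx-jit"
    return "none"


print(coverage("7.5 8.0+PTX 9.0a", (12, 0)))            # ptx-jit
print(coverage("7.5 8.0+PTX 9.0a 10.0 12.0", (12, 0)))  # native
```

With the current list, a capability 12.0 device is reached only through the 8.0 PTX fallback. With 10.0 and 12.0 added, it gets a natively compiled binary. On a real machine, `torch.cuda.get_device_capability()` returns the `(major, minor)` tuple that can be passed as `device`.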

Labels

CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
