Use external hip and hipcub headers when HOOMD_GPU_PLATFORM=CUDA #2178

joaander · 2025-11-20T15:13:13Z

Description

Remove the vendored hip and hipcub headers. Instead, use find_package(hip) to find external headers.
Change compiler definitions to support modern versions of hip.

Motivation and context

CUDA 13 contains many breaking changes and the vendored headers do not support it.
By using external hip libraries, HOOMD-blue will gain support for new versions of CUDA as soon as upstream adds support (at this time the latest release of hipcub does not support CUDA 13).

conda-forge does not support HIP_PLATFORM=nvidia (conda-forge/hip-feedstock#9) in hip-devel and lacks a hipcub package entirely. Therefore, users that build HOOMD from source for NVIDIA GPUs will need to install hip and hipcub headers:

How has this been tested?

HOOMD-blue compiles and passes tests with CUDA 12.9, rocm-systems:hip-version_7.2.53220, and rocm-libraries:rocm-7.1.0 locally. CI checks have been updated accordingly. Patches to hip and hipcub fix build errors with CUDA 12.5–12.8.

Checklist:

I have reviewed the Contributor Guidelines.
I agree with the terms of the HOOMD-blue Contributor Agreement.
My name is on the list of contributors (sphinx-doc/credits.rst) in the pull request source branch.
I have summarized these changes in CHANGELOG.rst following the established format.

Builds fail with 7.1 on CUDA 12.

Also document Pixi installation instructions.

joaander · 2025-11-21T18:55:04Z

@mphoward, what do you think of using an external HIP library even when HOOMD_GPU_PLATFORM=CUDA?

Ideally we could avoid this by using hipper everywhere, but that would be a lot more work.

In any case, I won't merge this until hipcub supports CUDA 13. Until then, these changes introduce additional complexity for developers with no benefit.

mphoward · 2025-11-25T03:31:10Z

Hmm, it isn't ideal to require CUDA users to download and compile HIP, but I agree this fix is substantially less work than getting hipper updated to modern rocm and implemented everywhere in HOOMD. (This work is still on my todo list for a long time, but I've been unable to get to it.)

Do you have a rough sense of when hipcub will support for CUDA 13 and this will be ready to merge? We could always temporarily require HIP, then relax this requirement later if we did complete the extra work.

joaander · 2025-12-01T13:42:49Z

Do you have a rough sense of when hipcub will support for CUDA 13 and this will be ready to merge? We could always temporarily require HIP, then relax this requirement later if we did complete the extra work.

I do not. When working on this branch, I was surprised to find that hipcub does not already support CUDA 13 given that it has been out for several months. I also found that the development version of hip supports only CUDA 12.9 and 13 (I had to patch hip to support all CUDA 12.4--12.8). There are no issues, discussions, or open pull requests regarding CUDA 13: https://github.com/ROCm/rocm-libraries

I am concerned that HIP/CUDA interoperation is no longer actively supported by AMD. They seem to measure success by "can it run PyTorch?" and have even made the effort to create a custom build system that builds ROCm, HIP, and PyTorch: https://github.com/ROCm/TheRock

mphoward · 2025-12-01T14:20:48Z

I was surprised to find that hipcub does not already support CUDA 13 given that it has been out for several months. I also found that the development version of hip supports only CUDA 12.9 and 13 (I had to patch hip to support all CUDA 12.4--12.8).

I am concerned that HIP/CUDA interoperation is no longer actively supported by AMD.

I agree, this is all quite concerning. It might also explain why they have more closely matched the HIP kernel launch / device code interface with CUDA in recent major releases (it is a replacement, not a compatibility layer).

Lack of proper CUDA support would be a good reason to put in the effort to update and use hipper as our own compatibility layer. I can prioritize working on that, but the soonest I can have a look would be in ~2 weeks (after the semester ends). It will require breaking changes to hipper first, and I am good with jumping to whatever the minimum versions of CUDA and ROCm that we need for HOOMD to keep the work as limited as possible.

joaander · 2025-12-01T19:22:41Z

We'll give AMD some time and see if they add CUDA 13 support.

If you do plan an eventual hipper refactor, the minimum CUDA I need is 12.8. NCSA Delta has just updated to that version. When I submitted this PR, the system was still on CUDA 12.4.

joaander added 5 commits November 20, 2025 08:43

Add missing cassert includes.

6b3fb6b

Add missing undef

1c3a2f0

Use modern external HIP headers when HOOMD_GPU_PLATFORM=CUDA

1b75fc9

Remove unused submodules.

d0d1444

Install hip and hipcub.

0600388

joaander marked this pull request as draft November 20, 2025 15:13

joaander added 7 commits November 20, 2025 10:26

Use the hip 7.2 prerelease

5de1a13

Builds fail with 7.1 on CUDA 12.

Patch hip to enable support for CUDA 12.x (where x < 8)

2d32b03

Also patch rocm-libraries.

ebbbb2a

Patch device_merge

8452b37

Test with cuda 12.5, 12.6, 12.8, and 12.9.

a364dc1

Fix failing cuda 12.4 build.

125169f

device_transform requires CUDA 12.9

fb74d6a

joaander added the release Build and unit test all support compiler/python configurations label Nov 20, 2025

joaander added 8 commits November 20, 2025 13:13

More hipcub patches.

ec617ac

Build for compute 8.0 on CI.

8a9a0f5

Test on CUDA 12.5+.

a951845

Remove unused include.

54de51b

Some preparation for CUDA 13

542c82a

Document new build requirements.

ed342b4

Document hip and hipcub dependencies.

a85e092

Also document Pixi installation instructions.

Disable cuda 13 tests.

4811aa6

joaander deleted the branch trunk November 21, 2025 22:03

joaander closed this Nov 21, 2025

joaander reopened this Dec 1, 2025

joaander changed the base branch from trunk-major to trunk December 1, 2025 13:14

Merge branch 'trunk' into external-hip

61e6680

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Use external hip and hipcub headers when HOOMD_GPU_PLATFORM=CUDA #2178

Use external hip and hipcub headers when HOOMD_GPU_PLATFORM=CUDA #2178

Uh oh!

joaander commented Nov 20, 2025 •

edited

Loading

Uh oh!

joaander commented Nov 21, 2025

Uh oh!

mphoward commented Nov 25, 2025

Uh oh!

joaander commented Dec 1, 2025

Uh oh!

mphoward commented Dec 1, 2025

Uh oh!

joaander commented Dec 1, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Use external hip and hipcub headers when HOOMD_GPU_PLATFORM=CUDA #2178

Are you sure you want to change the base?

Use external hip and hipcub headers when HOOMD_GPU_PLATFORM=CUDA #2178

Uh oh!

Conversation

joaander commented Nov 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Motivation and context

How has this been tested?

Checklist:

Uh oh!

joaander commented Nov 21, 2025

Uh oh!

mphoward commented Nov 25, 2025

Uh oh!

joaander commented Dec 1, 2025

Uh oh!

mphoward commented Dec 1, 2025

Uh oh!

joaander commented Dec 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

joaander commented Nov 20, 2025 •

edited

Loading

joaander commented Dec 1, 2025 •

edited

Loading