ROCM SDK Builder 6.1.2 released #216
lamikr
announced in
Announcements
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
ROCM SDK Builder 6.1.2 (2025-03-06)
The new ROCM SDK Builder 6.1.2 is now available on
with git tag v6.1.2.0. It is available both on source code and with links to prebuilt docker images that can be used from the Linux.
(No Windows WSL support)
The ROCM SDK Builder provides an easy-to-build stack for machine learning tools across nearly
GPUs and has released since gfx906 series.
ROCM SDK Builder consists of from over 130 machine learning related applications and libraries that are build and optimized for the the AMD GPUs.
The list integrated applications that provides the acceleration support on AMD GPU environment has increased significanly from the previous release.
The integration by using the ROCM SDK builder to build and install these applications is required because AMD support is quite often
missing from many of these applications if installed from the official PIP repositories.
The project has seen many new contributors who have helped with testing, bug reporting, and code fixes,
significantly improving the project. In addition the project received the MI50 GPU as a donation, which helped
to add support both for the MI50 and Vega VII GPUs. In same way special thanks also for the Framework company
which has provided a valuable support by allowing to use their computers with AMD iGPUs and discrete GPUs for development and testing.
Further maintenance fixes for this release may be made available on the git branch
releases/rocm_sdk_builder_612
while the master branch will soon be used for the development version targeting newer AMD ROCm release as a base.
New Features and Updates
https://hub.docker.com/r/lamikr/rocm_sdk_builder/tags
Full installation and usage instructions are provided on the ROCM SDK Builders github page.
GPU specific tuning has been added for many older GPUs in projects like rocBLAS and Aotriton to increase their performance.
(Thank you for the upstream ROCM Projects for providing a mechanism to add tuning data for additional GPUs)
/opt/rocm/docs/examples folder
/opt/rocm_sdk_612/benchmark to allow executing
https://github.com/lamikr/pytorch-gpu-benchmark
It will also allow generating graph diagrams to compare benchmark results between selected GPUs.
(source code needs to be edited in current version to select which GPU result filed are compared)
Applications are now divided to mandatory Core applications and non-mandatory Extra applications.
Lot of applications have been now added to the list of supported optional extra applications.
Extra applications are not build by default but can be build easily after the ROCM SDK Builder
has finished building the mandatory core applications.
ai_tools
: Llama.cpp, vLLM, Stable Diffusion (web UI), CTranslate2, etc.google-tools
: JAXamd-media-tools
: rocDecode, RPP, MIVisionXamd_devel_tools
: rocProfiler, OmniTracer, and their dependenciesamd-aie
: LLVM for XDNA/XDNA2 NPU (work in progress)compared to distro versions of same applications
The ROCM SDK Builder can now be used to build an LLVM compiler for AMD's VLIW instruction-based XDNA and XDNA2 NPUs.
Simple test applications source code with instructions for building and executing it on NPU is included.
rocorofiler
androcTracer
dependencies are now built only once separately instead of building them as a part of these tools own build process.(for example timemory)
Allow configuring the target GPU list without requirement to really have the GPU on the build machine.
(Target GPU can be given as a parameter instead of build system checking what GPUs are present)
GPU specific files generated and build will now be stored for own dedicatd directory for each GPUs.
This fixes inode resource problems breaking the build if too manyGPUs are selected as a build target simultaneously.
Disabled the automatic update mechanism possibly fetching the source code to newer version during the build time.
flash attention support added for MI50/Radeon VII (never AMD GPUs already supported by default)
Known Issues
https://github.com/lamikr/pytorch-gpu-benchmark
MMU events for the kernel when new task is loaded. Real fix seems to require patching the firmware
https://lore.kernel.org/lkml/[email protected]/T/#mb4da09bc8b7c87eb56796631e3d0b08063e73347
gfx1103 (7840U): HW Exception by GPU node-1 #141 (comment)
MMU events for the kernel when new task is loaded. Real fix seems to require patching the firmware
https://lore.kernel.org/lkml/[email protected]/T/#mb4da09bc8b7c87eb56796631e3d0b08063e73347
reguired AMD GPU Windows driver to work with the version build on that way
Plans for the Next Release
Beta Was this translation helpful? Give feedback.
All reactions