v0.9.0
New features
Initial ROCm support
This release adds the rocm device type. For instance to register a kernel that supports both CUDA and ROCm, you can use:
kernel_layer_mapping = {
"SiluAndMul": {
"cuda": LayerRepository(
repo_id="kernels-community/activation",
layer_name="SiluAndMul",
),
"rocm": LayerRepository(
repo_id="kernels-community/activation",
layer_name="SiluAndMul",
)
}
}
register_kernel_mapping(kernel_layer_mapping)Support for loading local kernel layers
For development and debugging it can often be useful to load kernel layers from a local directory. This is supported by the new LocalLayerRepository class. You can directly use the output of kernel-builder. For example:
kernel_layer_mapping = {
"SiluAndMul": {
"cuda": LocalLayerRepository(
repo_path="/home/daniel/kernels/activation",
package_name="activation",
layer_name="SiluAndMul",
)
}
}
register_kernel_mapping(kernel_layer_mapping)What's Changed
- Fix typo in layers documentation by @shadeMe in #116
- Update documentation for compatibility with doc-builder by @danieldk in #117
- Test examples in docstrings using mktestdocs by @danieldk in #118
- Add doc build to CI by @danieldk in #119
- Log when using fallback layer by @danieldk in #121
- Add
LocalLayerRepositoryto load from a local repo by @danieldk in #123 - Run black check by @danieldk in #124
- Nix: go back to hf-nix main by @danieldk in #125
- Add ROCm device discovery by @ahadnagy in #122
- Set version to 0.9.0.dev0 by @danieldk in #126
New Contributors
Full Changelog: v0.8.1...v0.9.0