We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent aad3f09 commit 1e97e68Copy full SHA for 1e97e68
README.md
@@ -4,7 +4,7 @@ This fork is specifically optimized for AMD GFX906 architecture (MI50, MI60, Veg
4
5
---
6
7
-## Key Features of b6615 - forked
+## Key Features of b6628 - forked
8
9
- **Replaced bpermute instructions with swizzle** (AMD native warp reductions, main contribution)
10
0 commit comments