Pull requests: ROCm/TransformerEngine

Pull requests list

Add pytest timeout to mitigate JAX tests hang
#395 opened Dec 5, 2025 by ipanfilo Draft
13 tasks
Add instructions to download and install from wheels.
#394 opened Dec 5, 2025 by wenchenvincent
1 of 6 tasks
[WIP] GEMM reference compute offload
#392 opened Dec 4, 2025 by matthiasdiener Draft
13 tasks
Current scaling: two-stage Triton amax kernel
#385 opened Nov 26, 2025 by matthiasdiener
6 of 13 tasks
Enable AOTriton BWD V3 API
#382 opened Nov 25, 2025 by Micky774
13 tasks
Old FP8 support code cleanup
#379 opened Nov 24, 2025 by ipanfilo
1 of 13 tasks
Re-enable supported GEMM configs
#378 opened Nov 24, 2025 by ipanfilo
13 tasks
Layernorm forward optimization
#377 opened Nov 24, 2025 by eliotwang
13 tasks
IFU dev v2.6
#374 opened Nov 19, 2025 by wangye805
9 of 13 tasks
Userbuffer epic
#367 opened Nov 11, 2025 by alextmagro Draft
JAX FA Benchmarking Script
#351 opened Oct 24, 2025 by Micky774
13 tasks
Triton norms dispatch refactor
#305 opened Sep 5, 2025 by Micky774
13 tasks
heyi's layernorm optimization
#225 opened Jul 3, 2025 by eliotwang
8 of 13 tasks
Added Dockerfile for CI images
#195 opened May 28, 2025 by VeeraRajasekhar
7 of 13 tasks
[ROCm] support triton-based flash-attn in TE
#177 opened May 1, 2025 by wangye805
8 of 13 tasks
Update attention example attention.ipynb
#152 opened Mar 19, 2025 by anhminhnguyenhoang
5 of 13 tasks
Honor the NVTE_FUSED_ATTN_<backend> in test_fused_attn.py
#123 opened Feb 11, 2025 by wangye805
13 tasks