Skip to content

Pull requests: NVIDIA/Fuser

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fix TensorDomain::setContiguity
#5125 opened Sep 5, 2025 by zasdfgbnm Loading…
Add a repro
#5124 opened Sep 5, 2025 by wujingyue Draft
Documentation for multi-GPU support
#5123 opened Sep 5, 2025 by wujingyue Loading…
enable codegen for layout op
#5118 opened Sep 4, 2025 by jjsjann123 Loading…
Creating a CuTe TV Layout in NvFuser Cutlass Matmuls
#5117 opened Sep 4, 2025 by rdspring1 Loading…
3 tasks
add layout op runtime function
#5115 opened Sep 4, 2025 by jjsjann123 Loading…
Add layout op
#5114 opened Sep 4, 2025 by jjsjann123 Loading…
max.fp16
#5111 opened Sep 4, 2025 by liqiangxl Draft
P2PCommunication Benchmark over Cuda IPC
#5102 opened Sep 2, 2025 by nsarka Loading…
Create grouped_mm for bf16 and fp16 inputs on Blackwell Cutlass Direct Bindings Python extension with direct mapping to NvFuser CPP objects.
#5101 opened Sep 2, 2025 by rdspring1 Loading…
Replay allocation in rfactor
#5090 opened Aug 29, 2025 by Priya2698 Loading…
Llu/cluster reduction auto
#5075 opened Aug 26, 2025 by liqiangxl Loading…
remove revertUseOfInputCache
#5056 opened Aug 22, 2025 by liqiangxl Draft
ProTip! Follow long discussions with comments:>50.