You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copies data from the specified rank's memory into the destination rank's memory.
440
+
This function performs the transfer by translating src_ptr from the from_rank's address
441
+
space to the to_rank's address space, performing a masked load from the translated
442
+
source, and storing the loaded data to dst_ptr in the to_rank memory location.
443
+
If from_rank and to_rank are the same, this function performs a local copy operation.
442
444
445
+
Args:
446
+
src_ptr (triton.PointerType, or block of dtype=triton.PointerType): Pointer in the from_rank's local memory from which to read data.
447
+
dst_ptr (triton.PointerType, or block of dtype=triton.PointerType): Pointer in the to_rank's local memory where the data will be written.
448
+
from_rank (int): The rank ID that owns src_ptr (source rank).
449
+
to_rank (int): The rank ID that will receive the data (destination rank).
450
+
heap_bases (triton.PointerType): Array containing the heap base addresses for all ranks.
451
+
mask (Block of triton.int1, optional): If mask[idx] is false, do not load from the translated src_ptr[idx] and do not store to dst_ptr[idx]. Defaults to None.
0 commit comments