Description
HIP_FUNC_ATTRIBUTE_LOCAL_SIZE_BYTES | The local memory usage of each thread by this function in bytes.
HIP_FUNC_ATTRIBUTE_NUM_REGS | The number of registers used by each thread of this function.
If the local size in bytes is the memory usage of each thread, should the following case be moved to "urKernelGetInfo"?
    case UR_KERNEL_GROUP_INFO_PRIVATE_MEM_SIZE: {
      // OpenCL PRIVATE == CUDA LOCAL
      int Bytes = 0;
      UR_CHECK_ERROR(cuFuncGetAttribute(
          &Bytes, CU_FUNC_ATTRIBUTE_LOCAL_SIZE_BYTES, hKernel->get()));
      return ReturnValue(uint64_t(Bytes));
    }
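
For reference, here is a minimal sketch of reading the same two per-thread attributes (local size in bytes and register count) through the CUDA runtime API instead of the driver call quoted above. `scratchKernel` is a hypothetical kernel that exists only so there is a function to query. Whether its array is actually placed in local memory is up to the compiler, so no particular output is assumed.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel, present only to be queried (it is never launched).
// A per-thread array indexed at runtime may be placed in CUDA local memory
// (OpenCL "private" memory), but that decision belongs to the compiler.
__global__ void scratchKernel(float *out, int idx) {
  float scratch[256];
  for (int i = 0; i < 256; ++i)
    scratch[i] = static_cast<float>(i + idx);
  out[threadIdx.x] = scratch[idx & 255];
}

int main() {
  cudaFuncAttributes attr{};
  // Runtime-API counterpart of cuFuncGetAttribute: fills in all attributes
  // of the given kernel function at once.
  cudaError_t err = cudaFuncGetAttributes(&attr, scratchKernel);
  if (err != cudaSuccess) {
    std::fprintf(stderr, "cudaFuncGetAttributes failed: %s\n",
                 cudaGetErrorString(err));
    return 1;
  }
  // localSizeBytes: local memory used by each thread of this function.
  std::printf("local bytes per thread: %zu\n", attr.localSizeBytes);
  // numRegs: registers used by each thread of this function.
  std::printf("registers per thread:   %d\n", attr.numRegs);
  return 0;
}
```

Both values are reported per function, and the query takes no work-group size as input.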