[HIP][CUDA] local size bytes #2634

Open
@jinz2014

Description

HIP_FUNC_ATTRIBUTE_LOCAL_SIZE_BYTES | The local memory usage of each thread by this function in bytes.
HIP_FUNC_ATTRIBUTE_NUM_REGS | The number of registers used by each thread of this function.

Given that the local size in bytes is the memory usage of each thread, should the following case be moved to "urKernelGetInfo"?

  case UR_KERNEL_GROUP_INFO_PRIVATE_MEM_SIZE: {
    // OpenCL PRIVATE == CUDA LOCAL
    // The attribute is reported per thread, in bytes.
    int Bytes = 0;
    UR_CHECK_ERROR(cuFuncGetAttribute(
        &Bytes, CU_FUNC_ATTRIBUTE_LOCAL_SIZE_BYTES, hKernel->get()));
    return ReturnValue(uint64_t(Bytes));
  }
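
To make the per-thread point concrete, here is a minimal, self-contained C++ sketch. It does not use the CUDA or HIP driver API: `FakeKernel`, `FakeAttr`, `fakeFuncGetAttribute`, `privateMemSizePerThread`, and `privateMemSizePerGroup` are all hypothetical names that stand in for the adapter code. It only illustrates that the queried value is a per-thread byte count, so any work-group figure would have to be derived from it and the group size.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-ins for the driver attribute enum and kernel handle.
enum FakeAttr { FAKE_LOCAL_SIZE_BYTES, FAKE_NUM_REGS };

struct FakeKernel {
  int LocalSizeBytes; // per-thread local (OpenCL "private") memory, in bytes
  int NumRegs;        // per-thread register count
};

// Mimics the shape of cuFuncGetAttribute: writes the value, returns 0 on success.
int fakeFuncGetAttribute(int *Value, FakeAttr Attr, const FakeKernel &K) {
  switch (Attr) {
  case FAKE_LOCAL_SIZE_BYTES:
    *Value = K.LocalSizeBytes;
    return 0;
  case FAKE_NUM_REGS:
    *Value = K.NumRegs;
    return 0;
  }
  return 1; // unknown attribute
}

// Per-thread private memory size, widened to 64 bits like the case above.
uint64_t privateMemSizePerThread(const FakeKernel &K) {
  int Bytes = 0;
  if (fakeFuncGetAttribute(&Bytes, FAKE_LOCAL_SIZE_BYTES, K) != 0)
    return 0;
  return static_cast<uint64_t>(Bytes);
}

// Assumption for illustration only: a group-level figure computed as the
// per-thread size multiplied by the number of threads in the group.
uint64_t privateMemSizePerGroup(const FakeKernel &K, uint64_t GroupSize) {
  return privateMemSizePerThread(K) * GroupSize;
}
```

In this sketch the per-thread result depends only on the kernel, while the group-level result also needs a work-group size, which is the distinction the question turns on.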

Metadata

Labels

cuda (CUDA adapter specific issues), hip (HIP adapter specific issues), specification (Changes or additions to the specification)
