CUpti_ActivityKernel2 Struct Reference
[CUPTI Activity API]

The activity record for a kernel (deprecated).


Data Fields

int32_t blockX
int32_t blockY
int32_t blockZ
uint64_t completed
uint32_t contextId
uint32_t correlationId
uint32_t deviceId
int32_t dynamicSharedMemory
uint64_t end
int64_t gridId
int32_t gridX
int32_t gridY
int32_t gridZ
CUpti_ActivityKind kind
uint32_t localMemoryPerThread
uint32_t localMemoryTotal
const char * name
uint16_t registersPerThread
void * reserved0
uint8_t sharedMemoryConfig
uint64_t start
int32_t staticSharedMemory
uint32_t streamId
uint8_t executed:4
uint8_t requested:4


Detailed Description

This activity record represents a kernel execution (CUPTI_ACTIVITY_KIND_KERNEL and CUPTI_ACTIVITY_KIND_CONCURRENT_KERNEL) but is no longer generated by CUPTI. Kernel activities are now reported using the CUpti_ActivityKernel6 activity record.

Field Documentation

int32_t CUpti_ActivityKernel2::blockX

The X-dimension block size for the kernel.

int32_t CUpti_ActivityKernel2::blockY

The Y-dimension block size for the kernel.

int32_t CUpti_ActivityKernel2::blockZ

The Z-dimension block size for the kernel.

uint64_t CUpti_ActivityKernel2::completed

The completed timestamp for the kernel execution, in ns. It represents the completion of all its child kernels and of the kernel itself. A value of CUPTI_TIMESTAMP_UNKNOWN indicates that the completion time is unknown.

uint32_t CUpti_ActivityKernel2::contextId

The ID of the context where the kernel is executing.

uint32_t CUpti_ActivityKernel2::correlationId

The correlation ID of the kernel. Each kernel execution is assigned a unique correlation ID that is identical to the correlation ID in the driver or runtime API activity record that launched the kernel.
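Because correlationId matches the ID in the launching API record, a kernel record can be paired with its launch call by a simple lookup. The following is a minimal sketch of that pairing, not CUPTI code: ApiRecordSketch and find_launch_api are hypothetical names, and the record type is a trimmed-down stand-in rather than the real CUPTI API activity record.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical, trimmed-down stand-in for a driver/runtime API activity
 * record. Only the field needed for the lookup is modeled. */
typedef struct {
    uint32_t correlationId;
    const char *apiName;   /* illustrative label, not a CUPTI field */
} ApiRecordSketch;

/* Return the API record whose correlation ID equals the kernel's
 * correlationId, or NULL when no such record was collected. */
static const ApiRecordSketch *find_launch_api(const ApiRecordSketch *records,
                                              size_t count,
                                              uint32_t kernelCorrelationId)
{
    assert(records != NULL || count == 0);
    for (size_t i = 0; i < count; ++i) {
        if (records[i].correlationId == kernelCorrelationId) {
            return &records[i];
        }
    }
    return NULL;
}
```

A linear scan keeps the sketch short; a tool that processes many records would more likely index the API records by correlation ID first.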

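uint32_t CUpti_ActivityKernel2::deviceId

The ID of the device where the kernel is executing.

int32_t CUpti_ActivityKernel2::dynamicSharedMemory

The dynamic shared memory reserved for the kernel, in bytes.

uint64_t CUpti_ActivityKernel2::end

The end timestamp for the kernel execution, in ns. A value of 0 for both the start and end timestamps indicates that timestamp information could not be collected for the kernel.

uint8_t CUpti_ActivityKernel2::executed

The cache configuration used for the kernel. The value is one of the CUfunc_cache enumeration values from cuda.h.

int64_t CUpti_ActivityKernel2::gridId

The grid ID of the kernel. Each kernel is assigned a unique grid ID at runtime.

int32_t CUpti_ActivityKernel2::gridX

The X-dimension grid size for the kernel.

int32_t CUpti_ActivityKernel2::gridY

The Y-dimension grid size for the kernel.

int32_t CUpti_ActivityKernel2::gridZ

The Z-dimension grid size for the kernel.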
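The grid and block dimension fields together describe the launch geometry. As an illustration, the sketch below derives the block count and total thread count from them. LaunchDimsSketch and the helper names are hypothetical; the struct only mirrors the six dimension fields and is not the CUPTI record itself.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical mirror of the six dimension fields of the record. */
typedef struct {
    int32_t gridX, gridY, gridZ;
    int32_t blockX, blockY, blockZ;
} LaunchDimsSketch;

/* Number of thread blocks in the grid. Dimensions are assumed to be
 * non-negative; 64-bit arithmetic avoids overflow in the products. */
static uint64_t total_blocks(const LaunchDimsSketch *d)
{
    assert(d != NULL);
    assert(d->gridX >= 0 && d->gridY >= 0 && d->gridZ >= 0);
    return (uint64_t)d->gridX * (uint64_t)d->gridY * (uint64_t)d->gridZ;
}

/* Number of threads in one block. */
static uint64_t threads_per_block(const LaunchDimsSketch *d)
{
    assert(d != NULL);
    assert(d->blockX >= 0 && d->blockY >= 0 && d->blockZ >= 0);
    return (uint64_t)d->blockX * (uint64_t)d->blockY * (uint64_t)d->blockZ;
}

/* Total threads launched: blocks in the grid times threads per block. */
static uint64_t total_threads(const LaunchDimsSketch *d)
{
    return total_blocks(d) * threads_per_block(d);
}
```

For example, a 4 x 2 x 1 grid of 128 x 1 x 1 blocks gives 8 blocks and 1024 threads.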

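CUpti_ActivityKind CUpti_ActivityKernel2::kind

The activity record kind; must be CUPTI_ACTIVITY_KIND_KERNEL or CUPTI_ACTIVITY_KIND_CONCURRENT_KERNEL.

uint32_t CUpti_ActivityKernel2::localMemoryPerThread

The amount of local memory reserved for each thread, in bytes.

uint32_t CUpti_ActivityKernel2::localMemoryTotal

The total amount of local memory reserved for the kernel, in bytes.

const char * CUpti_ActivityKernel2::name

The name of the kernel. This name is shared across all activity records representing the same kernel, and so should not be modified.

uint16_t CUpti_ActivityKernel2::registersPerThread

The number of registers required for each thread executing the kernel.

uint8_t CUpti_ActivityKernel2::requested

The cache configuration requested by the kernel. The value is one of the CUfunc_cache enumeration values from cuda.h.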
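Since requested and executed both hold CUfunc_cache values, comparing them shows whether the kernel ran with the cache configuration it asked for. The sketch below is one way to do that. The enum is a local stand-in that mirrors the CUfunc_cache values from cuda.h so the example compiles without CUDA headers, and the helper names are hypothetical.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Local stand-in mirroring the CUfunc_cache values in cuda.h. */
enum {
    SKETCH_CACHE_PREFER_NONE   = 0,
    SKETCH_CACHE_PREFER_SHARED = 1,
    SKETCH_CACHE_PREFER_L1     = 2,
    SKETCH_CACHE_PREFER_EQUAL  = 3
};

/* Map a 4-bit cache configuration value to the corresponding
 * CUfunc_cache enumerator name, or "unknown" for other values. */
static const char *cache_config_name(uint8_t value)
{
    switch (value) {
    case SKETCH_CACHE_PREFER_NONE:   return "CU_FUNC_CACHE_PREFER_NONE";
    case SKETCH_CACHE_PREFER_SHARED: return "CU_FUNC_CACHE_PREFER_SHARED";
    case SKETCH_CACHE_PREFER_L1:     return "CU_FUNC_CACHE_PREFER_L1";
    case SKETCH_CACHE_PREFER_EQUAL:  return "CU_FUNC_CACHE_PREFER_EQUAL";
    default:                         return "unknown";
    }
}

/* Return 1 when the configuration used differs from the one requested. */
static int cache_config_differs(uint8_t requested, uint8_t executed)
{
    return requested != executed;
}
```

In real code the two arguments would come from the requested and executed bit-fields of the record.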

void * CUpti_ActivityKernel2::reserved0

Undefined. Reserved for internal use.

uint8_t CUpti_ActivityKernel2::sharedMemoryConfig

The shared memory configuration used for the kernel. The value is one of the CUsharedconfig enumeration values from cuda.h.

uint64_t CUpti_ActivityKernel2::start

The start timestamp for the kernel execution, in ns. A value of 0 for both the start and end timestamps indicates that timestamp information could not be collected for the kernel.
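The start, end, and completed timestamps can be combined into durations, provided the "not collected" and "unknown" sentinels are checked first. The following sketch shows one way to do this. The helper names are hypothetical, and SKETCH_TIMESTAMP_UNKNOWN is a stand-in for CUPTI_TIMESTAMP_UNKNOWN whose value is assumed to be 0.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Stand-in for CUPTI_TIMESTAMP_UNKNOWN from cupti_activity.h.
 * Its value is assumed to be 0 in this sketch. */
#define SKETCH_TIMESTAMP_UNKNOWN 0ULL

/* Kernel execution time in ns (end - start). Returns 1 and writes *out
 * on success; returns 0 when both timestamps are 0 (not collected) or
 * the pair is inconsistent. */
static int kernel_duration_ns(uint64_t start, uint64_t end, uint64_t *out)
{
    assert(out != NULL);
    if ((start == 0 && end == 0) || end < start) {
        return 0;
    }
    *out = end - start;
    return 1;
}

/* Time in ns between the kernel's own end and the completion of all of
 * its child kernels (completed - end). Returns 0 when the completion
 * time is unknown or precedes end. */
static int child_tail_ns(uint64_t end, uint64_t completed, uint64_t *out)
{
    assert(out != NULL);
    if (completed == SKETCH_TIMESTAMP_UNKNOWN || completed < end) {
        return 0;
    }
    *out = completed - end;
    return 1;
}
```

Returning a status flag instead of a sentinel duration keeps a legitimate zero-length result distinguishable from missing data.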

int32_t CUpti_ActivityKernel2::staticSharedMemory

The static shared memory allocated for the kernel, in bytes.
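The staticSharedMemory and dynamicSharedMemory fields are reported separately, so a tool that wants the combined shared memory figure has to add them. A minimal sketch, with a hypothetical helper name and the assumption that both values are non-negative byte counts from the same record:

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Sum of the static and dynamic shared memory figures, in bytes.
 * A 64-bit result avoids overflow when adding two int32_t values. */
static int64_t total_shared_memory_bytes(int32_t staticBytes,
                                         int32_t dynamicBytes)
{
    assert(staticBytes >= 0 && dynamicBytes >= 0);
    return (int64_t)staticBytes + (int64_t)dynamicBytes;
}
```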

uint32_t CUpti_ActivityKernel2::streamId

The ID of the stream where the kernel is executing.


Generated on Sun Feb 14 21:14:24 2021 for Cupti by  doxygen 1.5.8