Track GPU memory consumption explicitly #137

khuck · 2021-03-01T00:48:12Z

Currently, APEX tracks the cudaMalloc amounts, but doesn't track the total amount allocated. It relies on the periodic sampling of NVML counters, which can create blind spots. To avoid these blind spots, we should optionally track actually cudaMalloc and cudaFree locations and amounts. Each cudaMalloc call will increment an atomic counter of allocated memory bytes and insert into a map with the key as the address and the value the size. Then the cudaFree calls will use the address to look up the allocated size and decrement the atomic counter. This will be an optional feature, to avoid perturbation from contention for the map and the counter. Each malloc and free will result in an event to the OTF2 trace.

Now explicitly tracking all memory allocations and frees on both the host and the device.

khuck · 2021-03-12T00:29:53Z

Fixed with 7e37b10

khuck added the enhancement label Mar 1, 2021

khuck added a commit that referenced this issue Mar 12, 2021

Fixing #137.

7e37b10

Now explicitly tracking all memory allocations and frees on both the host and the device.

khuck closed this as completed Mar 12, 2021

khuck mentioned this issue Mar 12, 2021

Add memory tracking support for general purpose CPU allocations #139

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Track GPU memory consumption explicitly #137

Track GPU memory consumption explicitly #137

khuck commented Mar 1, 2021

khuck commented Mar 12, 2021

Track GPU memory consumption explicitly #137

Track GPU memory consumption explicitly #137

Comments

khuck commented Mar 1, 2021

khuck commented Mar 12, 2021