[BUG] When ASYNC is enabled GDS needs to handle cudaMalloced bounce buffers #5268

abellina · 2022-04-18T15:54:58Z

Given the ASYNC allocator is enabled, the UCX bounce buffers are allocated directly, bypassing the ASYNC pool (because memory from the async pool can't be mapped for purposes of GPUDirectRDMA).

This PR fixes GDS copies where it assumed the bounce buffer was a DeviceMemoryBuffer when it was really a CudaMemoryBuffer (e.g. straight from cudaMalloc). It also fixes a couple of leaks where .slice was used, but the sliced buffer was not closed in the stack.

…d buffers Signed-off-by: Alessandro Bellina <abellina@nvidia.com>

jlowe · 2022-04-18T16:16:27Z

build

abellina added 2 commits April 18, 2022 10:07

Use BaseDeviceMemoryBuffer in RapidsGdsStore to accomodte cudaMalloce…

ea319aa

…d buffers Signed-off-by: Alessandro Bellina <abellina@nvidia.com>

Fixed sliced leak

8828aae

abellina added the bug Something isn't working label Apr 18, 2022

abellina added this to the Apr 18 - Apr 29 milestone Apr 18, 2022

jlowe approved these changes Apr 18, 2022

View reviewed changes

abellina merged commit 8e353ff into NVIDIA:branch-22.06 Apr 18, 2022

abellina deleted the bug/gds_base_device_memory_buffer branch April 18, 2022 19:35

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG] When ASYNC is enabled GDS needs to handle cudaMalloced bounce buffers #5268

[BUG] When ASYNC is enabled GDS needs to handle cudaMalloced bounce buffers #5268

abellina commented Apr 18, 2022

jlowe commented Apr 18, 2022

[BUG] When ASYNC is enabled GDS needs to handle cudaMalloced bounce buffers #5268

[BUG] When ASYNC is enabled GDS needs to handle cudaMalloced bounce buffers #5268

Conversation

abellina commented Apr 18, 2022

jlowe commented Apr 18, 2022