-
Notifications
You must be signed in to change notification settings - Fork 669
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
memory leak on dataset iteration #2289
Comments
To check I took your latest #2232
and ran the code to reproduce the bug against
but I still see the same bug behaviour
|
It is the same field of problem. The impact of the bug behaviour from #2210? reproduced by
is eliminated by the proposed solution #2273. (I am not using the word solved here as the suggested solution cleans the garbage, but in the ideal solution there would not be garbage). However, the behaviour reported here even shows after applying the patch #2273. |
@enpasos I think I find the possible root cause. Basically, the The advanced indexing has memory leak issue, which is now fixed in #2300. So this is the possible root cause. You can apply this patch, then the memory leak issue is expected to be fixed. |
Concrats for eliminating the root cause for this memory leak! Very nice :-) |
Description
On running the FashionMnist example from DJL Docs I experience a GPU memory leak of about 503 Bytes on each dataset iteration.
illustrates the memory grows on GPU per epoch.
I see this increase even if the batch iteration is reduced to just the iteration without doing something else.
I experience this loss without and with the suggested fix #2273 to clean up orphaned NDArrays.
Expected Behavior
No memory leak.
How to Reproduce?
I set up a toy app based on djl fashion mnist to reproduce the problem I experience:
To further localize the cause:
What have you tried to solve it?
Looking for the cause. Did not find it yet.
Environment Info
The text was updated successfully, but these errors were encountered: