You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
using Zygote
using CUDA
f(A, I) =sum(A[I])
A =rand(4)
I = [1, 3, 1]
# CPU - everything is OK
Zygote.gradient(f, A, I)
# ==> ([2.0, 0.0, 1.0, 0.0], nothing)# GPU - dA[1] is incorrect
Zygote.gradient(f, cu(A), cu(I))
# => (Float32[1.0, 0.0, 1.0, 0.0], nothing)
I believe CPU version comes from ChainRules.jl which correctly adds several derivatives to dA[1], but I'm not sure what code is used for CUDA version.
Here is how I came to this issue and how I try to resolve it in Yota.
The text was updated successfully, but these errors were encountered:
My brief reading of the linked CUDA issue suggests this can't be fixed? Could we add an error here? Just spent a bunch of time discovering subtly wrong results in my code / reducing to a MWE / eventually finding this already-filed issue. 😢
I believe CPU version comes from ChainRules.jl which correctly adds several derivatives to
dA[1]
, but I'm not sure what code is used for CUDA version.Here is how I came to this issue and how I try to resolve it in Yota.
The text was updated successfully, but these errors were encountered: