-
-
Notifications
You must be signed in to change notification settings - Fork 212
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Flux & Zygote's AD slower than ForwardDiff #994
Comments
I have been encountering poor performance as well. However, I cannot reproduce results that extreme. You should use BenchmarkTools and
Results:
System information: Ubuntu 18.04
|
I think this is mainly because of tracing the for loop is a bit heavy for reverse mode, since we need to store each |
This looks to be strictly a Zygote thing and could probably moved there (or closed, if we think it's an inherent design limitation) instead of Flux. |
Indeed. This ought to be sped up by #962, and #981. Some times:
(Julia 1.6, M1 mac + rosetta). Bigger version:
|
I found Zygote's recent advancement in AD and tried to benchmark it and found the following:
where the function
rosenbrock
is taken from here [Edit: now here]and
x = rand(10000);
.Three functions have been run multiple times for julia's JIT compilation.
I wonder what could be the reason for that?
The text was updated successfully, but these errors were encountered: