chore: update all AMDGPU compats #837
Merged
[skip tests]
Benchmark Results (ASV)
Benchmark Plots: A plot of the benchmark results has been uploaded as an artifact to the workflow run for this PR.
Benchmark Results
| Benchmark suite | Current: 38a046b | Previous: b6171a6 | Ratio |
|---|---|---|---|
| Dense(2 => 2)/cpu/reverse/ReverseDiff (compiled)/(2, 128) | 3650.625 ns | 3675.625 ns | 0.99 |
| Dense(2 => 2)/cpu/reverse/Zygote/(2, 128) | 7918.083333333334 ns | 8093.5 ns | 0.98 |
| Dense(2 => 2)/cpu/reverse/Tracker/(2, 128) | 21340 ns | 21210 ns | 1.01 |
| Dense(2 => 2)/cpu/reverse/ReverseDiff/(2, 128) | 9775 ns | 9748.2 ns | 1.00 |
| Dense(2 => 2)/cpu/reverse/Flux/(2, 128) | 9016.8 ns | 9167.2 ns | 0.98 |
| Dense(2 => 2)/cpu/reverse/SimpleChains/(2, 128) | 4402.666666666667 ns | 4470.875 ns | 0.98 |
| Dense(2 => 2)/cpu/reverse/Enzyme/(2, 128) | 4914.125 ns | 4956.875 ns | 0.99 |
| Dense(2 => 2)/cpu/forward/NamedTuple/(2, 128) | 2347.4 ns | 2373.4 ns | 0.99 |
| Dense(2 => 2)/cpu/forward/ComponentArray/(2, 128) | 2373.4 ns | 2270.3 ns | 1.05 |
| Dense(2 => 2)/cpu/forward/Flux/(2, 128) | 1793.5357142857142 ns | 1790.017543859649 ns | 1.00 |
| Dense(2 => 2)/cpu/forward/SimpleChains/(2, 128) | 179.49579831932772 ns | 179.70239774330042 ns | 1.00 |
| Dense(20 => 20)/cpu/reverse/ReverseDiff (compiled)/(20, 128) | 17283 ns | 17562.5 ns | 0.98 |
| Dense(20 => 20)/cpu/reverse/Zygote/(20, 128) | 34264 ns | 24787 ns | 1.38 |
| Dense(20 => 20)/cpu/reverse/Tracker/(20, 128) | 38973 ns | 38393 ns | 1.02 |
| Dense(20 => 20)/cpu/reverse/ReverseDiff/(20, 128) | 29706 ns | 29025 ns | 1.02 |
| Dense(20 => 20)/cpu/reverse/Flux/(20, 128) | 19727 ns | 21590 ns | 0.91 |
| Dense(20 => 20)/cpu/reverse/SimpleChains/(20, 128) | 17287.5 ns | 17092 ns | 1.01 |
| Dense(20 => 20)/cpu/reverse/Enzyme/(20, 128) | 25688 ns | 25648 ns | 1.00 |
| Dense(20 => 20)/cpu/forward/NamedTuple/(20, 128) | 19888 ns | 20248 ns | 0.98 |
| Dense(20 => 20)/cpu/forward/ComponentArray/(20, 128) | 20639 ns | 14448 ns | 1.43 |
| Dense(20 => 20)/cpu/forward/Flux/(20, 128) | 4937.857142857143 ns | 4846.285714285715 ns | 1.02 |
| Dense(20 => 20)/cpu/forward/SimpleChains/(20, 128) | 1662.1 ns | 1659.2 ns | 1.00 |
| Conv((3, 3), 3 => 3)/cpu/reverse/ReverseDiff (compiled)/(64, 64, 3, 128) | 116312647 ns | 77690170 ns | 1.50 |
| Conv((3, 3), 3 => 3)/cpu/reverse/Zygote/(64, 64, 3, 128) | 75943610.5 ns | 76782338 ns | 0.99 |
| Conv((3, 3), 3 => 3)/cpu/reverse/Tracker/(64, 64, 3, 128) | 172264656.5 ns | 155414925 ns | 1.11 |
| Conv((3, 3), 3 => 3)/cpu/reverse/ReverseDiff/(64, 64, 3, 128) | 192675811.5 ns | 167638289.5 ns | 1.15 |
| Conv((3, 3), 3 => 3)/cpu/reverse/Flux/(64, 64, 3, 128) | 179833271 ns | 142842293.5 ns | 1.26 |
| Conv((3, 3), 3 => 3)/cpu/reverse/SimpleChains/(64, 64, 3, 128) | 11909358.5 ns | 11557321.5 ns | 1.03 |
| Conv((3, 3), 3 => 3)/cpu/reverse/Enzyme/(64, 64, 3, 128) | 220741819 ns | 199234044.5 ns | 1.11 |
| Conv((3, 3), 3 => 3)/cpu/forward/NamedTuple/(64, 64, 3, 128) | 15366998.5 ns | 15528408.5 ns | 0.99 |
| Conv((3, 3), 3 => 3)/cpu/forward/ComponentArray/(64, 64, 3, 128) | 15503678.5 ns | 15540189 ns | 1.00 |
| Conv((3, 3), 3 => 3)/cpu/forward/Flux/(64, 64, 3, 128) | 41156139 ns | 30661456 ns | 1.34 |
| Conv((3, 3), 3 => 3)/cpu/forward/SimpleChains/(64, 64, 3, 128) | 6318391.5 ns | 6376663 ns | 0.99 |
| vgg16/cpu/reverse/Zygote/(32, 32, 3, 16) | 1063380917.5 ns | 1064055959.5 ns | 1.00 |
| vgg16/cpu/reverse/Zygote/(32, 32, 3, 64) | 2893008446 ns | 2970205700 ns | 0.97 |
| vgg16/cpu/reverse/Zygote/(32, 32, 3, 2) | 196260977 ns | 178121161 ns | 1.10 |
| vgg16/cpu/reverse/Tracker/(32, 32, 3, 16) | 1395460301 ns | 1320655778 ns | 1.06 |
| vgg16/cpu/reverse/Tracker/(32, 32, 3, 64) | 3625063869 ns | 3516351096 ns | 1.03 |
| vgg16/cpu/reverse/Tracker/(32, 32, 3, 2) | 395182584 ns | 344809509 ns | 1.15 |
| vgg16/cpu/reverse/Flux/(32, 32, 3, 16) | 1585273830 ns | 1431616033 ns | 1.11 |
| vgg16/cpu/reverse/Flux/(32, 32, 3, 64) | 4260488961 ns | 4058579611 ns | 1.05 |
| vgg16/cpu/reverse/Flux/(32, 32, 3, 2) | 452914238 ns | 436008182 ns | 1.04 |
| vgg16/cpu/forward/NamedTuple/(32, 32, 3, 16) | 374779030 ns | 381866129 ns | 0.98 |
| vgg16/cpu/forward/NamedTuple/(32, 32, 3, 64) | 901899645 ns | 905256978 ns | 1.00 |
| vgg16/cpu/forward/NamedTuple/(32, 32, 3, 2) | 55190309 ns | 54567006.5 ns | 1.01 |
| vgg16/cpu/forward/ComponentArray/(32, 32, 3, 16) | 367367750 ns | 382293897 ns | 0.96 |
| vgg16/cpu/forward/ComponentArray/(32, 32, 3, 64) | 899735410.5 ns | 870357323.5 ns | 1.03 |
| vgg16/cpu/forward/ComponentArray/(32, 32, 3, 2) | 55597522 ns | 54472914.5 ns | 1.02 |
| vgg16/cpu/forward/Flux/(32, 32, 3, 16) | 526827152 ns | 551222188 ns | 0.96 |
| vgg16/cpu/forward/Flux/(32, 32, 3, 64) | 1479861454 ns | 1387168504 ns | 1.07 |
| vgg16/cpu/forward/Flux/(32, 32, 3, 2) | 167765083 ns | 164122645 ns | 1.02 |
| Conv((3, 3), 64 => 64)/cpu/reverse/ReverseDiff (compiled)/(64, 64, 64, 128) | 1314817844.5 ns | 1180058919 ns | 1.11 |
| Conv((3, 3), 64 => 64)/cpu/reverse/Zygote/(64, 64, 64, 128) | 1575014115 ns | 1610297742 ns | 0.98 |
| Conv((3, 3), 64 => 64)/cpu/reverse/Tracker/(64, 64, 64, 128) | 2398467933 ns | 2289727615.5 ns | 1.05 |
| Conv((3, 3), 64 => 64)/cpu/reverse/ReverseDiff/(64, 64, 64, 128) | 2427384218 ns | 2640437136 ns | 0.92 |
| Conv((3, 3), 64 => 64)/cpu/reverse/Flux/(64, 64, 64, 128) | 2180036687 ns | 2193753011.5 ns | 0.99 |
| Conv((3, 3), 64 => 64)/cpu/reverse/Enzyme/(64, 64, 64, 128) | 2237899044 ns | 2122924359 ns | 1.05 |
| Conv((3, 3), 64 => 64)/cpu/forward/NamedTuple/(64, 64, 64, 128) | 285419226 ns | 282003619 ns | 1.01 |
| Conv((3, 3), 64 => 64)/cpu/forward/ComponentArray/(64, 64, 64, 128) | 285127178.5 ns | 286261947 ns | 1.00 |
| Conv((3, 3), 64 => 64)/cpu/forward/Flux/(64, 64, 64, 128) | 456543369 ns | 437257287 ns | 1.04 |
| Conv((3, 3), 1 => 1)/cpu/reverse/ReverseDiff (compiled)/(64, 64, 1, 128) | 12020070 ns | 11806435 ns | 1.02 |
| Conv((3, 3), 1 => 1)/cpu/reverse/Zygote/(64, 64, 1, 128) | 34753186 ns | 34527638 ns | 1.01 |
| Conv((3, 3), 1 => 1)/cpu/reverse/Tracker/(64, 64, 1, 128) | 16664217 ns | 16364743 ns | 1.02 |
| Conv((3, 3), 1 => 1)/cpu/reverse/ReverseDiff/(64, 64, 1, 128) | 21264382 ns | 21004093 ns | 1.01 |
| Conv((3, 3), 1 => 1)/cpu/reverse/Flux/(64, 64, 1, 128) | 15351989 ns | 15284140 ns | 1.00 |
| Conv((3, 3), 1 => 1)/cpu/reverse/SimpleChains/(64, 64, 1, 128) | 1193025 ns | 1148921.5 ns | 1.04 |
| Conv((3, 3), 1 => 1)/cpu/reverse/Enzyme/(64, 64, 1, 128) | 35851437.5 ns | 35777843.5 ns | 1.00 |
| Conv((3, 3), 1 => 1)/cpu/forward/NamedTuple/(64, 64, 1, 128) | 4515987 ns | 4500694 ns | 1.00 |
| Conv((3, 3), 1 => 1)/cpu/forward/ComponentArray/(64, 64, 1, 128) | 4589579 ns | 4506207 ns | 1.02 |
| Conv((3, 3), 1 => 1)/cpu/forward/Flux/(64, 64, 1, 128) | 2002427.5 ns | 2045686 ns | 0.98 |
| Conv((3, 3), 1 => 1)/cpu/forward/SimpleChains/(64, 64, 1, 128) | 244667 ns | 196300 ns | 1.25 |
| Dense(200 => 200)/cpu/reverse/ReverseDiff (compiled)/(200, 128) | 381681 ns | 378068 ns | 1.01 |
| Dense(200 => 200)/cpu/reverse/Zygote/(200, 128) | 304498 ns | 314462 ns | 0.97 |
| Dense(200 => 200)/cpu/reverse/Tracker/(200, 128) | 388380 ns | 377972 ns | 1.03 |
| Dense(200 => 200)/cpu/reverse/ReverseDiff/(200, 128) | 517516 ns | 520691 ns | 0.99 |
| Dense(200 => 200)/cpu/reverse/Flux/(200, 128) | 294910 ns | 289716 ns | 1.02 |
| Dense(200 => 200)/cpu/reverse/SimpleChains/(200, 128) | 409033 ns | 401777 ns | 1.02 |
| Dense(200 => 200)/cpu/reverse/Enzyme/(200, 128) | 430443 ns | 425321 ns | 1.01 |
| Dense(200 => 200)/cpu/forward/NamedTuple/(200, 128) | 164637 ns | 157406 ns | 1.05 |
| Dense(200 => 200)/cpu/forward/ComponentArray/(200, 128) | 155045 ns | 162456 ns | 0.95 |
| Dense(200 => 200)/cpu/forward/Flux/(200, 128) | 93285 ns | 91953 ns | 1.01 |
| Dense(200 => 200)/cpu/forward/SimpleChains/(200, 128) | 104535 ns | 104407 ns | 1.00 |
| Conv((3, 3), 16 => 16)/cpu/reverse/ReverseDiff (compiled)/(64, 64, 16, 128) | 312422589 ns | 297649242 ns | 1.05 |
| Conv((3, 3), 16 => 16)/cpu/reverse/Zygote/(64, 64, 16, 128) | 290463113 ns | 287837994 ns | 1.01 |
| Conv((3, 3), 16 => 16)/cpu/reverse/Tracker/(64, 64, 16, 128) | 571203973 ns | 545531151.5 ns | 1.05 |
| Conv((3, 3), 16 => 16)/cpu/reverse/ReverseDiff/(64, 64, 16, 128) | 683798344 ns | 655809148 ns | 1.04 |
| Conv((3, 3), 16 => 16)/cpu/reverse/Flux/(64, 64, 16, 128) | 590381756 ns | 554893727 ns | 1.06 |
| Conv((3, 3), 16 => 16)/cpu/reverse/SimpleChains/(64, 64, 16, 128) | 339651259 ns | 316084028.5 ns | 1.07 |
| Conv((3, 3), 16 => 16)/cpu/reverse/Enzyme/(64, 64, 16, 128) | 615147910.5 ns | 583442251.5 ns | 1.05 |
| Conv((3, 3), 16 => 16)/cpu/forward/NamedTuple/(64, 64, 16, 128) | 40167611.5 ns | 40159465 ns | 1.00 |
| Conv((3, 3), 16 => 16)/cpu/forward/ComponentArray/(64, 64, 16, 128) | 39800524 ns | 40173961.5 ns | 0.99 |
| Conv((3, 3), 16 => 16)/cpu/forward/Flux/(64, 64, 16, 128) | 107566981.5 ns | 96663497 ns | 1.11 |
| Conv((3, 3), 16 => 16)/cpu/forward/SimpleChains/(64, 64, 16, 128) | 29363022 ns | 28321531 ns | 1.04 |
| Dense(2000 => 2000)/cpu/reverse/ReverseDiff (compiled)/(2000, 128) | 21121686 ns | 21078472 ns | 1.00 |
| Dense(2000 => 2000)/cpu/reverse/Zygote/(2000, 128) | 17433210 ns | 17393481 ns | 1.00 |
| Dense(2000 => 2000)/cpu/reverse/Tracker/(2000, 128) | 22824955.5 ns | 22657728 ns | 1.01 |
| Dense(2000 => 2000)/cpu/reverse/ReverseDiff/(2000, 128) | 26452674 ns | 28019412 ns | 0.94 |
| Dense(2000 => 2000)/cpu/reverse/Flux/(2000, 128) | 19392595 ns | 19298592.5 ns | 1.00 |
| Dense(2000 => 2000)/cpu/reverse/Enzyme/(2000, 128) | 20920397.5 ns | 20720819 ns | 1.01 |
| Dense(2000 => 2000)/cpu/forward/NamedTuple/(2000, 128) | 6135208.5 ns | 6086608 ns | 1.01 |
| Dense(2000 => 2000)/cpu/forward/ComponentArray/(2000, 128) | 6112049 ns | 6101998 ns | 1.00 |
| Dense(2000 => 2000)/cpu/forward/Flux/(2000, 128) | 6475517 ns | 6509879.5 ns | 0.99 |
This comment was automatically generated by workflow using github-action-benchmark.
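
For readers skimming the table: the Ratio column appears to be Current divided by Previous (for example, 34264 / 24787 ≈ 1.38), so values above 1 mean the current commit was slower. The following is a minimal Python sketch of that arithmetic, not the implementation used by github-action-benchmark. The `Row` type, the `flag_regressions` helper, and the 1.25 threshold are hypothetical names and values chosen for illustration; the timings are copied from three rows of the table.

```python
from typing import List, NamedTuple, Tuple


class Row(NamedTuple):
    """One benchmark row: name plus timings in nanoseconds (hypothetical type)."""
    name: str
    current_ns: float
    previous_ns: float


def ratio(row: Row) -> float:
    # Ratio as it appears in the table: current time over previous time.
    return row.current_ns / row.previous_ns


def flag_regressions(rows: List[Row], threshold: float = 1.25) -> List[Tuple[str, float]]:
    # Keep rows whose ratio exceeds the threshold; 1.25 is an assumed cutoff,
    # not a value taken from the workflow configuration.
    return [(r.name, round(ratio(r), 2)) for r in rows if ratio(r) > threshold]


# Timings copied from three rows of the table above.
rows = [
    Row("Dense(20 => 20)/cpu/reverse/Zygote/(20, 128)", 34264, 24787),
    Row("Dense(20 => 20)/cpu/forward/ComponentArray/(20, 128)", 20639, 14448),
    Row("Dense(2 => 2)/cpu/forward/Flux/(2, 128)", 1793.5357142857142, 1790.017543859649),
]

for name, r in flag_regressions(rows):
    print(f"{name}: {r:.2f}x slower than previous")
```

Under this assumed threshold, only the first two rows are reported. Many of the larger ratios in the table come from small or noisy workloads, so a single run like this one is weak evidence of a real regression.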