
chore: update all AMDGPU compats #837

Merged: 1 commit into main from ap/compat on Aug 10, 2024

Conversation

avik-pal
Member

No description provided.

@avik-pal avik-pal merged commit 2799345 into main Aug 10, 2024
55 of 67 checks passed
@avik-pal avik-pal deleted the ap/compat branch August 10, 2024 03:47
Contributor

Benchmark Results (ASV)

| | main | 38a046b... | main/38a046b9cd6843... |
|---|---|---|---|
| basics/overhead | 0.2 ± 0.0044 μs | 0.2 ± 0.004 μs | 0.998 |
| time_to_load | 1.01 ± 0.0091 s | 1.01 ± 0.0074 s | 1.01 |

Benchmark Plots

A plot of the benchmark results has been uploaded as an artifact to the workflow run for this PR.
Go to "Actions"->"Benchmark a pull request"->[the most recent run]->"Artifacts" (at the bottom).

Contributor

@github-actions github-actions bot left a comment

Benchmark Results

| Benchmark suite | Current: 38a046b | Previous: b6171a6 | Ratio |
|---|---|---|---|
| Dense(2 => 2)/cpu/reverse/ReverseDiff (compiled)/(2, 128) | 3650.625 ns | 3675.625 ns | 0.99 |
| Dense(2 => 2)/cpu/reverse/Zygote/(2, 128) | 7918.083333333334 ns | 8093.5 ns | 0.98 |
| Dense(2 => 2)/cpu/reverse/Tracker/(2, 128) | 21340 ns | 21210 ns | 1.01 |
| Dense(2 => 2)/cpu/reverse/ReverseDiff/(2, 128) | 9775 ns | 9748.2 ns | 1.00 |
| Dense(2 => 2)/cpu/reverse/Flux/(2, 128) | 9016.8 ns | 9167.2 ns | 0.98 |
| Dense(2 => 2)/cpu/reverse/SimpleChains/(2, 128) | 4402.666666666667 ns | 4470.875 ns | 0.98 |
| Dense(2 => 2)/cpu/reverse/Enzyme/(2, 128) | 4914.125 ns | 4956.875 ns | 0.99 |
| Dense(2 => 2)/cpu/forward/NamedTuple/(2, 128) | 2347.4 ns | 2373.4 ns | 0.99 |
| Dense(2 => 2)/cpu/forward/ComponentArray/(2, 128) | 2373.4 ns | 2270.3 ns | 1.05 |
| Dense(2 => 2)/cpu/forward/Flux/(2, 128) | 1793.5357142857142 ns | 1790.017543859649 ns | 1.00 |
| Dense(2 => 2)/cpu/forward/SimpleChains/(2, 128) | 179.49579831932772 ns | 179.70239774330042 ns | 1.00 |
| Dense(20 => 20)/cpu/reverse/ReverseDiff (compiled)/(20, 128) | 17283 ns | 17562.5 ns | 0.98 |
| Dense(20 => 20)/cpu/reverse/Zygote/(20, 128) | 34264 ns | 24787 ns | 1.38 |
| Dense(20 => 20)/cpu/reverse/Tracker/(20, 128) | 38973 ns | 38393 ns | 1.02 |
| Dense(20 => 20)/cpu/reverse/ReverseDiff/(20, 128) | 29706 ns | 29025 ns | 1.02 |
| Dense(20 => 20)/cpu/reverse/Flux/(20, 128) | 19727 ns | 21590 ns | 0.91 |
| Dense(20 => 20)/cpu/reverse/SimpleChains/(20, 128) | 17287.5 ns | 17092 ns | 1.01 |
| Dense(20 => 20)/cpu/reverse/Enzyme/(20, 128) | 25688 ns | 25648 ns | 1.00 |
| Dense(20 => 20)/cpu/forward/NamedTuple/(20, 128) | 19888 ns | 20248 ns | 0.98 |
| Dense(20 => 20)/cpu/forward/ComponentArray/(20, 128) | 20639 ns | 14448 ns | 1.43 |
| Dense(20 => 20)/cpu/forward/Flux/(20, 128) | 4937.857142857143 ns | 4846.285714285715 ns | 1.02 |
| Dense(20 => 20)/cpu/forward/SimpleChains/(20, 128) | 1662.1 ns | 1659.2 ns | 1.00 |
| Conv((3, 3), 3 => 3)/cpu/reverse/ReverseDiff (compiled)/(64, 64, 3, 128) | 116312647 ns | 77690170 ns | 1.50 |
| Conv((3, 3), 3 => 3)/cpu/reverse/Zygote/(64, 64, 3, 128) | 75943610.5 ns | 76782338 ns | 0.99 |
| Conv((3, 3), 3 => 3)/cpu/reverse/Tracker/(64, 64, 3, 128) | 172264656.5 ns | 155414925 ns | 1.11 |
| Conv((3, 3), 3 => 3)/cpu/reverse/ReverseDiff/(64, 64, 3, 128) | 192675811.5 ns | 167638289.5 ns | 1.15 |
| Conv((3, 3), 3 => 3)/cpu/reverse/Flux/(64, 64, 3, 128) | 179833271 ns | 142842293.5 ns | 1.26 |
| Conv((3, 3), 3 => 3)/cpu/reverse/SimpleChains/(64, 64, 3, 128) | 11909358.5 ns | 11557321.5 ns | 1.03 |
| Conv((3, 3), 3 => 3)/cpu/reverse/Enzyme/(64, 64, 3, 128) | 220741819 ns | 199234044.5 ns | 1.11 |
| Conv((3, 3), 3 => 3)/cpu/forward/NamedTuple/(64, 64, 3, 128) | 15366998.5 ns | 15528408.5 ns | 0.99 |
| Conv((3, 3), 3 => 3)/cpu/forward/ComponentArray/(64, 64, 3, 128) | 15503678.5 ns | 15540189 ns | 1.00 |
| Conv((3, 3), 3 => 3)/cpu/forward/Flux/(64, 64, 3, 128) | 41156139 ns | 30661456 ns | 1.34 |
| Conv((3, 3), 3 => 3)/cpu/forward/SimpleChains/(64, 64, 3, 128) | 6318391.5 ns | 6376663 ns | 0.99 |
| vgg16/cpu/reverse/Zygote/(32, 32, 3, 16) | 1063380917.5 ns | 1064055959.5 ns | 1.00 |
| vgg16/cpu/reverse/Zygote/(32, 32, 3, 64) | 2893008446 ns | 2970205700 ns | 0.97 |
| vgg16/cpu/reverse/Zygote/(32, 32, 3, 2) | 196260977 ns | 178121161 ns | 1.10 |
| vgg16/cpu/reverse/Tracker/(32, 32, 3, 16) | 1395460301 ns | 1320655778 ns | 1.06 |
| vgg16/cpu/reverse/Tracker/(32, 32, 3, 64) | 3625063869 ns | 3516351096 ns | 1.03 |
| vgg16/cpu/reverse/Tracker/(32, 32, 3, 2) | 395182584 ns | 344809509 ns | 1.15 |
| vgg16/cpu/reverse/Flux/(32, 32, 3, 16) | 1585273830 ns | 1431616033 ns | 1.11 |
| vgg16/cpu/reverse/Flux/(32, 32, 3, 64) | 4260488961 ns | 4058579611 ns | 1.05 |
| vgg16/cpu/reverse/Flux/(32, 32, 3, 2) | 452914238 ns | 436008182 ns | 1.04 |
| vgg16/cpu/forward/NamedTuple/(32, 32, 3, 16) | 374779030 ns | 381866129 ns | 0.98 |
| vgg16/cpu/forward/NamedTuple/(32, 32, 3, 64) | 901899645 ns | 905256978 ns | 1.00 |
| vgg16/cpu/forward/NamedTuple/(32, 32, 3, 2) | 55190309 ns | 54567006.5 ns | 1.01 |
| vgg16/cpu/forward/ComponentArray/(32, 32, 3, 16) | 367367750 ns | 382293897 ns | 0.96 |
| vgg16/cpu/forward/ComponentArray/(32, 32, 3, 64) | 899735410.5 ns | 870357323.5 ns | 1.03 |
| vgg16/cpu/forward/ComponentArray/(32, 32, 3, 2) | 55597522 ns | 54472914.5 ns | 1.02 |
| vgg16/cpu/forward/Flux/(32, 32, 3, 16) | 526827152 ns | 551222188 ns | 0.96 |
| vgg16/cpu/forward/Flux/(32, 32, 3, 64) | 1479861454 ns | 1387168504 ns | 1.07 |
| vgg16/cpu/forward/Flux/(32, 32, 3, 2) | 167765083 ns | 164122645 ns | 1.02 |
| Conv((3, 3), 64 => 64)/cpu/reverse/ReverseDiff (compiled)/(64, 64, 64, 128) | 1314817844.5 ns | 1180058919 ns | 1.11 |
| Conv((3, 3), 64 => 64)/cpu/reverse/Zygote/(64, 64, 64, 128) | 1575014115 ns | 1610297742 ns | 0.98 |
| Conv((3, 3), 64 => 64)/cpu/reverse/Tracker/(64, 64, 64, 128) | 2398467933 ns | 2289727615.5 ns | 1.05 |
| Conv((3, 3), 64 => 64)/cpu/reverse/ReverseDiff/(64, 64, 64, 128) | 2427384218 ns | 2640437136 ns | 0.92 |
| Conv((3, 3), 64 => 64)/cpu/reverse/Flux/(64, 64, 64, 128) | 2180036687 ns | 2193753011.5 ns | 0.99 |
| Conv((3, 3), 64 => 64)/cpu/reverse/Enzyme/(64, 64, 64, 128) | 2237899044 ns | 2122924359 ns | 1.05 |
| Conv((3, 3), 64 => 64)/cpu/forward/NamedTuple/(64, 64, 64, 128) | 285419226 ns | 282003619 ns | 1.01 |
| Conv((3, 3), 64 => 64)/cpu/forward/ComponentArray/(64, 64, 64, 128) | 285127178.5 ns | 286261947 ns | 1.00 |
| Conv((3, 3), 64 => 64)/cpu/forward/Flux/(64, 64, 64, 128) | 456543369 ns | 437257287 ns | 1.04 |
| Conv((3, 3), 1 => 1)/cpu/reverse/ReverseDiff (compiled)/(64, 64, 1, 128) | 12020070 ns | 11806435 ns | 1.02 |
| Conv((3, 3), 1 => 1)/cpu/reverse/Zygote/(64, 64, 1, 128) | 34753186 ns | 34527638 ns | 1.01 |
| Conv((3, 3), 1 => 1)/cpu/reverse/Tracker/(64, 64, 1, 128) | 16664217 ns | 16364743 ns | 1.02 |
| Conv((3, 3), 1 => 1)/cpu/reverse/ReverseDiff/(64, 64, 1, 128) | 21264382 ns | 21004093 ns | 1.01 |
| Conv((3, 3), 1 => 1)/cpu/reverse/Flux/(64, 64, 1, 128) | 15351989 ns | 15284140 ns | 1.00 |
| Conv((3, 3), 1 => 1)/cpu/reverse/SimpleChains/(64, 64, 1, 128) | 1193025 ns | 1148921.5 ns | 1.04 |
| Conv((3, 3), 1 => 1)/cpu/reverse/Enzyme/(64, 64, 1, 128) | 35851437.5 ns | 35777843.5 ns | 1.00 |
| Conv((3, 3), 1 => 1)/cpu/forward/NamedTuple/(64, 64, 1, 128) | 4515987 ns | 4500694 ns | 1.00 |
| Conv((3, 3), 1 => 1)/cpu/forward/ComponentArray/(64, 64, 1, 128) | 4589579 ns | 4506207 ns | 1.02 |
| Conv((3, 3), 1 => 1)/cpu/forward/Flux/(64, 64, 1, 128) | 2002427.5 ns | 2045686 ns | 0.98 |
| Conv((3, 3), 1 => 1)/cpu/forward/SimpleChains/(64, 64, 1, 128) | 244667 ns | 196300 ns | 1.25 |
| Dense(200 => 200)/cpu/reverse/ReverseDiff (compiled)/(200, 128) | 381681 ns | 378068 ns | 1.01 |
| Dense(200 => 200)/cpu/reverse/Zygote/(200, 128) | 304498 ns | 314462 ns | 0.97 |
| Dense(200 => 200)/cpu/reverse/Tracker/(200, 128) | 388380 ns | 377972 ns | 1.03 |
| Dense(200 => 200)/cpu/reverse/ReverseDiff/(200, 128) | 517516 ns | 520691 ns | 0.99 |
| Dense(200 => 200)/cpu/reverse/Flux/(200, 128) | 294910 ns | 289716 ns | 1.02 |
| Dense(200 => 200)/cpu/reverse/SimpleChains/(200, 128) | 409033 ns | 401777 ns | 1.02 |
| Dense(200 => 200)/cpu/reverse/Enzyme/(200, 128) | 430443 ns | 425321 ns | 1.01 |
| Dense(200 => 200)/cpu/forward/NamedTuple/(200, 128) | 164637 ns | 157406 ns | 1.05 |
| Dense(200 => 200)/cpu/forward/ComponentArray/(200, 128) | 155045 ns | 162456 ns | 0.95 |
| Dense(200 => 200)/cpu/forward/Flux/(200, 128) | 93285 ns | 91953 ns | 1.01 |
| Dense(200 => 200)/cpu/forward/SimpleChains/(200, 128) | 104535 ns | 104407 ns | 1.00 |
| Conv((3, 3), 16 => 16)/cpu/reverse/ReverseDiff (compiled)/(64, 64, 16, 128) | 312422589 ns | 297649242 ns | 1.05 |
| Conv((3, 3), 16 => 16)/cpu/reverse/Zygote/(64, 64, 16, 128) | 290463113 ns | 287837994 ns | 1.01 |
| Conv((3, 3), 16 => 16)/cpu/reverse/Tracker/(64, 64, 16, 128) | 571203973 ns | 545531151.5 ns | 1.05 |
| Conv((3, 3), 16 => 16)/cpu/reverse/ReverseDiff/(64, 64, 16, 128) | 683798344 ns | 655809148 ns | 1.04 |
| Conv((3, 3), 16 => 16)/cpu/reverse/Flux/(64, 64, 16, 128) | 590381756 ns | 554893727 ns | 1.06 |
| Conv((3, 3), 16 => 16)/cpu/reverse/SimpleChains/(64, 64, 16, 128) | 339651259 ns | 316084028.5 ns | 1.07 |
| Conv((3, 3), 16 => 16)/cpu/reverse/Enzyme/(64, 64, 16, 128) | 615147910.5 ns | 583442251.5 ns | 1.05 |
| Conv((3, 3), 16 => 16)/cpu/forward/NamedTuple/(64, 64, 16, 128) | 40167611.5 ns | 40159465 ns | 1.00 |
| Conv((3, 3), 16 => 16)/cpu/forward/ComponentArray/(64, 64, 16, 128) | 39800524 ns | 40173961.5 ns | 0.99 |
| Conv((3, 3), 16 => 16)/cpu/forward/Flux/(64, 64, 16, 128) | 107566981.5 ns | 96663497 ns | 1.11 |
| Conv((3, 3), 16 => 16)/cpu/forward/SimpleChains/(64, 64, 16, 128) | 29363022 ns | 28321531 ns | 1.04 |
| Dense(2000 => 2000)/cpu/reverse/ReverseDiff (compiled)/(2000, 128) | 21121686 ns | 21078472 ns | 1.00 |
| Dense(2000 => 2000)/cpu/reverse/Zygote/(2000, 128) | 17433210 ns | 17393481 ns | 1.00 |
| Dense(2000 => 2000)/cpu/reverse/Tracker/(2000, 128) | 22824955.5 ns | 22657728 ns | 1.01 |
| Dense(2000 => 2000)/cpu/reverse/ReverseDiff/(2000, 128) | 26452674 ns | 28019412 ns | 0.94 |
| Dense(2000 => 2000)/cpu/reverse/Flux/(2000, 128) | 19392595 ns | 19298592.5 ns | 1.00 |
| Dense(2000 => 2000)/cpu/reverse/Enzyme/(2000, 128) | 20920397.5 ns | 20720819 ns | 1.01 |
| Dense(2000 => 2000)/cpu/forward/NamedTuple/(2000, 128) | 6135208.5 ns | 6086608 ns | 1.01 |
| Dense(2000 => 2000)/cpu/forward/ComponentArray/(2000, 128) | 6112049 ns | 6101998 ns | 1.00 |
| Dense(2000 => 2000)/cpu/forward/Flux/(2000, 128) | 6475517 ns | 6509879.5 ns | 0.99 |
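
For readers scanning the results, the Ratio column appears to be the current timing divided by the previous one, so values above 1 mean the current commit was slower. The following is a minimal Python sketch of that reading, using four rows copied from the results above. The 1.25 cutoff and the names `ROWS`, `ratio`, and `flag_regressions` are assumptions made for illustration; they are not part of the benchmark workflow or its configuration.

```python
# Illustrative only: recompute the Ratio column for a few rows and flag
# the ones that exceed a hypothetical slowdown cutoff.

ALERT_THRESHOLD = 1.25  # assumed cutoff, not taken from the workflow

# (benchmark name, current ns, previous ns), copied from the results above
ROWS = [
    ("Dense(20 => 20)/cpu/reverse/Zygote/(20, 128)", 34264.0, 24787.0),
    ("Dense(20 => 20)/cpu/forward/ComponentArray/(20, 128)", 20639.0, 14448.0),
    ("Conv((3, 3), 3 => 3)/cpu/reverse/ReverseDiff (compiled)/(64, 64, 3, 128)",
     116312647.0, 77690170.0),
    ("Dense(20 => 20)/cpu/reverse/Flux/(20, 128)", 19727.0, 21590.0),
]


def ratio(current_ns: float, previous_ns: float) -> float:
    """Current timing divided by previous timing; > 1 means slower."""
    return current_ns / previous_ns


def flag_regressions(rows, threshold: float = ALERT_THRESHOLD):
    """Return (name, ratio rounded to 2 places) for rows above the threshold."""
    return [
        (name, round(ratio(cur, prev), 2))
        for name, cur, prev in rows
        if ratio(cur, prev) > threshold
    ]


if __name__ == "__main__":
    for name, r in flag_regressions(ROWS):
        print(f"{r:.2f}  {name}")
```

With these four rows, the first three are flagged and the Flux row (ratio below 1) is not. Single-run differences on shared CI hardware can be noisy, so a flagged row is a prompt to re-run the benchmark rather than proof of a regression.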

This comment was automatically generated by workflow using github-action-benchmark.
