Make stdlib extensions backwards compatible #512

Merged: 4 commits, Aug 22, 2024


vchuravy
Member

Contributor

github-actions bot commented Aug 22, 2024

Benchmark Results

| Benchmark | main | 3d05201... | main/3d052019fac473... |
|---|---|---|---|
| saxpy/default/Float16/1024 | 2.77 ± 0.19 μs | 2.77 ± 0.19 μs | 1 |
| saxpy/default/Float16/1048576 | 2.07 ± 0.015 ms | 2.07 ± 0.012 ms | 1 |
| saxpy/default/Float16/16384 | 0.0327 ± 0.00013 ms | 0.0327 ± 0.00014 ms | 1 |
| saxpy/default/Float16/2048 | 5.16 ± 0.065 μs | 5.16 ± 0.062 μs | 0.999 |
| saxpy/default/Float16/256 | 0.96 ± 0.11 μs | 1.03 ± 0.13 μs | 0.933 |
| saxpy/default/Float16/262144 | 0.515 ± 0.0014 ms | 0.516 ± 0.0014 ms | 1 |
| saxpy/default/Float16/32768 | 0.0649 ± 0.00015 ms | 0.0649 ± 0.00017 ms | 1 |
| saxpy/default/Float16/4096 | 10 ± 0.05 μs | 10 ± 0.05 μs | 1 |
| saxpy/default/Float16/512 | 1.56 ± 0.089 μs | 1.56 ± 0.079 μs | 0.996 |
| saxpy/default/Float16/64 | 0.611 ± 0.014 μs | 0.628 ± 0.013 μs | 0.974 |
| saxpy/default/Float16/65536 | 0.129 ± 0.00027 ms | 0.129 ± 0.00026 ms | 1 |
| saxpy/default/Float32/1024 | 1.02 ± 0.011 μs | 1.04 ± 0.012 μs | 0.983 |
| saxpy/default/Float32/1048576 | 0.884 ± 0.014 ms | 0.886 ± 0.014 ms | 0.998 |
| saxpy/default/Float32/16384 | 14.4 ± 0.11 μs | 14.4 ± 0.12 μs | 0.999 |
| saxpy/default/Float32/2048 | 1.73 ± 0.02 μs | 1.72 ± 0.016 μs | 1 |
| saxpy/default/Float32/256 | 0.525 ± 0.12 μs | 0.529 ± 0.12 μs | 0.992 |
| saxpy/default/Float32/262144 | 0.221 ± 0.00079 ms | 0.221 ± 0.0008 ms | 0.998 |
| saxpy/default/Float32/32768 | 28.3 ± 0.15 μs | 28.3 ± 0.14 μs | 1 |
| saxpy/default/Float32/4096 | 3.03 ± 0.023 μs | 3.02 ± 0.022 μs | 1 |
| saxpy/default/Float32/512 | 0.691 ± 0.11 μs | 0.694 ± 0.11 μs | 0.995 |
| saxpy/default/Float32/64 | 0.404 ± 0.0053 μs | 0.412 ± 0.0043 μs | 0.981 |
| saxpy/default/Float32/65536 | 0.056 ± 0.00031 ms | 0.056 ± 0.0003 ms | 1 |
| saxpy/default/Float64/1024 | 1.07 ± 0.027 μs | 1.08 ± 0.02 μs | 0.997 |
| saxpy/default/Float64/1048576 | 0.992 ± 0.049 ms | 0.953 ± 0.046 ms | 1.04 |
| saxpy/default/Float64/16384 | 15.9 ± 0.12 μs | 15.9 ± 0.94 μs | 1 |
| saxpy/default/Float64/2048 | 1.75 ± 0.018 μs | 1.77 ± 0.024 μs | 0.988 |
| saxpy/default/Float64/256 | 0.528 ± 0.0064 μs | 0.541 ± 0.0081 μs | 0.977 |
| saxpy/default/Float64/262144 | 0.244 ± 0.0059 ms | 0.228 ± 0.013 ms | 1.07 |
| saxpy/default/Float64/32768 | 31.5 ± 0.53 μs | 31.3 ± 2.1 μs | 1 |
| saxpy/default/Float64/4096 | 3.04 ± 0.034 μs | 3.05 ± 0.031 μs | 0.997 |
| saxpy/default/Float64/512 | 0.708 ± 0.11 μs | 0.714 ± 0.11 μs | 0.992 |
| saxpy/default/Float64/64 | 0.406 ± 0.0046 μs | 0.423 ± 0.0052 μs | 0.959 |
| saxpy/default/Float64/65536 | 0.062 ± 0.0018 ms | 0.0601 ± 0.0037 ms | 1.03 |
| saxpy/static workgroup=(1024,)/Float16/1024 | 2.07 ± 0.2 μs | 2.09 ± 0.21 μs | 0.99 |
| saxpy/static workgroup=(1024,)/Float16/1048576 | 0.164 ± 0.011 ms | 0.161 ± 0.0089 ms | 1.02 |
| saxpy/static workgroup=(1024,)/Float16/16384 | 4.33 ± 0.21 μs | 4.39 ± 0.26 μs | 0.987 |
| saxpy/static workgroup=(1024,)/Float16/2048 | 2.1 ± 0.21 μs | 2.12 ± 0.21 μs | 0.993 |
| saxpy/static workgroup=(1024,)/Float16/256 | 2.61 ± 0.032 μs | 2.64 ± 0.034 μs | 0.991 |
| saxpy/static workgroup=(1024,)/Float16/262144 | 0.0439 ± 0.0028 ms | 0.0436 ± 0.002 ms | 1.01 |
| saxpy/static workgroup=(1024,)/Float16/32768 | 6.78 ± 0.32 μs | 6.66 ± 0.31 μs | 1.02 |
| saxpy/static workgroup=(1024,)/Float16/4096 | 2.42 ± 0.031 μs | 2.42 ± 0.031 μs | 0.997 |
| saxpy/static workgroup=(1024,)/Float16/512 | 3.12 ± 0.06 μs | 3.15 ± 0.07 μs | 0.99 |
| saxpy/static workgroup=(1024,)/Float16/64 | 2.23 ± 0.021 μs | 2.26 ± 0.021 μs | 0.984 |
| saxpy/static workgroup=(1024,)/Float16/65536 | 12.9 ± 0.66 μs | 12.8 ± 0.67 μs | 1.01 |
| saxpy/static workgroup=(1024,)/Float32/1024 | 1.95 ± 0.023 μs | 1.93 ± 0.023 μs | 1.01 |
| saxpy/static workgroup=(1024,)/Float32/1048576 | 0.245 ± 0.014 ms | 0.248 ± 0.016 ms | 0.99 |
| saxpy/static workgroup=(1024,)/Float32/16384 | 4.81 ± 0.75 μs | 4.75 ± 0.83 μs | 1.01 |
| saxpy/static workgroup=(1024,)/Float32/2048 | 2.25 ± 0.2 μs | 2.26 ± 0.2 μs | 0.997 |
| saxpy/static workgroup=(1024,)/Float32/256 | 2.72 ± 1.6 μs | 2.61 ± 0.39 μs | 1.04 |
| saxpy/static workgroup=(1024,)/Float32/262144 | 0.066 ± 0.0038 ms | 0.0656 ± 0.0043 ms | 1.01 |
| saxpy/static workgroup=(1024,)/Float32/32768 | 8.33 ± 1.6 μs | 7.86 ± 1.3 μs | 1.06 |
| saxpy/static workgroup=(1024,)/Float32/4096 | 2.56 ± 0.082 μs | 2.54 ± 0.087 μs | 1.01 |
| saxpy/static workgroup=(1024,)/Float32/512 | 2.48 ± 0.21 μs | 2.46 ± 0.21 μs | 1.01 |
| saxpy/static workgroup=(1024,)/Float32/64 | 2.43 ± 0.048 μs | 2.4 ± 0.047 μs | 1.01 |
| saxpy/static workgroup=(1024,)/Float32/65536 | 18.6 ± 1.9 μs | 17.8 ± 1.6 μs | 1.04 |
| saxpy/static workgroup=(1024,)/Float64/1024 | 2.02 ± 0.026 μs | 2.03 ± 0.026 μs | 0.995 |
| saxpy/static workgroup=(1024,)/Float64/1048576 | 0.536 ± 0.091 ms | 0.519 ± 0.039 ms | 1.03 |
| saxpy/static workgroup=(1024,)/Float64/16384 | 8.3 ± 1.5 μs | 7.78 ± 1.4 μs | 1.07 |
| saxpy/static workgroup=(1024,)/Float64/2048 | 2.5 ± 0.23 μs | 2.5 ± 0.24 μs | 0.998 |
| saxpy/static workgroup=(1024,)/Float64/256 | 2.39 ± 0.049 μs | 2.4 ± 0.046 μs | 0.996 |
| saxpy/static workgroup=(1024,)/Float64/262144 | 0.124 ± 0.0086 ms | 0.124 ± 0.0087 ms | 1 |
| saxpy/static workgroup=(1024,)/Float64/32768 | 18.7 ± 2.3 μs | 17.2 ± 1.6 μs | 1.08 |
| saxpy/static workgroup=(1024,)/Float64/4096 | 3.12 ± 0.34 μs | 3.11 ± 0.35 μs | 1 |
| saxpy/static workgroup=(1024,)/Float64/512 | 2.36 ± 0.04 μs | 2.38 ± 0.041 μs | 0.99 |
| saxpy/static workgroup=(1024,)/Float64/64 | 2.37 ± 0.075 μs | 2.37 ± 0.076 μs | 1 |
| saxpy/static workgroup=(1024,)/Float64/65536 | 0.036 ± 0.0031 ms | 0.0358 ± 0.0021 ms | 1.01 |
| time_to_load | 0.311 ± 0.0012 s | 0.311 ± 0.0019 s | 1 |

Benchmark Plots

A plot of the benchmark results has been uploaded as an artifact to the workflow run for this PR.
Go to "Actions"->"Benchmark a pull request"->[the most recent run]->"Artifacts" (at the bottom).

@vchuravy vchuravy merged commit 91ada95 into main Aug 22, 2024
35 of 46 checks passed
@vchuravy vchuravy deleted the vc/backwards_compat_ext branch August 22, 2024 11:49