Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

use autotuning for CUDA kernels by default #82

Merged
merged 2 commits into from
Feb 9, 2021

Conversation

simeonschaub
Copy link
Contributor

JuliaGPU/KernelAbstractions.jl#206 added the ability to automatically tune the workgroupsize of CUDA kernels. This PR stops hardcoding a default workgroupsize and lets KernelAbstractions handle that. This does change the workgroupsize from being statically sized to being dynamically sized, but in my testing, even with fairly small workgroupsizes, that didn't really make a difference.

JuliaGPU/KernelAbstractions.jl#206 added the ability to automatically tune the workgroupsize of CUDA kernels. This PR stops hardcoding a default workgroupsize and lets KernelAbstractions handle that. This does change the workgroupsize from being statically sized to being dynamically sized, but in my testing, even with fairly small workgroupsizes, that didn't really make a difference.
@simeonschaub simeonschaub mentioned this pull request Feb 8, 2021
@mcabbott
Copy link
Owner

mcabbott commented Feb 8, 2021

Cool, thanks for having a look. Should I be surprised that CI passes even though the KernelAbstractions.jl update isn't tagged yet?

@simeonschaub
Copy link
Contributor Author

Previously if you passed nothing as workgroupsize, KernelAbstractions just used a size of 256 as a default, so it will still pass.

@simeonschaub
Copy link
Contributor Author

That PR should now be in the latest KernelAbstractions release, so this should be ready to go barring any other concerns.

src/macro.jl Outdated Show resolved Hide resolved
src/macro.jl Outdated Show resolved Hide resolved
src/macro.jl Outdated Show resolved Hide resolved
Co-authored-by: Michael Abbott <32575566+mcabbott@users.noreply.github.com>
@mcabbott mcabbott merged commit 33f8ff6 into mcabbott:master Feb 9, 2021
@simeonschaub simeonschaub deleted the sds/cuda_autotune branch February 9, 2021 22:12
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants