Releases: JuliaGPU/KernelAbstractions.jl
Releases · JuliaGPU/KernelAbstractions.jl
v0.8.6
v0.8.5
KernelAbstractions v0.8.5
Closed issues:
Merged pull requests:
v0.7.3
KernelAbstractions v0.7.3
Closed issues:
- Support atomics (#7)
- Add backend lookup function based on input arguments (#229)
- Separate Cassette context from
CompilerMetadata
(#231) - Update for CUDA.jl 3.0 (#241)
- Adding a function to get device from array (type)? (#268)
- Support for atomics (#276)
- CUDA 3.6.3 broke KernelAbstactions. (#280)
- Enzyme fails on GPU kernel (#307)
Merged pull requests:
- Add function get_device (#269) (@oschulz)
- Synchronize SpecialFunctions compat (#279) (@charleskawczynski)
- [CUDAKernels] Avoid Cassette looking at device_functions (#281) (@vchuravy)
- Simplify CI and drop bors (#283) (@vchuravy)
- Excise Cassette (#288) (@vchuravy)
- Use Timer to avoid hangs due to error on the device (#291) (@vchuravy)
- don't test KernelGradients on nightly for now (#292) (@vchuravy)
- WIP: Make CUDA stream from cache match CUDA context (#294) (@christophernhill)
- adding atomic support with atomix (#299) (@leios)
- using CPU as default for Scratchpad and SharedMemory (#300) (@leios)
- CompatHelper: add new compat entry for Atomix at version 0.1, (keep existing compat) (#301) (@github-actions[bot])
- Enzyme v0.10 supports GPU compilation again (#303) (@vchuravy)
- Add atomics support for ROCKernels (#304) (@jpsamaroo)
- Enable CPU atomics (#305) (@jpsamaroo)
- Add 'return nothing' to autodiff (#309) (@pxl-th)
- Make examples work by copy-paste into REPL (#310) (@jwscook)
- bounding UnsafeAtomics and UnsafeAtomicsLLVM (#311) (@leios)
- Add oneAPI back-end. (#312) (@maleadt)
- [doc] Some revamping (#314) (@michel2323)
- Fix depwarn (#315) (@ChrisRackauckas)
- ROCKernels: Update to AMDGPU 0.4 (#316) (@jpsamaroo)
- Fix docstrings rendering (#323) (@navidcy)
- fix typo in docs (#327) (@bjarthur)
- Add @ndrange (#329) (@vchuravy)
- Fix stmt form of at_private (#331) (@vchuravy)
v0.8.4
KernelAbstractions v0.8.4
Merged pull requests:
- Make examples work by copy-paste into REPL (#310) (@jwscook)
- Add oneAPI back-end. (#312) (@maleadt)
- [doc] Some revamping (#314) (@michel2323)
- Fix depwarn (#315) (@ChrisRackauckas)
- ROCKernels: Update to AMDGPU 0.4 (#316) (@jpsamaroo)
v0.8.3
v0.8.2
KernelAbstractions v0.8.2
Closed issues:
Merged pull requests:
- adding atomic support with atomix (#299) (@leios)
- CompatHelper: add new compat entry for Atomix at version 0.1, (keep existing compat) (#301) (@github-actions[bot])
- Enzyme v0.10 supports GPU compilation again (#303) (@vchuravy)
- Add atomics support for ROCKernels (#304) (@jpsamaroo)
- Enable CPU atomics (#305) (@jpsamaroo)
v0.8.1
KernelAbstractions v0.8.1
Closed issues:
- Separate Cassette context from
CompilerMetadata
(#231)
Merged pull requests:
- WIP: Make CUDA stream from cache match CUDA context (#294) (@christophernhill)
- using CPU as default for Scratchpad and SharedMemory (#300) (@leios)
v0.8.0
KernelAbstractions v0.8.0
Closed issues:
- Adding a function to get device from array (type)? (#268)
- CUDA 3.6.3 broke KernelAbstactions. (#280)
wait(kernel(...)
hangs up on Julia v1.7 (#290)
Merged pull requests:
- Add function get_device (#269) (@oschulz)
- Synchronize SpecialFunctions compat (#279) (@charleskawczynski)
- [CUDAKernels] Avoid Cassette looking at device_functions (#281) (@vchuravy)
- Simplify CI and drop bors (#283) (@vchuravy)
- Excise Cassette (#288) (@vchuravy)
- Use Timer to avoid hangs due to error on the device (#291) (@vchuravy)
- don't test KernelGradients on nightly for now (#292) (@vchuravy)
v0.7.2
KernelAbstractions v0.7.2
Merged pull requests:
- CompatHelper: bump compat for "SpecialFunctions" to "2.0" (#278) (@github-actions[bot])
v0.7.1
KernelAbstractions v0.7.1
Closed issues:
- Compilation error with type-converting round functions, e.g., ceil(Int32, 1.2f0) (#254)
- invalid syntax on upcoming 1.7 (#260)
- ROCKernels run doesn't work for me. (#261)
- Convert related kernels errors with CUDAKernels (#265)
Event(ROCDevice())
errors (#267)
Merged pull requests: