Release Version 2 · vosen/ZLUDA

The goal of version 2 has been to fix end to end execution of GeekBench and improve Windows support:

Several new host-side functions are supported now (e.g. cuModuleLoadDataEx)
Several bugs have been fixed on the kernel side (e.g. threadIdx/blockIdx is now handled correctly)
Minor improvement in generated code brought better I/O performance when reading/writing vector objects. This improved performance by several percentage points in select GeekBench benchmarks
ZLUDA now ships its own injector (with_zluda.exe) which should make running ZLUDA on Windows much easier
Additionally, we have gained ability to easily create traces of CUDA kernel execution, making enabling new workloads much easier
ZLUDA now has a CI, which produces binaries on every pull request and commit

Special thanks to @take-cheeze, @nilsmartel and @ritschwumm for contributing to this release

Provide feedback