Nobody expects the Red Team
Too many changes to list, but broadly:
- Remove Intel GPU support from the compiler
- Add AMD GPU support to the compiler
- Remove Intel GPU host code
- Add AMD GPU host code
- Support more device instructions: from 40 to 68
- Support more host functions: from 48 to 184
- Add proof-of-concept implementation of the OptiX framework
- Add minimal support for cuDNN, cuBLAS, cuSPARSE, cuFFT, NCCL, NVML
- Improve ZLUDA launcher for Windows