Skip to content

XiaoSong9905/cuda-v100-kernels

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

CUDA Kernels on V100

Few CUDA Kernels on V100. Mainly used to demonstrate optimization methods.

For minimal dependency requirement, use Makefile to build all executables.

File structure

// reduce operation
reduce/

// Scan operation
scan/

// Square matrix transpose
transpose/

// General matrix multiply C = A * B
sgemm/