Compile the code with the CUDA compiler (nvcc) and with the OpenMP flag set, so that all the different versions are available. To enable the OpenMP flag for this project, edit the nvcc.profile file located in the bin folder of your CUDA toolkit installation [1].

Open nvcc.profile as Administrator and change

INCLUDES += "-I$(TOP)/include" $(SPACE)

to

INCLUDES += "-I$(TOP)/include" "/openmp" $(SPACE)
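One way to check that the flag took effect is to test the standard `_OPENMP` macro, which compilers define only when OpenMP support is switched on. The sketch below is not part of the project; `openmp_enabled` and `openmp_status` are hypothetical helper names chosen for illustration.

```cpp
#include <string>

// Reports whether this file was compiled with OpenMP enabled.
// _OPENMP is defined by the compiler only when its OpenMP flag is active.
bool openmp_enabled() {
#ifdef _OPENMP
    return true;
#else
    return false;
#endif
}

// Hypothetical helper: a human-readable status line for logging.
std::string openmp_status() {
    return openmp_enabled() ? "OpenMP enabled" : "OpenMP disabled";
}
```

Printing `openmp_status()` at startup shows whether the OpenMP version of the calculations can actually run in parallel in the build you produced.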

There is no graphical menu. To find out about the available options, look in the code for the keys that have actions attached.

Most importantly, the keys 0, 1 and 2 switch where the calculations are done: CPU, OpenMP and CUDA, respectively.
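The key-to-backend mapping described above could look roughly like the following. This is a minimal sketch, not the project's actual key handler: the `Backend` enum and `select_backend` function are hypothetical names, and only the 0/1/2 assignments come from this README.

```cpp
// Where the calculations run. Names are illustrative, not taken from the project.
enum class Backend { CPU, OpenMP, CUDA };

// Maps a pressed key to a backend as described in this README:
// '0' -> CPU, '1' -> OpenMP, '2' -> CUDA.
// Returns true and updates `current` for a recognized key;
// returns false and leaves `current` unchanged for any other key.
bool select_backend(char key, Backend& current) {
    switch (key) {
        case '0': current = Backend::CPU;    return true;
        case '1': current = Backend::OpenMP; return true;
        case '2': current = Backend::CUDA;   return true;
        default:  return false;
    }
}
```

Leaving the current backend untouched for unrecognized keys means the other key actions in the code can be handled separately without disturbing the selection.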

Enjoy!

[1] Thanks to http://www.orangeowlsolutions.com/archives/783