Mark CUDA 10.1 as unsupported. #4264

trivialfis · 2019-03-16T21:20:49Z

Issue is the same in #4223. Spliter of NVCC might have some problems with pointers, running this in HostDeviceVectorImpl::Copy:

      LOG(DEBUG) << "other->Distribution(): " << other->Distribution();
      LOG(DEBUG) << "other.Distribution(): " << (*other).Distribution();

returns different result. The bug should be reproducible by simply running unittests with CUDA 10.1 on Ubuntu 18.10. Instead of working around it like the last time I did, I think it's more appropriate to explicitly mark CUDA 10.1 is not being supported, or at least we need to mark the nvcc included in CUDA 10.1 is not supported.

Sadly, it's currently the only version that doesn't break my machine ... @RAMitchell WDYT?

The text was updated successfully, but these errors were encountered:

RAMitchell · 2019-03-16T22:55:22Z

Yes we may have to wait for another cuda update. Do what you think is best.

hcho3 · 2019-03-17T00:40:53Z

@trivialfis I was about to include CUDA 10.1 as one of the targets in my upcoming PR for CI refactor. Should we leave it out for the time being?

trivialfis · 2019-03-17T05:08:48Z

@hcho3 yes. Let's skip this version.

rongou · 2019-03-18T23:07:37Z

This problem seems to be specific to gcc. I tried clang-7, it seems to work fine.

trivialfis · 2019-03-19T02:20:05Z

@rongou I'm not sure how did you compile XGBoost with clang, assuming you are referring to the non-apple clang. There was a type deduction in dmlc core logging facility which tricks clang-7.

rongou · 2019-03-19T16:11:07Z

@trivialfis not sure, I built the jvm packages with cuda enabled using clang-7 and it seems to run fine. What is the logging issue? Maybe it only affects cli or python?

trivialfis · 2019-03-19T21:01:21Z

@rongou Drap to the bottom of clang-tidy test: https://xgboost-ci.net/blue/organizations/jenkins/xgboost/detail/PR-4149/12/pipeline

The type of &std:free can not be deduced. Same with clang. The problem might be in std::free implementation if you are using libc++ instead of libstdc++. ; )

rongou · 2019-03-19T22:57:18Z

Yeah I installed libc++ when I installed clang-7. I guess clang uses it by default? If you are using clang-tidy, maybe it's not a crazy idea to go all in with clang.

Anyway, the CUDA bug should be fixed in the next patch update, if all goes well.

jamesdalg · 2019-04-16T05:55:09Z

If it's possible, can someone post some detailed documentation as to how to work around this issue with visual studio 2017 and cmake, with any version that works currently?

rongou · 2019-04-16T18:14:05Z

You can use CUDA 10.0: https://developer.nvidia.com/cuda-10.0-download-archive

trivialfis changed the title ~~Mark CUDA 10.1 Unsupported.~~ Mark CUDA 10.1 as unsupported. Mar 17, 2019

trivialfis mentioned this issue Mar 17, 2019

Mark CUDA 10.1 as unsupported. #4265

Merged

trivialfis closed this as completed in #4265 Mar 17, 2019

sh1ng mentioned this issue May 7, 2019

kmeans build error: namespace "thrust" has no member "device_malloc_allocator" h2oai/h2o4gpu#764

Closed

sh1ng mentioned this issue May 24, 2019

fix build on CUDA 10.1: h2oai/h2o4gpu#768

Merged

lock bot locked as resolved and limited conversation to collaborators Jul 15, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Mark CUDA 10.1 as unsupported. #4264

Mark CUDA 10.1 as unsupported. #4264

trivialfis commented Mar 16, 2019 •

edited

Loading

RAMitchell commented Mar 16, 2019

hcho3 commented Mar 17, 2019

trivialfis commented Mar 17, 2019

rongou commented Mar 18, 2019

trivialfis commented Mar 19, 2019

rongou commented Mar 19, 2019

trivialfis commented Mar 19, 2019

rongou commented Mar 19, 2019

jamesdalg commented Apr 16, 2019

rongou commented Apr 16, 2019

Mark CUDA 10.1 as unsupported. #4264

Mark CUDA 10.1 as unsupported. #4264

Comments

trivialfis commented Mar 16, 2019 • edited Loading

RAMitchell commented Mar 16, 2019

hcho3 commented Mar 17, 2019

trivialfis commented Mar 17, 2019

rongou commented Mar 18, 2019

trivialfis commented Mar 19, 2019

rongou commented Mar 19, 2019

trivialfis commented Mar 19, 2019

rongou commented Mar 19, 2019

jamesdalg commented Apr 16, 2019

rongou commented Apr 16, 2019

trivialfis commented Mar 16, 2019 •

edited

Loading