Fix dart inplace prediction with GPU input. #6777

trivialfis · 2021-03-24T04:25:13Z

On C API there's a check for output prediction being device-readable. With dart this doesn't hold.
Added optimization to avoid memory copying whenever possible. Will post the result later.

* Fix dart inplace predict with data on GPU, which might trigger a fatal check for device access right. * Avoid copying data whenever possible.

trivialfis · 2021-03-24T17:21:04Z

src/common/hist_util.cu

-  const auto& host_data = page.data.ConstHostVector();
-  dh::device_vector<Entry> sorted_entries(host_data.begin() + begin,
-                                          host_data.begin() + end);
+  dh::device_vector<Entry> sorted_entries;


This is to avoid copying data when input is already on GPU.

trivialfis · 2021-03-24T17:21:22Z

src/common/host_device_vector.cu

@@ -92,7 +92,10 @@ class HostDeviceVectorImpl {
    } else {
      gpu_access_ = GPUAccess::kWrite;
      SetDevice();
-      thrust::fill(data_d_->begin(), data_d_->end(), v);
+      auto s_data = dh::ToSpan(*data_d_);


Avoid synchronization.

trivialfis · 2021-03-24T17:21:46Z

src/data/ellpack_page.cu

-    dh::safe_cuda(cudaMemcpyAsync(entries_d.data().get(),
-                                  data_vec.data() + ent_cnt_begin,
-                                  n_entries * sizeof(Entry), cudaMemcpyDefault));
+    if (row_batch.data.DeviceCanRead()) {


Avoid copying data when it's already on GPU.

trivialfis · 2021-03-24T17:36:59Z

100 rounds on higgs:

Before:
Train::Duration: 1497.7212617397308

After:
Train::Duration: 567.4818341732025

hcho3

Trying to understand what this PR does. Is it correct to summarize it as thus:

Current behavior: Copy the predicted scores predts.predictions to the host before summing them into h_out_predts
New behavior: Sum the predicted scores predts.predictions into predts.predictions.DeviceSpan(), keeping all the data on the device.

hcho3 · 2021-03-25T00:45:46Z

src/common/host_device_vector.cu

-      thrust::fill(data_d_->begin(), data_d_->end(), v);
+      auto s_data = dh::ToSpan(*data_d_);
+      dh::LaunchN(device_, data_d_->size(), [=]XGBOOST_DEVICE(size_t i) {
+          s_data[i] = v;


Should we use the bound-checked interface here, given that the size of data_d_ is already clear?

I can use pointer.

trivialfis · 2021-03-25T02:41:09Z

Trying to understand what this PR does.

Your summary is correct.

Fix dart inplace prediction on GPU data.

29a78c5

* Fix dart inplace predict with data on GPU, which might trigger a fatal check for device access right. * Avoid copying data whenever possible.

trivialfis force-pushed the fix-dart-inplace branch from fb0f2f6 to 29a78c5 Compare March 24, 2021 06:33

trivialfis added the Blocking label Mar 24, 2021

Set device.

28a928f

trivialfis commented Mar 24, 2021

View reviewed changes

hcho3 reviewed Mar 25, 2021

View reviewed changes

RAMitchell approved these changes Mar 25, 2021

View reviewed changes

hcho3 approved these changes Mar 25, 2021

View reviewed changes

trivialfis merged commit a7083d3 into dmlc:master Mar 25, 2021

trivialfis deleted the fix-dart-inplace branch March 25, 2021 04:00

trivialfis mentioned this pull request Mar 30, 2021

Optimize dart inplace predict perf. #6804

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix dart inplace prediction with GPU input. #6777

Fix dart inplace prediction with GPU input. #6777

trivialfis commented Mar 24, 2021 •

edited

Loading

trivialfis Mar 24, 2021

trivialfis Mar 24, 2021

trivialfis Mar 24, 2021 •

edited

Loading

trivialfis commented Mar 24, 2021

hcho3 left a comment

hcho3 Mar 25, 2021

trivialfis Mar 25, 2021

trivialfis commented Mar 25, 2021

Fix dart inplace prediction with GPU input. #6777

Fix dart inplace prediction with GPU input. #6777

Conversation

trivialfis commented Mar 24, 2021 • edited Loading

trivialfis Mar 24, 2021

Choose a reason for hiding this comment

trivialfis Mar 24, 2021

Choose a reason for hiding this comment

trivialfis Mar 24, 2021 • edited Loading

Choose a reason for hiding this comment

trivialfis commented Mar 24, 2021

hcho3 left a comment

Choose a reason for hiding this comment

hcho3 Mar 25, 2021

Choose a reason for hiding this comment

trivialfis Mar 25, 2021

Choose a reason for hiding this comment

trivialfis commented Mar 25, 2021

trivialfis commented Mar 24, 2021 •

edited

Loading

trivialfis Mar 24, 2021 •

edited

Loading