Replaced std::vector with HostDeviceVector in MetaInfo and SparsePage. #3446

canonizer · 2018-07-05T10:22:05Z

added distributions to HostDeviceVector
using HostDeviceVector for labels, weights and base margings in MetaInfo
using HostDeviceVector for offset and data in SparsePage
other necessary refactoring

- added distributions to HostDeviceVector - using HostDeviceVector for labels, weights and base margings in MetaInfo - using HostDeviceVector for offset and data in SparsePage - other necessary refactoring

RAMitchell

Looks good. Can you confirm for me if the vectors in info get copied back and forth between host and device?

For example the label vector never changes, but when we alternately call label_.HostVector() and label_.DevicePointer() is this being copied?

canonizer · 2018-07-31T14:47:18Z

Yes, the vectors are currently copied between the host and device even if not modified.

If you want to avoid this, I can add const-only versions of methods that get host or GPU data. E.g., ConstHostVector(), ConstDevicePointer() etc.

RAMitchell · 2018-07-31T22:28:13Z

This seems like a good idea, otherwise they will be constantly syncing.

- const versions added to calls that can trigger data transfers, e.g. DevicePointer() - updated the code that uses HostDeviceVector - objective functions now accept const HostDeviceVector<bst_float>& for predictions

- this means no copies are performed if both host and devices access the HostDeviceVector read-only

canonizer · 2018-08-17T13:30:59Z

Done.

- updated the lz4 plugin - added ConstDeviceSpan to HostDeviceVector - using device % dh::NVisibleDevices() for the physical device number, e.g. in calls to cudaSetDevice()

trivialfis · 2018-08-22T17:55:57Z

Is it worth to store some extra information in one of the GPU related class like number of SMs? Currently all the grid stride functions just use one grid.

- replaced HostDeviceVector<unsigned int> with HostDeviceVector<int>

codecov-io · 2018-08-24T16:13:13Z

Codecov Report

Merging #3446 into master will increase coverage by 0.13%.
The diff coverage is 72.87%.

@@             Coverage Diff              @@
##             master    #3446      +/-   ##
============================================
+ Coverage     50.39%   50.52%   +0.13%     
  Complexity      188      188              
============================================
  Files           172      172              
  Lines         13986    14096     +110     
  Branches        457      457              
============================================
+ Hits           7048     7122      +74     
- Misses         6713     6749      +36     
  Partials        225      225

Impacted Files	Coverage Δ	Complexity Δ
include/xgboost/objective.h	`27.27% <ø> (ø)`	`0 <0> (ø)`	⬇️
src/gbm/gbtree.cc	`18.51% <0%> (ø)`	`0 <0> (ø)`	⬇️
src/tree/updater_fast_hist.cc	`1.36% <0%> (ø)`	`0 <0> (ø)`	⬇️
src/tree/updater_colmaker.cc	`1.5% <0%> (ø)`	`0 <0> (ø)`	⬇️
src/objective/multiclass_obj.cc	`15.85% <0%> (-0.2%)`	`0 <0> (ø)`
src/tree/updater_refresh.cc	`5.88% <0%> (ø)`	`0 <0> (ø)`	⬇️
src/tree/updater_skmaker.cc	`2.16% <0%> (ø)`	`0 <0> (ø)`	⬇️
src/gbm/gblinear.cc	`13.79% <0%> (ø)`	`0 <0> (ø)`	⬇️
src/tree/updater_histmaker.cc	`2.82% <0%> (ø)`	`0 <0> (ø)`	⬇️
src/learner.cc	`28.95% <0%> (-0.09%)`	`0 <0> (ø)`
... and 24 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update cb4de52...7ec438e. Read the comment docs.

- added distributions to HostDeviceVector - using HostDeviceVector for labels, weights and base margings in MetaInfo - using HostDeviceVector for offset and data in SparsePage - other necessary refactoring

- const versions added to calls that can trigger data transfers, e.g. DevicePointer() - updated the code that uses HostDeviceVector - objective functions now accept const HostDeviceVector<bst_float>& for predictions

- this means no copies are performed if both host and devices access the HostDeviceVector read-only

- updated the lz4 plugin - added ConstDeviceSpan to HostDeviceVector - using device % dh::NVisibleDevices() for the physical device number, e.g. in calls to cudaSetDevice()

- replaced HostDeviceVector<unsigned int> with HostDeviceVector<int>

RAMitchell · 2018-08-29T06:13:03Z

@canonizer I have rebased this. Can you please review before I merge? It was a complicated process so there are probably mistakes. I have a backup of your PR before the rebase if necessary.

RAMitchell · 2018-08-29T06:45:42Z

A couple of GPU tests are currently failing because I removed the behaviour of "wrapping around" when a number of devices is specified that is greater than the number of available devices. This was to be consistent with what @trivialfis did in his PR. We need to decide if we are going to "wrap" the device ordinal or throw some error. I'm thinking we should throw an error because if the user selected multiple GPUs they probably did not intend for it to simply run and multiple blocks on the same GPU.

We could check for a sufficient available devices inside of the GPUDistribution or GPUSet.

…x-hdvec

- added a mock set device handler; when set, it is called instead of cudaSetDevice()

codecov-io · 2018-08-29T17:30:01Z

Codecov Report

Merging #3446 into master will increase coverage by 0.12%.
The diff coverage is 73.01%.

@@             Coverage Diff             @@
##             master   #3446      +/-   ##
===========================================
+ Coverage     50.97%   51.1%   +0.12%     
  Complexity      188     188              
===========================================
  Files           176     176              
  Lines         14090   14200     +110     
  Branches        457     457              
===========================================
+ Hits           7183    7257      +74     
- Misses         6682    6718      +36     
  Partials        225     225

Impacted Files	Coverage Δ	Complexity Δ
include/xgboost/objective.h	`27.27% <ø> (ø)`	`0 <0> (ø)`	⬇️
src/learner.cc	`28.95% <0%> (-0.09%)`	`0 <0> (ø)`
src/tree/updater_colmaker.cc	`1.54% <0%> (ø)`	`0 <0> (ø)`	⬇️
src/objective/multiclass_obj.cc	`15.85% <0%> (-0.2%)`	`0 <0> (ø)`
src/gbm/gbtree.cc	`18.51% <0%> (ø)`	`0 <0> (ø)`	⬇️
src/tree/updater_fast_hist.cc	`1.39% <0%> (ø)`	`0 <0> (ø)`	⬇️
src/gbm/gblinear.cc	`13.79% <0%> (ø)`	`0 <0> (ø)`	⬇️
src/tree/updater_refresh.cc	`5.88% <0%> (ø)`	`0 <0> (ø)`	⬇️
src/tree/updater_histmaker.cc	`2.82% <0%> (ø)`	`0 <0> (ø)`	⬇️
src/tree/updater_skmaker.cc	`2.16% <0%> (ø)`	`0 <0> (ø)`	⬇️
... and 24 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 58d783d...aa2cfc8. Read the comment docs.

dmlc#3446) * Replaced std::vector with HostDeviceVector in MetaInfo and SparsePage. - added distributions to HostDeviceVector - using HostDeviceVector for labels, weights and base margings in MetaInfo - using HostDeviceVector for offset and data in SparsePage - other necessary refactoring * Added const version of HostDeviceVector API calls. - const versions added to calls that can trigger data transfers, e.g. DevicePointer() - updated the code that uses HostDeviceVector - objective functions now accept const HostDeviceVector<bst_float>& for predictions * Updated src/linear/updater_gpu_coordinate.cu. * Added read-only state for HostDeviceVector sync. - this means no copies are performed if both host and devices access the HostDeviceVector read-only * Fixed linter and test errors. - updated the lz4 plugin - added ConstDeviceSpan to HostDeviceVector - using device % dh::NVisibleDevices() for the physical device number, e.g. in calls to cudaSetDevice() * Fixed explicit template instantiation errors for HostDeviceVector. - replaced HostDeviceVector<unsigned int> with HostDeviceVector<int> * Fixed HostDeviceVector tests that require multiple GPUs. - added a mock set device handler; when set, it is called instead of cudaSetDevice()

Replaced std::vector with HostDeviceVector in MetaInfo and SparsePage.

0895c86

- added distributions to HostDeviceVector - using HostDeviceVector for labels, weights and base margings in MetaInfo - using HostDeviceVector for offset and data in SparsePage - other necessary refactoring

RAMitchell reviewed Jul 23, 2018

View reviewed changes

canonizer added 6 commits August 13, 2018 18:49

Merge branch 'upstream-master' into dmatrix-hdvec

14e32f9

Added const version of HostDeviceVector API calls.

deb629e

- const versions added to calls that can trigger data transfers, e.g. DevicePointer() - updated the code that uses HostDeviceVector - objective functions now accept const HostDeviceVector<bst_float>& for predictions

Updated src/linear/updater_gpu_coordinate.cu.

5fadf7d

Added read-only state for HostDeviceVector sync.

2e8a30d

- this means no copies are performed if both host and devices access the HostDeviceVector read-only

Merge branch 'upstream-master' into dmatrix-hdvec

28b8bc1

Merge branch 'upstream-master' into dmatrix-hdvec

5d00c2a

RAMitchell and others added 2 commits August 20, 2018 16:50

Merge branch 'master' into dmatrix-hdvec

6e6bdb7

Fixed linter and test errors.

457d731

- updated the lz4 plugin - added ConstDeviceSpan to HostDeviceVector - using device % dh::NVisibleDevices() for the physical device number, e.g. in calls to cudaSetDevice()

Merge branch 'upstream-master' into dmatrix-hdvec

592caf9

trivialfis mentioned this pull request Aug 24, 2018

Merge generic device helper functions into gpu set. #3626

Merged

Fixed explicit template instantiation errors for HostDeviceVector.

7ec438e

- replaced HostDeviceVector<unsigned int> with HostDeviceVector<int>

trivialfis mentioned this pull request Aug 27, 2018

Lanuch function for unifying CPU and GPU code. #3608

Closed

RAMitchell force-pushed the dmatrix-hdvec branch from 11f0bc6 to 7ec438e Compare August 29, 2018 02:55

canonizer added 6 commits August 29, 2018 15:42

Replaced std::vector with HostDeviceVector in MetaInfo and SparsePage.

4d8f819

- added distributions to HostDeviceVector - using HostDeviceVector for labels, weights and base margings in MetaInfo - using HostDeviceVector for offset and data in SparsePage - other necessary refactoring

Added const version of HostDeviceVector API calls.

8352885

- const versions added to calls that can trigger data transfers, e.g. DevicePointer() - updated the code that uses HostDeviceVector - objective functions now accept const HostDeviceVector<bst_float>& for predictions

Updated src/linear/updater_gpu_coordinate.cu.

fd803db

Added read-only state for HostDeviceVector sync.

97ba477

- this means no copies are performed if both host and devices access the HostDeviceVector read-only

Fixed linter and test errors.

bdcce75

- updated the lz4 plugin - added ConstDeviceSpan to HostDeviceVector - using device % dh::NVisibleDevices() for the physical device number, e.g. in calls to cudaSetDevice()

Fixed explicit template instantiation errors for HostDeviceVector.

bf950c2

- replaced HostDeviceVector<unsigned int> with HostDeviceVector<int>

RAMitchell force-pushed the dmatrix-hdvec branch 3 times, most recently from 0fb1ef1 to 32cb6f4 Compare August 29, 2018 05:35

Rebase

2a04551

RAMitchell force-pushed the dmatrix-hdvec branch from 32cb6f4 to 2a04551 Compare August 29, 2018 05:49

canonizer added 2 commits August 29, 2018 16:49

Merge branch 'dmatrix-hdvec' of github.com:teju85/xgboost into dmatri…

690d165

…x-hdvec

Fixed HostDeviceVector tests that require multiple GPUs.

aa2cfc8

- added a mock set device handler; when set, it is called instead of cudaSetDevice()

RAMitchell merged commit 72cd151 into dmlc:master Aug 30, 2018

lock bot locked as resolved and limited conversation to collaborators Nov 28, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Replaced std::vector with HostDeviceVector in MetaInfo and SparsePage. #3446

Replaced std::vector with HostDeviceVector in MetaInfo and SparsePage. #3446

canonizer commented Jul 5, 2018

RAMitchell left a comment

canonizer commented Jul 31, 2018

RAMitchell commented Jul 31, 2018

canonizer commented Aug 17, 2018

trivialfis commented Aug 22, 2018

codecov-io commented Aug 24, 2018 •

edited

Loading

RAMitchell commented Aug 29, 2018

RAMitchell commented Aug 29, 2018

codecov-io commented Aug 29, 2018

Replaced std::vector with HostDeviceVector in MetaInfo and SparsePage. #3446

Replaced std::vector with HostDeviceVector in MetaInfo and SparsePage. #3446

Conversation

canonizer commented Jul 5, 2018

RAMitchell left a comment

Choose a reason for hiding this comment

canonizer commented Jul 31, 2018

RAMitchell commented Jul 31, 2018

canonizer commented Aug 17, 2018

trivialfis commented Aug 22, 2018

codecov-io commented Aug 24, 2018 • edited Loading

Codecov Report

RAMitchell commented Aug 29, 2018

RAMitchell commented Aug 29, 2018

codecov-io commented Aug 29, 2018

Codecov Report

codecov-io commented Aug 24, 2018 •

edited

Loading