Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[doc] [dask] fix partitioning in Dask example #6389

Merged
merged 5 commits into from
Dec 14, 2020
Merged

[doc] [dask] fix partitioning in Dask example #6389

merged 5 commits into from
Dec 14, 2020

Conversation

jameslamb
Copy link
Contributor

Tonight I was testing the newest changes to xgboost.dask (#6343 (comment)), and realized I made a mistake in #6345 !

AssertionError: Partitions between X and labels are not consistent: 8 != 1. Please try to repartition/rechunk your data.

I realize I should have set the partitioning explicitly, to be sure data and y are partitioned the same way. This PR fixes that. Sorry for the mistake 😬

I also found while doing this that da.random.choice() puts heavy memory demands on the client, so for this example it should be switched to da.random.random() and a regression objective. I'm going to open an issue in dask/dask for the da.random.choice() issue tomorrow.

@codecov-io
Copy link

codecov-io commented Nov 13, 2020

Codecov Report

Merging #6389 (e9ad559) into master (fcfeb49) will decrease coverage by 0.08%.
The diff coverage is n/a.

Impacted file tree graph

@@            Coverage Diff             @@
##           master    #6389      +/-   ##
==========================================
- Coverage   79.95%   79.86%   -0.09%     
==========================================
  Files          12       12              
  Lines        3477     3477              
==========================================
- Hits         2780     2777       -3     
- Misses        697      700       +3     
Impacted Files Coverage Δ
python-package/xgboost/tracker.py 93.97% <0.00%> (-1.21%) ⬇️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update fcfeb49...e9ad559. Read the comment docs.

@jameslamb
Copy link
Contributor Author

I just updated this to the most recent master.

Copy link
Collaborator

@hcho3 hcho3 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for testing the latest changes!

@trivialfis
Copy link
Member

Let me rerun the CI.

@trivialfis trivialfis merged commit afc4567 into dmlc:master Dec 14, 2020
@jameslamb jameslamb deleted the docs/dask branch June 1, 2021 04:14
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants