-
-
Notifications
You must be signed in to change notification settings - Fork 719
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
distributed fails with large graph/dataset #4678
Comments
Can you please test with Dask PR ( dask/dask#7525 ) and Distributed PR ( #4677 )? cc @madsbk |
That fixes this particular issue. Thank you very much! |
Great, thanks for following up @tom-andersson and @madsbk for fixing the issue |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
What happened:
I'm using a local distributed cluster to perform computations with a large 2190x432x432x184 array, with chunks of 7x432x432x184. Dask/distributed works beautifully when I run my computation on a subset of my data, but when running on my full dataset (~500GB) I get the following error:
ValueError: 3713192179 exceeds max_bin_len(2147483647)
. Changes to the number of workers/threads or chunking seems to have no effect. I suspect this is an issue with the graph size, but Googling finds few matches so I'm not sure.I get this error with
dask
anddistributed
version2021.3.1
. I checked previous versions and the error does not occur. However, the speed of computation seems a lot higher with2021.3.1
over previous versions, so I'm reluctant to downgrade.Below is the full trace:
What you expected to happen:
No error.
Minimal Complete Verifiable Example:
I tried to crease a MWE that mimics my script using random data, but I'm unable to get the same error. The gist is that it computes a mean over the 1st and 2nd axis of a 2190x432x432x184 array. In practice there are a few more steps so the graph will be more complicated. This might be why I can't recreate the error in this MWE. Below is a Colab link.
https://colab.research.google.com/drive/1KUP6nrPQfzutsyQjgduD8TrPQMDQbiu_?usp=sharing
Environment:
The text was updated successfully, but these errors were encountered: