Skip to content
This repository has been archived by the owner on Dec 20, 2024. It is now read-only.

full shuffle of the dataset #153

Merged
merged 3 commits into from
Nov 29, 2024
Merged

full shuffle of the dataset #153

merged 3 commits into from
Nov 29, 2024

Conversation

ssmmnn11
Copy link
Member

We currently split dates between tasks / workers and then the shuffling is performed in these subsets. These changes shuffle the dataset consistently on all workers and then each worker takes its subset from the shuffled indices.

Thx to @JoffreyDumontLeBrazidec for testing.

@ssmmnn11 ssmmnn11 added the enhancement New feature or request label Nov 20, 2024
@mchantry
Copy link
Member

This can be merged.

@anaprietonem anaprietonem merged commit 7e363fa into develop Nov 29, 2024
119 checks passed
@anaprietonem anaprietonem deleted the feature/full_shuffle branch November 29, 2024 09:26
@JPXKQX JPXKQX mentioned this pull request Dec 5, 2024
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants