Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dawn HPC improvements to pipeline #43

Merged
merged 15 commits into from
Aug 21, 2024

Conversation

JimCircadian
Copy link
Member

This will encompass a load of changes made to support use of the Dawn HPC, which requires slightly more in the way of flexibility for configuring jobs. This also relates to the inclusion of Horovod based training, and likely should be targeted at a 0.3 version of the pipeline instead of main, should these changes diverge too far (at present, they should be compatible with 0.2 releases as well)

At time of creation some more work is needed to integrate changes from Dawn itself, which I'll get to doing very soon

@JimCircadian JimCircadian requested a review from bnubald June 8, 2024 14:01
@JimCircadian JimCircadian self-assigned this Jun 8, 2024
@bnubald bnubald changed the base branch from main to v0.3.0_dev July 9, 2024 15:12
ensemble/predict.tmpl.yaml Outdated Show resolved Hide resolved
@JimCircadian JimCircadian marked this pull request as ready for review August 20, 2024 19:43
@JimCircadian JimCircadian requested a review from bnubald August 20, 2024 19:44
Copy link
Contributor

@bnubald bnubald left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

getopts definitions missing for new options in run_predict_ensemble.sh and run_train_ensemble.sh?

run_predict_ensemble.sh Outdated Show resolved Hide resolved
run_train_ensemble.sh Show resolved Hide resolved
JimCircadian and others added 2 commits August 20, 2024 21:12
Co-authored-by: Bryn Noel Ubald <55503826+bnubald@users.noreply.github.com>
@JimCircadian JimCircadian added this to the v0.3.0 milestone Aug 21, 2024
@JimCircadian JimCircadian linked an issue Aug 21, 2024 that may be closed by this pull request
@JimCircadian JimCircadian merged commit 4f98699 into icenet-ai:v0.3.0_dev Aug 21, 2024
@JimCircadian JimCircadian deleted the 38_dawn_hpc branch August 21, 2024 14:32
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Adding support for Dawn HPC
2 participants