-
-
Notifications
You must be signed in to change notification settings - Fork 617
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update for CI #2741
Update for CI #2741
Conversation
@puhuk can you try to run the workflow: .github/workflows/pytorch-version-tests.yml on your fork to see if it is passing ? |
- pytorch-version: 1.3.1 | ||
python-version: 3.10 | ||
- pytorch-version: 1.4.0 | ||
python-version: 3.10 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Have you checked if 1.5.1, 1.6.0 etc have 3.10 python binaries ?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Let me check
When I test with github action, it tests with 3.1
not 3.10
. May I ask why it does not recognize 10
??
https://github.com/puhuk/ignite/actions/runs/3249191243/jobs/5331308987
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
it should be a str "3.10"
ignite/.github/workflows/unit-tests.yml
Line 39 in 8d5d7ca
python-version: [3.7, 3.8, 3.9, "3.10"] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I check from this page and I exclude all the pytorch version with python 3.10
except 1.11.0
and 1.12.1
Can you run the workflow on your fork and send here a link on the status ? |
It fails from pytorch 1.11.0 with python 3.10 |
Let's put a skip and add a comment why |
I remove It seems it occurs error when run subprocess.CalledProcessError: Command '['torchrun', '--nproc_per_node=4', '/data/projects/sangho/projects/oss/ignite/tests/ignite/distributed/check_idist_parallel.py', '--backend=gloo', '--init_method=file:///tmp/tmpqgwyyg02/shared']' died with <Signals.SIGSEGV: 11>. |
Where do you see this error ? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @puhuk !
When I reproduce the error from my server. |
Description: Update python, pytorch, cuda version for CI
Check list: