-
Notifications
You must be signed in to change notification settings - Fork 429
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[WIP] JUCX: client id support #7136
Conversation
…uest. UCP: Apply review comments. UCP: support backward compatibility. UCP/WIREUP_CM: Fix void pointer error. UCP/WIREUP: use sa_data + 1. UCP/ADDRESS: unpack address and get it's size. UCP/WIREUP_EP: initialize client_id. AZP: Test binary compatibility. Use dpkg UCP/EP: build ucx-1.10 in AZP. AZP: enable examples. AZP: Build ucp_client_server with master bits. UCP/EP: Test binary compatibility between v1.11.x. UCP/EP: Set address mode AM_LANE_ONLY when we cut address. UCP/EP: Minor formatting. UCP/EP: Update src/ucp/wireup/wireup_cm.c Co-authored-by: dmitrygx <dmitrygla@nvidia.com> UCP/EP: Update test/gtest/ucp/test_ucp_sockaddr.cc Co-authored-by: dmitrygx <dmitrygla@nvidia.com> UCP/EP: Update src/ucp/wireup/wireup_cm.c Co-authored-by: dmitrygx <dmitrygla@nvidia.com> UCP/EP: Update src/ucp/wireup/wireup_cm.c Co-authored-by: dmitrygx <dmitrygla@nvidia.com> UCP/EP: Update test/gtest/ucp/test_ucp_sockaddr.cc Co-authored-by: dmitrygx <dmitrygla@nvidia.com> UCP/EP: Update src/ucp/wireup/wireup_cm.c Co-authored-by: dmitrygx <dmitrygla@nvidia.com> UCP/EP: Update src/ucp/wireup/wireup_cm.c Co-authored-by: dmitrygx <dmitrygla@nvidia.com> UCP/EP: Update test/gtest/ucp/test_ucp_sockaddr.cc Co-authored-by: dmitrygx <dmitrygla@nvidia.com> UCP/EP: Apply review comments.
@@ -780,6 +774,11 @@ ucp_ep_create_api_conn_request(ucp_worker_h worker, | |||
ucp_ep_h ep; | |||
ucs_status_t status; | |||
|
|||
if (params->field_mask & UCP_EP_PARAM_FIELD_CLIENT_ID) { | |||
ucs_error("client id is supported only for sockaddr connection establishment"); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
does this mean that client id is not supported for RDMACM?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
No, this ment to set client_id only for ep creation by sockaddr, neither by worker_addr nor by conn_request
@petro-rudenko api makes sense so far, and I can see the clientId populated in the In order to reject a request, do we need to call something? Or just do nothing in the callback (poc patch: abellina/spark-rapids@f156f7b for reference) I am not able to fully test this right now since I am getting an exception when creating the endpoint:
And the UCX log says:
Which seems related to the comment above. I did not specify a Note I also tried with: |
Just updating here. We discussed offline and the issue I had was that I specified the client id both on new endpoints to a remote listener, and to endpoints created as a response of a connection request. If we don't set the client id on the connection request, this error goes away. I know the API is a bit in flux, there was another issue with what looked like the client id sometimes not being set (i.e. a random high number instead of what I was setting it to). @petro-rudenko any idea on when the new API may be up? |
Closing in favor of #7523 |
What
Follow up on #6859 for JUCX Spark tests
@abellina you can build ucx from this PR and use JUCX from SNAPSHOT of version
1.12.0-client-id-SNAPSHOT