Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[multi-node] Fix sage infer hang #287

Merged
merged 10 commits into from
May 17, 2023

Conversation

DesmonDay
Copy link

PR types

PR changes

Describe

@DesmonDay DesmonDay changed the title Fix infer hang [multi-node] Fix sage infer hang May 16, 2023
if (local_reach_end) {
conf_.buf_size /= 2;
}
total_row_[0] = conf_.buf_size;
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

再提一个建议,这里最好不要修改buf_size_。
如果后面还有其他类型节点需要infer,这里buf_size_得不到reset,不友好。
建议这样,
if (global_reach_end)
total_row_[0] = device_key_size - global_infer_node_type_start[infer_cursor];
else
remain = device_key_size - global_infer_node_type_start[infer_cursor];
if local_reach_end
total_row_[0] = remain / 2;
else
total_row_[0] = remain;

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done,不过最后那个else那里,应该是total_row_[0] = conf_.buf_size;

Copy link
Collaborator

@lxsbupt lxsbupt left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

good

@lxsbupt lxsbupt merged commit e333356 into xuewujiao:gpugraph_new May 17, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants