[NVGPU] Propagate layouts through DotWait ops. #3324

Merged
merged 1 commit into from
Mar 8, 2024


jlebar
Collaborator

@jlebar jlebar commented Mar 8, 2024

[NVGPU] Propagate layouts through DotWait ops.

#3301 converts some tt.dot ops into
ttng.dot_async+wait. We need to propagate layouts through these the same way
we propagate them through regular tt.dot ops. Otherwise...well, eventually we
create unsupported IR, trying to convert from blocked to MMAv3 dot-operand
layout, which we cannot do.

This fixes an OAI-internal test.
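
To illustrate why the async dot and its wait both need to be treated as "transparent" for layout propagation, here is a minimal toy sketch in Python. It is not Triton's pass or its data structures: the `Op` class, the `propagate_forward` helper, and the string layouts `"mma"` and `"blocked"` are all hypothetical stand-ins. It only models the idea that when propagation stops at an op, a layout mismatch is left behind on that edge.

```python
from dataclasses import dataclass, field


@dataclass(eq=False)
class Op:
    """Hypothetical stand-in for an IR op with a single result layout."""
    kind: str                                      # e.g. "dot_async", "dot_wait", "add"
    operands: list = field(default_factory=list)   # producer ops
    layout: str = "blocked"                        # toy layout name for the result


def propagate_forward(ops, anchor, transparent_kinds):
    """Push anchor.layout to users whose kind is in transparent_kinds.

    `ops` must be in topological order. Returns the (producer, user) kind
    pairs where propagation stopped and the layouts disagree, i.e. the
    edges that would need a layout conversion in this toy model.
    """
    reached = {id(anchor)}
    conversions = []
    for op in ops:
        for producer in op.operands:
            if id(producer) not in reached:
                continue
            if op.kind in transparent_kinds:
                op.layout = producer.layout
                reached.add(id(op))
            elif op.layout != producer.layout:
                conversions.append((producer.kind, op.kind))
    return conversions


def build_chain():
    """dot_async -> dot_wait -> add, with the MMA layout anchored on the dot."""
    dot = Op("dot_async", layout="mma")
    wait = Op("dot_wait", [dot])
    user = Op("add", [wait])
    return [dot, wait, user], dot


# Without the wait op in the transparent set, propagation stops right after the dot.
ops, dot = build_chain()
stuck = propagate_forward(ops, dot, {"add"})

# With it, the MMA layout flows through the whole chain and no conversion is left.
ops2, dot2 = build_chain()
clean = propagate_forward(ops2, dot2, {"add", "dot_wait"})
```

In this toy model, `stuck` contains the single edge from `dot_async` to `dot_wait`, while `clean` is empty and every op in `ops2` ends up with the `"mma"` layout. The real pass operates on MLIR ops and encodings; the sketch only mirrors the shape of the problem described above.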

PR chain

  1. 👉 [NVGPU] Propagate layouts through DotWait ops. #3324 👈 YOU ARE HERE
  2. [Backend] Allow layout propagation through TransOp. #3316

@jlebar jlebar requested a review from ptillet as a code owner March 8, 2024 07:35
@jlebar jlebar requested a review from ThomasRaoux March 8, 2024 07:36
@jlebar jlebar force-pushed the dev-jlebar/propagate-dot-wait branch from d38c20b to b2cae8a Compare March 8, 2024 07:38
#3301 converts some tt.dot ops into
ttng.dot_async+wait.  We need to propagate layouts through these the same way
we propagate them through regular tt.dot ops.  Otherwise...well, eventually we
create unsupported IR, trying to convert from blocked to MMAv3 dot-operand
layout, which we cannot do.

This fixes an OAI-internal test.

GPC: propagate-dot-wait
@jlebar jlebar force-pushed the dev-jlebar/propagate-dot-wait branch from b2cae8a to 5fd0d0c Compare March 8, 2024 07:42
@jlebar
Collaborator Author

jlebar commented Mar 8, 2024

Thank you, Thomas.

@jlebar jlebar enabled auto-merge (squash) March 8, 2024 07:42
@jlebar jlebar merged commit 8f4755b into main Mar 8, 2024
4 checks passed
@jlebar jlebar deleted the dev-jlebar/propagate-dot-wait branch March 8, 2024 07:51
htyu pushed a commit to htyu/triton that referenced this pull request Mar 20, 2024
karupayun pushed a commit to openxla/triton that referenced this pull request Apr 3, 2024
2 participants