
[Relax][PyTorch] Fix output shape of torch.nn.functional.scaled_dot_product_attention #17379

Merged · 3 commits into apache:main on Sep 20, 2024

Conversation

mshr-h (Contributor) commented on Sep 16, 2024

torch.nn.functional.scaled_dot_product_attention produces output of shape (N, ..., L, E_v), whereas relax.op.nn.attention produces (N, L, ..., E_v), so the converter's output also needs to be transposed to match PyTorch.
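For reference, a minimal sketch of the layout difference (the shapes are illustrative, and the permute_dims call is only an example of how the axes could be swapped, not necessarily the exact converter code):

```python
import torch
import torch.nn.functional as F

N, H, L, E = 32, 8, 128, 64
q = torch.randn(N, H, L, E)
k = torch.randn(N, H, L, E)
v = torch.randn(N, H, L, E)

# PyTorch keeps the head axis before the sequence axis: (N, H, L, E_v).
out = F.scaled_dot_product_attention(q, k, v)
assert out.shape == (N, H, L, E)

# relax.op.nn.attention works in the (N, L, H, E_v) layout, so the converter
# has to swap the head and sequence axes of its result to match PyTorch,
# e.g. with something equivalent to:
#   relax.op.permute_dims(attn_out, axes=[0, 2, 1, 3])
```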

Maybe we should add E2E tests in tests/python/nightly/ to check the Relax PyTorch frontend.
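For example, something along these lines; this is only a rough sketch that assumes the fx frontend (tvm.relax.frontend.torch.from_fx) plus a LegalizeOps + relax.build flow on LLVM, so the exact pipeline and helpers may need adjusting:

```python
# Rough sketch of an E2E check for the fx frontend (names and pipeline are
# illustrative; an actual nightly test may be structured differently).
import torch
import torch.fx as fx
import torch.nn.functional as F
import tvm
import tvm.testing
from tvm import relax
from tvm.relax.frontend.torch import from_fx


class SDPA(torch.nn.Module):
    def forward(self, q, k, v):
        return F.scaled_dot_product_attention(q, k, v)


def test_sdpa_e2e():
    shape, dtype = (32, 8, 128, 64), "float32"
    inputs = [torch.randn(shape) for _ in range(3)]

    # Reference result from PyTorch.
    expected = SDPA()(*inputs).numpy()

    # Import through the fx frontend and run via the Relax VM.
    graph_model = fx.symbolic_trace(SDPA())
    mod = from_fx(graph_model, [(shape, dtype)] * 3)
    mod = relax.transform.LegalizeOps()(mod)
    ex = relax.build(mod, target="llvm")
    vm = relax.VirtualMachine(ex, tvm.cpu())
    res = vm["main"](*[tvm.nd.array(x.numpy()) for x in inputs])
    # The result may come back wrapped in a tuple depending on frontend options.
    actual = res.numpy() if hasattr(res, "numpy") else res[0].numpy()

    tvm.testing.assert_allclose(actual, expected, rtol=1e-5, atol=1e-5)
```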

cc: @yongwww

@mshr-h marked this pull request as ready for review on September 16, 2024 14:25
@mshr-h changed the title from "Fix torch sdpa converter" to "[Relax][PyTorch] Fix output shape of torch.nn.functional.scaled_dot_product_attention" on Sep 16, 2024
@mshr-h marked this pull request as draft on September 17, 2024 02:38
yongwww (Member) commented on Sep 17, 2024

we can transpose to get the expected result. Thanks for the effort!

yongwww (Member) left a review comment


overall looks good to me

Review comment on tests/python/nightly/relax/test_frontend_from_fx.py (outdated, resolved)
@mshr-h marked this pull request as ready for review on September 17, 2024 04:37
@mshr-h force-pushed the fix-torch-sdpa-converter branch 2 times, most recently from 456c72e to 185d28c on September 17, 2024 07:33
mshr-h (Contributor, Author) commented on Sep 17, 2024

The MSC E2E test is failing. It seems we also need to change something beyond the Relax frontend: in the log below, the actual and desired output shapes differ only in the head and sequence axes.
@Archermmt Do you have any ideas on how to fix the error?

Link to the CI log: https://ci.tlcpack.ai/blue/organizations/jenkins/tvm-unity/detail/PR-17379/6/pipeline/

tests/python/contrib/test_msc/test_translate_torch.py::test_attention FAILED

=================================== FAILURES ===================================
________________________________ test_attention ________________________________

    def test_attention():
        """test torch translator for attention"""

        # pylint: disable=import-outside-toplevel
        import torch.nn.functional as F

        class Attention1(Module):
            def forward(self, q_data, k_data, v_data):
                return F.scaled_dot_product_attention(q_data, k_data, v_data)

        class Attention2(Module):
            def forward(self, q_data, k_data, v_data):
                return F.scaled_dot_product_attention(q_data, k_data, v_data, is_causal=True)

        input_info = [
            ([32, 8, 128, 64], "float32"),
            ([32, 8, 128, 64], "float32"),
            ([32, 8, 128, 64], "float32"),
        ]
>       verify_model(Attention1(), input_info)

tests/python/contrib/test_msc/test_translate_torch.py:1127:
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
tests/python/contrib/test_msc/test_translate_torch.py:52: in verify_model
    tvm.testing.assert_allclose(
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

actual = array([[[[0.5253085 , 0.48211107, 0.52921075, ..., 0.518867 ,
          0.49926636, 0.48493868],
         [0.5294311 ...5],
         [0.47335747, 0.48579183, 0.5360674 , ..., 0.543607 ,
          0.5020893 , 0.47848547]]]], dtype=float32)
desired = array([[[[0.5253085 , 0.48211107, 0.52921075, ..., 0.518867 ,
          0.49926636, 0.48493868],
         [0.49697113... ],
         [0.47335747, 0.48579183, 0.5360674 , ..., 0.543607 ,
          0.5020893 , 0.47848547]]]], dtype=float32)
rtol = 1e-05, atol = 1e-05

    def assert_allclose(actual, desired, rtol=1e-7, atol=1e-7):
        """Version of np.testing.assert_allclose with atol and rtol fields set
        in reasonable defaults.

        Arguments actual and desired are not interchangeable, since the function
        compares the abs(actual-desired) with atol+rtol*abs(desired). Since we
        often allow desired to be close to zero, we generally want non-zero atol.
        """
        actual = np.asanyarray(actual)
        desired = np.asanyarray(desired)
>       np.testing.assert_allclose(actual.shape, desired.shape)
E       AssertionError:
E       Not equal to tolerance rtol=1e-07, atol=0
E
E       Mismatched elements: 2 / 4 (50%)
E       Max absolute difference: 120
E       Max relative difference: 15.
E       x: array([ 32,   8, 128,  64])
E       y: array([ 32, 128,   8,  64])

python/tvm/testing/utils.py:119: AssertionError

@mshr-h force-pushed the fix-torch-sdpa-converter branch from 185d28c to 43268e1 on September 17, 2024 14:50
@mshr-h force-pushed the fix-torch-sdpa-converter branch from 43268e1 to a783823 on September 19, 2024 05:25
@mshr-h force-pushed the fix-torch-sdpa-converter branch from a783823 to a2b29c0 on September 19, 2024 09:06
@yongwww merged commit 85f2cc3 into apache:main on Sep 20, 2024 (17 of 18 checks passed)
@mshr-h deleted the fix-torch-sdpa-converter branch on September 20, 2024 04:30