[Lang] MatrixNdarray refactor part9: Add scalarization for AllocaStmt #6168

Merged
jim19930609 merged 10 commits into taichi-dev:master on Sep 29, 2022

Conversation

jim19930609
Contributor

@jim19930609 jim19930609 commented Sep 27, 2022

Related issues: #5873, #5819

This PR implements "Part ④" of #5873.

[AllocaStmt scalarization]

Before:
  TensorType<4 x i32>* addr = AllocaStmt(TensorType<4 x i32>)

After:
  i32 addr0 = AllocaStmt(i32)
  i32 addr1 = AllocaStmt(i32)
  i32 addr2 = AllocaStmt(i32)
  i32 addr3 = AllocaStmt(i32)

  scalarized_local_tensor_map_[addr] = {addr0, addr1, addr2, addr3}
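
For concreteness, here is a rough C++ sketch of how the AllocaStmt case could look inside the Scalarize visitor. The member names (modifier_, scalarized_local_tensor_map_) and the exact constructor signatures are illustrative assumptions, not necessarily the code in this PR:

  // Assumption: the pass is a statement visitor holding a DelayedIRModifier
  // `modifier_` and a map from the original alloca to its scalar replacements.
  void visit(AllocaStmt *stmt) override {
    auto *tensor_type = stmt->ret_type->cast<TensorType>();
    if (!tensor_type)
      return;  // Scalar allocas are left untouched.

    auto primitive_type = tensor_type->get_element_type();
    std::vector<Stmt *> scalarized;
    for (int i = 0; i < tensor_type->get_num_elements(); i++) {
      // One scalar AllocaStmt per tensor element.
      auto scalar_alloca = Stmt::make<AllocaStmt>(primitive_type);
      scalarized.push_back(scalar_alloca.get());
      modifier_.insert_before(stmt, std::move(scalar_alloca));
    }
    // Record which scalar allocas replace the original TensorType alloca.
    scalarized_local_tensor_map_[stmt] = std::move(scalarized);
  }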

[Load AllocaStmt]

Before:
  TensorType<4 x i32> val = LoadStmt(TensorType<4 x i32>* alloca_src)

After:
  i32 val0 = LoadStmt(scalarized_local_tensor_map_[alloca_src][0])
  i32 val1 = LoadStmt(scalarized_local_tensor_map_[alloca_src][1])
  i32 val2 = LoadStmt(scalarized_local_tensor_map_[alloca_src][2])
  i32 val3 = LoadStmt(scalarized_local_tensor_map_[alloca_src][3])

  tmp = MatrixInitStmt(val0, val1, val2, val3)
  stmt->replace_all_usages_with(tmp)
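
Correspondingly, a hedged sketch of the load case might look like the following; the helper name, the LocalLoadStmt/MatrixInitStmt constructor signatures, and the modifier_ API are assumptions for illustration rather than the exact code in this PR:

  // `stmt` is the original TensorType load; `alloca_src` is its (already
  // scalarized) source alloca.
  void scalarize_load(Stmt *stmt, Stmt *alloca_src) {
    auto &scalar_allocas = scalarized_local_tensor_map_[alloca_src];

    std::vector<Stmt *> loaded_values;
    for (Stmt *scalar_alloca : scalar_allocas) {
      // Load each scalar alloca individually.
      auto load = Stmt::make<LocalLoadStmt>(scalar_alloca);
      loaded_values.push_back(load.get());
      modifier_.insert_before(stmt, std::move(load));
    }

    // Pack the scalar loads into a MatrixInitStmt and redirect all users of
    // the original TensorType load to it.
    auto matrix_init = Stmt::make<MatrixInitStmt>(loaded_values);
    stmt->replace_all_usages_with(matrix_init.get());
    modifier_.insert_before(stmt, std::move(matrix_init));
    modifier_.erase(stmt);
  }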

[Store to AllocaStmt]

Before:
  StoreStmt(TensorType<4 x i32>* alloca_dest_stmt, TensorType<4 x i32> val)

After:
  StoreStmt(i32* scalarized_local_tensor_map_[alloca_dest_stmt][0],
            i32 val->cast<MatrixInitStmt>()->val[0])
  StoreStmt(i32* scalarized_local_tensor_map_[alloca_dest_stmt][1],
            i32 val->cast<MatrixInitStmt>()->val[1])
  StoreStmt(i32* scalarized_local_tensor_map_[alloca_dest_stmt][2],
            i32 val->cast<MatrixInitStmt>()->val[2])
  StoreStmt(i32* scalarized_local_tensor_map_[alloca_dest_stmt][3],
            i32 val->cast<MatrixInitStmt>()->val[3])
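
The store case, in the same hedged style (the field holding MatrixInitStmt's operands is called `values` here; the real field name and constructor signatures may differ):

  // `stmt` is the original TensorType store of `val` into `alloca_dest_stmt`.
  void scalarize_store(Stmt *stmt, Stmt *alloca_dest_stmt, Stmt *val) {
    auto &scalar_allocas = scalarized_local_tensor_map_[alloca_dest_stmt];
    auto *matrix_init = val->cast<MatrixInitStmt>();

    for (std::size_t i = 0; i < scalar_allocas.size(); i++) {
      // Store the i-th scalar of the matrix into the i-th scalar alloca.
      auto store = Stmt::make<LocalStoreStmt>(scalar_allocas[i],
                                              matrix_init->values[i]);
      modifier_.insert_before(stmt, std::move(store));
    }
    modifier_.erase(stmt);  // The original TensorType store is no longer needed.
  }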

@netlify

netlify bot commented Sep 27, 2022

Deploy Preview for docsite-preview ready!

🔨 Latest commit: 431ff4c
🔍 Latest deploy log: https://app.netlify.com/sites/docsite-preview/deploys/63352275f0d76c000889f8a5
😎 Deploy Preview: https://deploy-preview-6168--docsite-preview.netlify.app

@jim19930609
Contributor Author

jim19930609 commented Sep 27, 2022

Going to enable more Python tests in follow-up PRs.

@AD1024
Contributor

AD1024 commented Sep 27, 2022

Thanks for implementing this! LGTM.
Are we leveraging the CFG pass to eliminate that MatrixInitStmt holding pointers to scalarized values?

@jim19930609
Contributor Author

Thanks for implementing this! LGTM. Are we leveraging the CFG pass to eliminate that MatrixInitStmt holding pointers to scalarized values?

I don't think MatrixInitStmt is holding any pointer-typed values? If you are referring to the redundant MatrixInitStmts inserted during scalarization, those are removed right after scalarization is done (there will be a MatrixInitCleanUp pass).

Contributor

@strongoier strongoier left a comment


Hmm.. I think having a case in the LowerMatrixPtr pass is much easier? You don't have to modify the Scalarize pass at all.

@jim19930609
Contributor Author

jim19930609 commented Sep 28, 2022

Hmm.. I think having a case in the LowerMatrixPtr pass is much easier? You don't have to modify the Scalarize pass at all.

But it is trying to scalarize AllocaStmt(TensorType) into AllocaStmt(scalar0), ..., so it fits well into the scalarization pass?

The other thing is that LowerMatrixPtr doesn't depend on whether "scalarized" is turned on or not, while scalarization for AllocaStmt should only be enabled when we decide to do so, so I feel it's better to decouple these two passes?

@strongoier
Contributor

strongoier commented Sep 28, 2022

But it is trying to scalarize AllocaStmt(TensorType) into AllocaStmt(scalar0), ..., so it fits well into the scalarization pass?

IIUC, Scalarization aims at faking a tensor operation with a bunch of scalar operations, while LowerMatrixPtr aims at eliminating intermediate matrix ptrs. I agree that AllocaStmt(TensorType) -> AllocaStmt(scalar0), ... should be put in Scalarization, but I think PtrOffsetStmt(AllocaStmt) would be better handled in LowerMatrixPtr. Thus, theoretically, we need an intermediate representation like MatrixOfPtrStmt.

However, considering that AllocaStmt(TensorType) -> AllocaStmt(scalar0), ... may not be needed in the end, and we haven't decided which parts are mandatory or optional, I think we don't need to be so strict with pass separation for now. The only suggestion here may be to simplify the implementation a bit - it seems that visit(PtrOffsetStmt) already covers the changes in visit(LoadStmt) / visit(StoreStmt) / ...?

@jim19930609
Contributor Author

But it is trying to scalarize AllocaStmt(TensorType) into AllocaStmt(scalar0), ..., so it fits well into the scalarization pass?

IIUC, Scalarization aims at faking a tensor operation with a bunch of scalar operations, while LowerMatrixPtr aims at eliminating intermediate matrix ptrs. I agree that AllocaStmt(TensorType) -> AllocaStmt(scalar0), ... should be put in Scalarization, but I think PtrOffsetStmt(AllocaStmt) would be better handled in LowerMatrixPtr. Thus, theoretically, we need an intermediate representation like MatrixOfPtrStmt.

However, considering that AllocaStmt(TensorType) -> AllocaStmt(scalar0), ... may not be needed in the end, and we haven't decided which parts are mandatory or optional, I think we don't need to be so strict with pass separation for now. The only suggestion here may be to simplify the implementation a bit - it seems that visit(PtrOffsetStmt) already covers the changes in visit(LoadStmt) / visit(StoreStmt) / ...?

Simplified irpass::scalarize() but still kept scalarization for AllocaStmt and PtrOffsetStmt as part of irpass::scalarize(). The barrier to splitting "scalarization for AllocaStmt" and "optimization for PtrOffsetStmt(AllocaStmt)" is that we need this scalarized_local_tensor_map_ to record the mapping from the original AllocaStmt to the scalarized ones, and transferring scalarized_local_tensor_map_ across passes would be ugly implementation-wise.

Contributor

@strongoier strongoier left a comment


Great!!

Review threads (resolved): taichi/transforms/scalarize.cpp, tests/cpp/transforms/scalarize_test.cpp
@jim19930609 jim19930609 merged commit 8be973b into taichi-dev:master Sep 29, 2022