-
My initial thought is to use an opaque pointer model (in Vulkan parlance, "logical" pointers), where a pointer refers to a value that may be backed by a memory location, a register, scratchpad memory, or even a sparse memory location. We should not allow atomic and non-atomic modify operations to be mixed on the same memory region within a parallel offload (the size and attributes of a region are yet to be discussed, but I suggest treating a leaf SNode as a region). We should allow non-atomic reads to be mixed with atomics, but without guaranteeing any kind of visibility of operations in the worst case, which can potentially speed up concurrent queue algorithms and fast software rasterizers. (This is perhaps in contrast to WebGPU, where this is not allowed and is viewed as unsafe.)
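To make the proposed rule concrete, here is a minimal Taichi sketch (the fields and sizes are made up for illustration): atomic modifications mixed with non-atomic reads of the same region within one parallel offload would be allowed, but with no visibility guarantee, while mixing atomic and non-atomic modifications of the same region would not be.

```python
import taichi as ti

ti.init(arch=ti.vulkan)

# Hypothetical fields, purely for illustration.
counter = ti.field(ti.i32, shape=())
data = ti.field(ti.i32, shape=1024)

@ti.kernel
def push_items():
    # A top-level for loop is one parallel offload.
    for i in range(1024):
        slot = ti.atomic_add(counter[None], 1)  # atomic modify of `counter`
        # Non-atomic read of the same region, mixed with the atomics above:
        # allowed under the proposal, but which value it observes is
        # unspecified in the worst case.
        snapshot = counter[None]
        data[slot] = snapshot  # plain write to a different region: fine

# What the proposal would forbid within a single offload is mixing an
# atomic modify (e.g. ti.atomic_add(data[i], 1)) with a non-atomic modify
# (e.g. data[i] = 0) of the same region.
```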
-
Requesting comments from the WebGPU / WGSL implementation's point of view @AmesingFlank
-
I feel like I'm missing some context here, so excuse me for throwing down some questions first. What is the main purpose of defining a memory model for Taichi? Is it purely for documentation/specification purposes and for guiding future implementations? Or does it involve actual engineering work that structurally improves our backends/codegen? More specifically, you mentioned in Slack that this could potentially allow us to build the runtime components in CHI IR; may I ask how?
-
The porting effort of Taichi has reached a stage where we are targeting support for almost all compute-capable graphics devices on earth. Ahead of the v1.0 launch, I want to start work on a formal / semi-formal memory and execution model, so that we have something to reference and rely on when designing our backends, codegen, and optimizations.
One valuable reference that we can build on and modify is the Vulkan Memory Model: https://github.com/KhronosGroup/Vulkan-MemoryModel, which has been validated on many devices and provides common ground for consumer GPUs.
Another is the C++ memory model: https://en.cppreference.com/w/cpp/language/memory_model. This is the underlying memory model that CUDA has been working toward fully supporting, and it is the basis of Apple's Metal memory model. By extension, WebGPU's memory model is also largely based on a combination of these two.
In this discussion, we wish to reach some common ground on the basic guarantees and limits of such a model, especially concerning memory semantics.
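As one concrete example of the kind of guarantee such a model must pin down (a hypothetical Taichi sketch, with made-up field names): is a write performed in one parallel offload guaranteed to be visible to a subsequent offload of the same kernel?

```python
import taichi as ti

ti.init(arch=ti.vulkan)

a = ti.field(ti.i32, shape=1024)
b = ti.field(ti.i32, shape=1024)

@ti.kernel
def two_offloads():
    # Each top-level loop compiles into its own parallel offload. Whether
    # the writes to `a` in the first offload are guaranteed visible to the
    # reads in the second, and what synchronization the backend must emit
    # to make that so, is exactly what the memory model needs to specify.
    for i in range(1024):
        a[i] = i
    for i in range(1024):
        b[i] = a[i] * 2
```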