#14974: ttnn::empty Tensor creation API for MeshDevice #15191

omilyutin-tt · 2024-11-18T22:34:35Z

Ticket

#14974

Problem description

Extensions to Tensor creation APIs to support MeshDevice.

What's changed

Overload for ttnn::empty to support MeshDevice.

Minor formatting fixes / code comments.

Checklist

Post commit CI passes
Blackhole Post commit (if applicable)
Model regression CI testing passes (if applicable)
Device performance regression CI testing passes (if applicable)
New/Existing tests provide coverage for changes

tests/ttnn/unit_tests/operations/test_creation.py

cfjchu · 2024-11-19T01:40:25Z

ttnn/cpp/ttnn/operations/creation.hpp

+        const DataType& dtype,
+        const Layout& layout,
+        Device* device,
+        const MemoryConfig& memory_config) {


@patrickroberts since I saw you recently touched these creation functions. fyi @eyonland @yan-zaretskiy

constexpr auto full = ttnn::decorators::register_operation_with_auto_launch_op<"ttnn::full", ttnn::operations::creation::Full>(); constexpr auto zeros = ttnn::decorators::register_operation<"ttnn::zeros", ttnn::operations::creation::Zeros>(); constexpr auto ones = ttnn::decorators::register_operation<"ttnn::ones", ttnn::operations::creation::Ones>(); constexpr auto empty = ttnn::decorators::register_operation<"ttnn::empty", ttnn::operations::creation::Empty>(); constexpr auto full_like = ttnn::decorators::register_operation_with_auto_launch_op<"ttnn::full_like", ttnn::operations::creation::FullLike>(); constexpr auto zeros_like = ttnn::decorators::register_operation<"ttnn::zeros_like", ttnn::operations::creation::ZerosLike>(); constexpr auto ones_like = ttnn::decorators::register_operation<"ttnn::ones_like", ttnn::operations::creation::OnesLike>(); constexpr auto empty_like = ttnn::decorators::register_operation<"ttnn::empty_like", ttnn::operations::creation::EmptyLike>(); constexpr auto arange = ttnn::decorators::register_operation_with_auto_launch_op<"ttnn::arange", ttnn::operations::creation::Arange>();

Some of these creation functions use register_operation_with_auto_launch_op and some use register_operation. For the ones that use register_operation I don't see see explicit calls to launch_op. Do we want to make all these invoke functions for these creation ops to be done under the scope of launch_op? It seems inconsistent right now?

When I updated those creation functions, I did not change whether they opted into auto_launch_op, that was determined by @arakhmati, and I don't have a good understanding of the criteria he used to decide that.

When op returns something else than a Tensor, for example a vector of tensors. In this case infra can't determine how many tensors to create or which to create so automatic decoration is not possible. In this case you must use register_operation and make an explicit call to launch_op inside operation's invoke.

If this is not handled correctly, operation won't be async even if async mode is enabled.

Good news, I am sure 6 months from now this wonderful infra should go away.

#10672 introduces the discrepancy, I think we can fix in a follow up? Under the hood, full/zeros/ones are ultimately the same; empty path is a bit simpler as it only requires an allocation.

cfjchu · 2024-11-19T01:41:41Z

ttnn/cpp/ttnn/operations/creation.hpp

+        Tensor device_tensor = allocate_tensor_on_device(shape, dtype, layout, device, memory_config);
+        device_tensor.wait_for_tensor_metadata_populated();
+        device_tensor.wait_for_tensor_data_populated();
+        return device_tensor;


@tt-asaigal what's the correct apis here to use. I don't think we want to call this in the worker thread?

cfjchu · 2024-11-19T01:42:54Z

Also just following up from our convo to prioritize C++ APIs to add a few C++ unit tests.

omilyutin-tt · 2024-11-19T17:58:39Z

Also just following up from our convo to prioritize C++ APIs to add a few C++ unit tests.

Done. There are other tests that target different Tensor parameters, I've just added a basic test for the multi-device API.

omilyutin-tt changed the title ~~Omilyutin/mesh creation~~ Tensor creation APIs for MeshDevice Nov 18, 2024

Denys88 reviewed Nov 18, 2024

View reviewed changes

tests/ttnn/unit_tests/operations/test_creation.py Show resolved Hide resolved

Denys88 approved these changes Nov 18, 2024

View reviewed changes

cfjchu reviewed Nov 19, 2024

View reviewed changes

omilyutin-tt changed the title ~~Tensor creation APIs for MeshDevice~~ #14974: Tensor creation APIs for MeshDevice Nov 19, 2024

omilyutin-tt added 3 commits November 19, 2024 17:47

Add tensor creation API for MeshDevice

412d74a

Block on multi-device allocation, add creation test

ba41a0b

Add C++ multi-device tensor creation test, fix Python test

7f564aa

omilyutin-tt force-pushed the omilyutin/mesh-creation branch from f04ca8c to 7f564aa Compare November 19, 2024 17:54

Parameterize C++ test for sync/async mesh device mode

4980209

omilyutin-tt changed the title ~~#14974: Tensor creation APIs for MeshDevice~~ #14974: ttnn::empty Tensor creation API for MeshDevice Nov 19, 2024

omilyutin-tt marked this pull request as ready for review November 19, 2024 18:42

omilyutin-tt requested review from dmakoviichuk-tt, rfurko-tt, ayerofieiev-tt, TT-BrianLiu, razorback3 and dongjin-na as code owners November 19, 2024 18:42

omilyutin-tt requested review from cfjchu, Denys88 and tt-asaigal and removed request for Denys88 November 19, 2024 18:43

Cleanup comments

7140c0c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

#14974: ttnn::empty Tensor creation API for MeshDevice #15191

#14974: ttnn::empty Tensor creation API for MeshDevice #15191

omilyutin-tt commented Nov 18, 2024 •

edited

Loading

cfjchu Nov 19, 2024

patrickroberts Nov 19, 2024

ayerofieiev-tt Nov 19, 2024

omilyutin-tt Nov 19, 2024

cfjchu Nov 19, 2024

cfjchu commented Nov 19, 2024

omilyutin-tt commented Nov 19, 2024

#14974: ttnn::empty Tensor creation API for MeshDevice #15191

Are you sure you want to change the base?

#14974: ttnn::empty Tensor creation API for MeshDevice #15191

Conversation

omilyutin-tt commented Nov 18, 2024 • edited Loading

Ticket

Problem description

What's changed

Checklist

cfjchu Nov 19, 2024

Choose a reason for hiding this comment

patrickroberts Nov 19, 2024

Choose a reason for hiding this comment

ayerofieiev-tt Nov 19, 2024

Choose a reason for hiding this comment

omilyutin-tt Nov 19, 2024

Choose a reason for hiding this comment

cfjchu Nov 19, 2024

Choose a reason for hiding this comment

cfjchu commented Nov 19, 2024

omilyutin-tt commented Nov 19, 2024

omilyutin-tt commented Nov 18, 2024 •

edited

Loading