[doc] Document list generation #990

yuanming-hu · 2020-05-15T01:19:03Z

Related issue = #709 #941 #986

archibate

Thank for adding this, this is really helpful for backend maintainers, lgtm in general except for a few questions.

archibate · 2020-05-15T03:55:53Z

docs/internal.rst

+ - (Tasks ``$0`` and ``$1``) from the list of ``root`` node (``S0``) to list of the ``dense`` nodes (``S1``);
+ - (Tasks ``$2`` and ``$3``) from the list of ``dense`` nodes (``S1``) to the list of ``bitmasked`` nodes (``S2``).


What does from ... to mean? Can we clarify this?

archibate · 2020-05-15T03:56:35Z

misc/listgen_demo.py

+@ti.kernel
+def func():
+    for i in x:
+        print(i)


I guess this will always get not printed since bitmasked is inactivated by default.

That's true. We are just illustrating list generation here.

archibate · 2020-05-15T03:59:09Z

docs/internal.rst

+Our strategy is to generate lists of SNode elements layer by layer.
+This flattens the data structure leaf elements into a 1D dense array, circumventing the irregularity of
+incomplete trees. Then we can simply invoke a regular **parallel for** over the list.


Suggested change

Our strategy is to generate lists of SNode elements layer by layer.

This flattens the data structure leaf elements into a 1D dense array, circumventing the irregularity of

incomplete trees. Then we can simply invoke a regular **parallel for** over the list.

Our strategy is to generate a list of SNode element indices for each layer.

This flattens the data structure leaf elements into a 1D dense array, circumventing the irregularity of

incomplete trees. Then we can simply invoke a regular **parallel for** over the list.

Also, is the generated list stored in CPU or GPU memory? If you find the process hard to explain, please use LLVM implementation for example.

It's on GPU, and these lists are updated in a kernel. There are multiple lists so I think plural form is the correct one. I'd suggest to say lists of active SNode elements layer by layer.

Suggested change

Our strategy is to generate lists of SNode elements layer by layer.

This flattens the data structure leaf elements into a 1D dense array, circumventing the irregularity of

incomplete trees. Then we can simply invoke a regular **parallel for** over the list.

Our strategy is to generate several lists of active SNode elements on GPU, layer by layer.

This flattens the data structure leaf elements into a 1D dense array, circumventing the irregularity of

incomplete trees. Then we can simply invoke a regular **parallel for** over the list.

I'll be more specific and say generation will happen on the device.

k-ye · 2020-05-15T11:06:53Z

docs/internal.rst

+Our strategy is to generate lists of SNode elements layer by layer.
+This flattens the data structure leaf elements into a 1D dense array, circumventing the irregularity of
+incomplete trees. Then we can simply invoke a regular **parallel for** over the list.


It's on GPU, and these lists are updated in a kernel. There are multiple lists so I think plural form is the correct one. I'd suggest to say lists of active SNode elements layer by layer.

yuanming-hu · 2020-05-15T22:23:51Z

Thanks for the reviews! I'm merging this, so that I can add more details on the internal designs (for #986) following this PR.

[skip ci] update

000a31f

yuanming-hu requested a review from k-ye May 15, 2020 01:19

yuanming-hu mentioned this pull request May 15, 2020

[metal] Fix SNode off-by-one problem #986

Merged

archibate reviewed May 15, 2020

View reviewed changes

k-ye approved these changes May 15, 2020

View reviewed changes

[skip ci] apply reviews

6bbe37a

yuanming-hu merged commit 7716b47 into taichi-dev:master May 15, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[doc] Document list generation #990

[doc] Document list generation #990

yuanming-hu commented May 15, 2020 •

edited

Loading

archibate left a comment

archibate May 15, 2020

archibate May 15, 2020

yuanming-hu May 15, 2020

archibate May 15, 2020

k-ye May 15, 2020

archibate May 15, 2020

yuanming-hu May 15, 2020 •

edited

Loading

k-ye May 15, 2020

yuanming-hu commented May 15, 2020

		- (Tasks ``$0`` and ``$1``) from the list of ``root`` node (``S0``) to list of the ``dense`` nodes (``S1``);
		- (Tasks ``$2`` and ``$3``) from the list of ``dense`` nodes (``S1``) to the list of ``bitmasked`` nodes (``S2``).

[doc] Document list generation #990

[doc] Document list generation #990

Conversation

yuanming-hu commented May 15, 2020 • edited Loading

archibate left a comment

Choose a reason for hiding this comment

archibate May 15, 2020

Choose a reason for hiding this comment

archibate May 15, 2020

Choose a reason for hiding this comment

yuanming-hu May 15, 2020

Choose a reason for hiding this comment

archibate May 15, 2020

Choose a reason for hiding this comment

k-ye May 15, 2020

Choose a reason for hiding this comment

archibate May 15, 2020

Choose a reason for hiding this comment

yuanming-hu May 15, 2020 • edited Loading

Choose a reason for hiding this comment

k-ye May 15, 2020

Choose a reason for hiding this comment

yuanming-hu commented May 15, 2020

yuanming-hu commented May 15, 2020 •

edited

Loading

yuanming-hu May 15, 2020 •

edited

Loading