This repository has been archived by the owner on Jun 27, 2023. It is now read-only.

feat: hamt enumlinks custom #111

Merged: 10 commits merged into schomatis/directory/unsharding on Nov 12, 2021

Conversation

@aschmahmann (Contributor) commented Oct 27, 2021

An alternative to #110. Would still like to clean this up with errgroups if possible.

I've gone through three iterations of this function over the course of this PR (follow the commits):

  1. Basically a clone of go-merkledag's walker, but using a union of (CID, Shard) as the elements we walk over, so we can also walk shards that are already in memory (49314cf).
  2. Try to clean up the above with error groups to make the code easier to follow and manage (c930522).
  3. Switch to bulkier DAGService requests, i.e. GetMany instead of Get (8051de7). The latest commit also includes some test modifications that were required to get this to work.

@Stebalien recommended that last improvement as an efficiency win lower down the stack, since the Bitswap requests will get grouped. However, I'm not sure this is really the correct implementation, since it'll allow slow resolution of some parts of the HAMT to gum up the system.

This does seem to impact the walk order considerably, causing the number of nodes traversed during HAMT size estimation on a complete HAMT to drop dramatically. While that seems like a good thing, it means we're really messing with the ordering here, so we may want to be careful and go with option 2.
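
A minimal sketch of the worker shape described in iterations 2 and 3, assuming an errgroup of workers that drain batches of child CIDs from a feed channel and fetch each batch with one GetMany call; the walkBatches name and signature are illustrative, and the real parallelWalkDepth also carries in-memory Shards alongside CIDs and emits value links to a callback:

```go
package hamt

import (
	"context"

	"github.com/ipfs/go-cid"
	ipld "github.com/ipfs/go-ipld-format"
	"golang.org/x/sync/errgroup"
)

// walkBatches is a simplified sketch: workers drain batches of child CIDs
// from feed and fetch each batch with a single GetMany call, so Bitswap can
// group the requests lower down the stack.
func walkBatches(ctx context.Context, ds ipld.NodeGetter, feed <-chan []cid.Cid, concurrency int) error {
	grp, ctx := errgroup.WithContext(ctx)
	for i := 0; i < concurrency; i++ {
		grp.Go(func() error {
			for batch := range feed {
				for opt := range ds.GetMany(ctx, batch) {
					if opt.Err != nil {
						return opt.Err
					}
					// Decode opt.Node into a child Shard and queue its own
					// children (omitted in this sketch).
					_ = opt.Node
				}
			}
			return nil
		})
	}
	return grp.Wait()
}
```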

cc @schomatis @Stebalien

welcome bot commented Oct 27, 2021

Thank you for submitting this PR!
A maintainer will be here shortly to review it.
We are super grateful, but we are also overloaded! Help us by making sure that:

  • The context for this PR is clear, with relevant discussion, decisions
    and stakeholders linked/mentioned.

  • Your contribution itself is clear (code comments, self-review for the
    rest) and in its best form. Follow the code contribution guidelines if
    they apply.

Getting other community members to do a review would be a great help too on complex PRs (you can ask in the chats/forums). If you are unsure about something, just leave us a comment.
Next steps:

  • A maintainer will triage and assign priority to this PR, commenting on
    any missing things and potentially assigning a reviewer for high
    priority items.

  • The PR gets reviewed, discussed, and approved as needed.

  • The PR is merged by maintainers when it has been approved and comments addressed.

We currently aim to provide initial feedback/triaging within two business days. Please keep an eye on any labelling actions, as these will indicate priorities and status of your contribution.
We are very grateful for your contribution!

@aschmahmann changed the base branch from master to schomatis/directory/unsharding on October 27, 2021 18:43
@schomatis (Contributor) left a comment


The parallelWalkDepth function LGTM (modulo the errgroup usage, which I'm not familiar with); I left some minor comments, but nothing blocking.

Need more time to understand the changes in the testing logic of the last commit.

hamt/hamt.go (outdated), comment on lines 472 to 483:

```go
var linksToVisit []cid.Cid
for _, nextLink := range shardOrCID.links {
	var shouldVisit bool

	visitlk.Lock()
	shouldVisit = visit(nextLink)
	visitlk.Unlock()

	if shouldVisit {
		linksToVisit = append(linksToVisit, nextLink)
	}
}
```

nit: I think we could drop this optimization (to simplify the code) as I wouldn't expect to have repeated internal (non-value) shard nodes.
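
For context, a minimal sketch (not code from the PR) of the pattern the visit callback and visitlk mutex in the snippet above follow, assuming the usual go-merkledag approach of a deduplicating CID set behind a lock:

```go
package hamt

import (
	"sync"

	"github.com/ipfs/go-cid"
)

// newThreadSafeVisit builds the kind of visit function used above: it
// reports true only the first time a CID is seen, so already-visited links
// are skipped. cid.Set is not safe for concurrent use, hence the mutex
// (the role visitlk plays in the PR).
func newThreadSafeVisit() func(cid.Cid) bool {
	var mu sync.Mutex
	seen := cid.NewSet()
	return func(c cid.Cid) bool {
		mu.Lock()
		defer mu.Unlock()
		return seen.Visit(c)
	}
}
```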

hamt/hamt.go (outdated):

```go
	return err
}

nextLinks, err := nextShard.walkLinks(processShardValues)
```

nit: The general nomenclature around listCidShardUnion is a bit confusing. My general take is that we are processing all the children of the same node at once, together, both in-memory Shards and stored links, but there are some conflicting names:

  • The walkLinks function actually walks all children, both links and shards. (Similarly, the nextLinks in this line.)
  • The 'union' suffix in the structure's name makes me think we have an 'either of' situation.
  • Similarly, the shardOrCID name somewhere above in this function. Internally the HAMT stores each of its children in either shard or link format. In that sense the 'union/or' terms are correct, but when processing all the children of a single node I think we should decouple ourselves from that mutually exclusive definition and focus on a single group of children (which, yes, will be expressed in either of those two formats, but that doesn't seem key to the Walk algorithm).
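
To make the naming discussion concrete, a rough sketch of the shape being described; the links field name appears in the snippet reviewed above, while the shards field name and the exact definition are assumptions rather than the PR's actual code:

```go
package hamt

import "github.com/ipfs/go-cid"

// listCidShardUnion (as discussed above) groups all the children of one
// parent node, split by how the HAMT currently holds them. Field names are
// illustrative; only links is confirmed by the reviewed snippet.
type listCidShardUnion struct {
	links  []cid.Cid // children known only by CID; still need to be fetched
	shards []*Shard  // children already decoded and held in memory
}
```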

hamt/hamt.go (outdated), comment on lines 457 to 458:

```go
grp.Go(func() error {
	for shardOrCID := range feed {
```

nit: The walk algorithm here is more expansive than the original, and its differences should be documented, since this is clearly a copy of the other and anyone reading this code will be thinking of the original when trying to reason through it. (I'm not referring to the GetMany optimization, which is valid in itself and could even be incorporated into the Shard logic.)

In the original we process one parent node at a time (represented by its CID), extract its children, filter which should be emitted as output (value links/shards), and push the rest to the queue/feed one at a time to be processed independently in the next iteration, each as a new parent node.

Here we send (after filtering) all the children together in bulk (the lists in listCidShardUnion) and then extract all of their children together in turn. (It might be a valid optimization, and this comment is not against it; I'm just advocating for more documentation around it.) I'm not sure if this affects the traversal behavior expected by TestHAMTEnumerationWhenComputingSize; I don't think so, but I need more time to think about it.
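
An illustrative sketch of the two feeding strategies described above; the function names and channel element types are hypothetical, not the PR's code:

```go
package hamt

import "github.com/ipfs/go-cid"

// feedPerChild mirrors the original go-merkledag-style walker: each child
// CID becomes its own work item, so a slow child does not hold back its
// siblings and the ordering stays closer to a plain BFS.
func feedPerChild(feed chan<- cid.Cid, children []cid.Cid) {
	for _, c := range children {
		feed <- c
	}
}

// feedInBulk mirrors this PR: all children of one parent are queued as a
// single item, so one worker later fetches them with a single GetMany call.
func feedInBulk(feed chan<- []cid.Cid, children []cid.Cid) {
	feed <- children
}
```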


(edit: it was affecting tests, see comment below)

@schomatis (Contributor) commented:
The grouping caused by the GetMany requests might cause the fake timeout added in countGetsDS to be ineffective at keeping an ordered BFS (some threads win over others, turning it more into a DFS). Reducing the concurrency in parallelWalkDepth indeed hits the mark expected by TestHAMTEnumerationWhenComputingSize, making the test pass. (We might want to add the concurrency parameter to the internal package to adjust it for this test. It would lose the true parallel dimension but would still be useful in testing the brand-new parallel walk.)
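
One hedged way the suggested knob could look; the variable name and default value are hypothetical, not something already in the repository:

```go
package internal

// HAMTWalkConcurrency would be the number of workers parallelWalkDepth
// starts. Production code keeps the default; a test such as
// TestHAMTEnumerationWhenComputingSize could set it to 1 so the fake
// per-Get timeout keeps producing a predictable BFS-like order.
// The default value here is arbitrary.
var HAMTWalkConcurrency = 32
```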

@schomatis (Contributor) commented:
Follow-up: splitting the children and processing them separately in the loop fixes the test (see PoC branch).

@aschmahmann (Contributor, Author) commented:
Follow-up: splitting the children and processing them separately in the loop fixes the test

Correct. That's the same as using Get instead of GetMany though.

@schomatis (Contributor) commented:
Yes, optimizing for speed is in conflict with adding delays to get a predictable BFS. That is expected; we just need to change our expectation of predictability in the test.

@schomatis (Contributor) commented:
Also note that the GetMany optimization preemptively fetches more links than it might need, so it goes directly against the original optimization (which TestHAMTEnumerationWhenComputingSize was explicitly testing) of fetching as few nodes as possible to determine the HAMT directory size.

@aschmahmann (Contributor, Author) commented:
Also note that the GetMany optimization preemptively fetches more links than what it might need

Does it necessarily do that? If we cancel the context, then GetMany should terminate early, so we're asking for (and might receive) more blocks than we need, but we're not necessarily waiting on them and might not receive the extra blocks at all.
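
A minimal sketch of this point, assuming the go-ipld-format NodeGetter interface; the getUpTo name and the enough predicate are illustrative, not the PR's actual stopping logic:

```go
package hamt

import (
	"context"

	"github.com/ipfs/go-cid"
	ipld "github.com/ipfs/go-ipld-format"
)

// getUpTo asks GetMany for every CID but stops consuming once enough nodes
// have arrived; the deferred cancel tells GetMany we no longer need the
// rest, so we neither wait on nor necessarily receive the extra blocks.
func getUpTo(ctx context.Context, ds ipld.NodeGetter, cids []cid.Cid, enough func(int) bool) ([]ipld.Node, error) {
	ctx, cancel := context.WithCancel(ctx)
	defer cancel() // aborts the remaining fetches when we return early

	var nodes []ipld.Node
	for opt := range ds.GetMany(ctx, cids) {
		if opt.Err != nil {
			return nodes, opt.Err
		}
		nodes = append(nodes, opt.Node)
		if enough(len(nodes)) {
			break // stop consuming; the deferred cancel cleans up
		}
	}
	return nodes, nil
}
```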

@schomatis (Contributor) commented:
You're right. The final aim of the optimization is reducing enumeration time, not the number of fetches. (Still, our test tracks the second.)

@aschmahmann merged commit 20d951f into schomatis/directory/unsharding on Nov 12, 2021