implement trickledag for faster unixfs operations #713

whyrusleeping · 2015-02-01T22:00:46Z

Alright, so ive come up with a new tree structure optimized for both streaming AND seeking through a given file. This improves both upon the ext4 structure (Which is mainly aimed at on disk filesystems) and the "List of Lists" idea i previously commented about.

The downside of the ext4 style tree layout was that, as you got farther into the file, the number of requests you need to make in order to get data increases, I noticed this problem and came up with the "List of Lists" layout, which would work fantastically for a sequential stream, the issue though, comes when you try to seek through it, the top level node is very poorly weighted to one side so that its 'narrow' from the data's perspective, thus seeking through requires O(n) requests to find the desired location in the file, where ext4 was roughly O(log(n)).

The Trickle{Tree,Dag} addresses both of these concerns, each request after the first can return actual file data, and the cost of seeking remains near O(log(n)) since it has a recursive tree structure. A visualization of it would look like the ext4 tree, but instead of having iteratively deeper 'balanced' trees, it has an iteratively deeper version of itself. The primary tenet of its design is "Data at every layer"

An example layout is here:
http://gateway.ipfs.io/ipfs/QmRPfwo1XQErHDXpeCnJ7j92ibGNTBxkrmBFCbvEa78gZB

jbenet · 2015-02-02T05:51:37Z

(i think that if you rebase on master, that error will be fixed)

jbenet · 2015-02-02T07:59:13Z

I think the right thing to do with all these datastructures is to setup a benchmark suite that tests various different types of workloads. it may be that we find one or two that are really different datastructures will be better for different use cases. re-indexing the same data blocks might be fine to have "different handles" on the same content.

jbenet · 2015-02-02T08:46:04Z

importer/importer_test.go

+		t.Fatal(err)
+	}
+}
+


maybe add some benchmarks to this pkg?

Also, these tests only test it from the outside. It would be useful to test the implementation actually creates a well-formed structure. Maybe add a test that checks the structure produced?

jbenet · 2015-02-02T09:10:16Z

Few comments, otherwise LGTM

…gReader

implement trickledag for faster unixfs operations

whyrusleeping added the status/in-progress In progress label Feb 1, 2015

whyrusleeping mentioned this pull request Feb 1, 2015

implement a faster dag structure for unixfs #687

Closed

jbenet modified the milestone: α Feb 2, 2015

jbenet assigned whyrusleeping Feb 2, 2015

whyrusleeping force-pushed the feat/trickledag branch from 397ec6d to 51d8c6d Compare February 2, 2015 08:28

jbenet reviewed Feb 2, 2015
View reviewed changes

whyrusleeping added 4 commits February 4, 2015 21:59

implement trickledag for faster unixfs operations

b3e74fa

refactor importer package with trickle and balanced dag generation

bc79ae1

fix benchmarks

414bdc7

clean up benchmarks, implement WriterTo on DAGReader, and optimize Da…

1e93ee0

…gReader

whyrusleeping force-pushed the feat/trickledag branch from 2eb2965 to 1e93ee0 Compare February 4, 2015 22:00

whyrusleeping added a commit that referenced this pull request Feb 4, 2015

Merge pull request #713 from jbenet/feat/trickledag

adb7ad9

implement trickledag for faster unixfs operations

whyrusleeping merged commit adb7ad9 into master Feb 4, 2015

whyrusleeping removed the status/in-progress In progress label Feb 4, 2015

jbenet deleted the feat/trickledag branch March 31, 2015 21:41

btrask mentioned this pull request Nov 9, 2015

Hash conversion for import/export and long term archival #1953

Closed

daviddias mentioned this pull request Dec 6, 2016

DEX ipfs/specs#57

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

implement trickledag for faster unixfs operations #713

implement trickledag for faster unixfs operations #713

whyrusleeping commented Feb 1, 2015

jbenet commented Feb 2, 2015

jbenet commented Feb 2, 2015

jbenet Feb 2, 2015

jbenet Feb 2, 2015

jbenet commented Feb 2, 2015

implement trickledag for faster unixfs operations #713

implement trickledag for faster unixfs operations #713

Conversation

whyrusleeping commented Feb 1, 2015

jbenet commented Feb 2, 2015

jbenet commented Feb 2, 2015

jbenet Feb 2, 2015

Choose a reason for hiding this comment

jbenet Feb 2, 2015

Choose a reason for hiding this comment

jbenet commented Feb 2, 2015