This repository has been archived by the owner on Apr 29, 2020. It is now read-only.

Open Problem: Enhanced Bitswap/GraphSync with more Network Smarts #9

Merged
merged 19 commits into master on Nov 26, 2019

Conversation

daviddias (Contributor)

No description provided.

@daviddias marked this pull request as ready for review on November 7, 2019 08:43
### What defines a complete solution?
> What hard constraints should it obey? Are there additional soft constraints that a solution would ideally obey?

First and foremost, any complete solution should account for extensibility, as the IPFS system needs to scale up while more applications are implemented on top of it. The number of active IPFS users is increasing exponentially and the requests submitted to the network are growing accordingly; a complete solution should therefore account for those numbers.
@jsoares (Contributor) commented on Nov 7, 2019

I'm not sure unbounded exponential scaling is a realistic goal. Would be good to put some order of magnitude here, especially given the reference to "those numbers".

@yiannisbot (Collaborator)

Good point, need to clarify.

@dirkmc

I would add that, ideally, IPFS should dynamically adapt to different environments, analogously to how TCP works both within a data center and across the broader internet.

yiannisbot and others added 8 commits November 7, 2019 16:03
Co-Authored-By: Jorge Soares <mail@jorgesoares.org>
@dirkmc left a comment

Overall LGTM 👍

I left a couple of comments with some more detail in case you need to incorporate that background info anywhere


@daviddias (Contributor, Author)

@yiannisbot can you take in @dirkmc's review before I do the final review for the merge? Thank you!

@yiannisbot (Collaborator)

Yup, it's on the to-do list for this week as I prepare the RFPs.

@yiannisbot (Collaborator)

> Overall LGTM 👍
>
> I left a couple of comments with some more detail in case you need to incorporate that background info anywhere

Thanks a lot @dirkmc! Very useful feedback. Most of it now integrated in the main text.

@dirkmc commented Nov 12, 2019

I'm not sure if we want to include it in this document, but I just want to make sure people are aware that the folks at qri.io have implemented a data transfer mechanism using some IPFS components that keeps track of blocks in a DAG using Manifest files, analogous to BitTorrent magnet files.

@yiannisbot (Collaborator)

> I'm not sure if we want to include it in this document, but I just want to make sure people are aware that the folks at qri.io have implemented a data transfer mechanism using some IPFS components that keeps track of blocks in a DAG using Manifest files, analogous to BitTorrent magnet files.

Added in the "Extra notes" section.


If none of the directly connected peers have any of the blocks on the WANT list, bitswap falls back to the DHT to find the requested content. This results in long delays before reaching a peer that stores the requested content.

Once the recipient node starts receiving content from multiple peer nodes, it prunes the long-latency peers and keeps the ones with the shortest RTT. Current proposals within the IPFS ecosystem are considering keeping the nodes with the highest throughput instead. It is not clear at this point which approach is best.

@Stebalien

Not exactly.

  • We currently prune to peers that have the content, then prioritize sending wants to peers with lower latencies. We still send wants to all peers (IIRC).
  • The plan is to change that second part to: prioritize sending wants to peers with the least amount of queued work (sketched below).
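
A minimal sketch of that planned behaviour, in Go, with hypothetical names (this is not the actual go-bitswap code): among the peers known to have a given block, the next want goes to the peer with the least queued work.

```go
package main

import "fmt"

// peer is a toy stand-in for a bitswap peer entry (hypothetical type).
type peer struct {
	id         string
	hasBlock   bool // whether the peer is known to have the wanted block
	queuedWork int  // e.g. outstanding wants / bytes already queued for this peer
}

// pickPeerForWant prunes to peers that have the content, then picks the one
// with the least amount of queued work.
func pickPeerForWant(peers []peer) (string, bool) {
	bestID, bestWork, found := "", 0, false
	for _, p := range peers {
		if !p.hasBlock {
			continue
		}
		if !found || p.queuedWork < bestWork {
			bestID, bestWork, found = p.id, p.queuedWork, true
		}
	}
	return bestID, found
}

func main() {
	peers := []peer{
		{id: "QmPeerA", hasBlock: true, queuedWork: 12},
		{id: "QmPeerB", hasBlock: false, queuedWork: 0},
		{id: "QmPeerC", hasBlock: true, queuedWork: 3},
	}
	if id, ok := pickPeerForWant(peers); ok {
		fmt.Println("send want to:", id) // prints "send want to: QmPeerC"
	}
}
```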

@daviddias (Contributor, Author)

@yiannisbot did you take @Stebalien's review in this comment?


- *DAG Block Interconnection.* Although bitswap does not/cannot recognise any relationship between different blocks of the same DAG, a requesting node can ask a node that provided a previous block for subsequent blocks of the same DAG. This approach intuitively assumes that a node that has one block of a DAG is very likely to have others. This is often referred to as a “session” between the peers that have provided some part of the DAG (see the sketch after this list).

- *Latency vs Throughput.* Bitswap currently sorts peers by latency, i.e., it prunes the connections that incur higher latency. It has been suggested that this be changed to maximise throughput (i.e., keep the pipe full).
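
As a rough illustration of the “session” idea above (invented names; not the actual go-bitswap session implementation), a session can simply remember which peers already delivered blocks of a DAG and ask the most productive ones first for subsequent blocks, falling back to the DHT only when none of them respond:

```go
package main

import (
	"fmt"
	"sort"
)

// Session tracks peers that have already provided blocks of a given DAG.
type Session struct {
	provided map[string]int // peer ID -> number of blocks delivered so far
}

func NewSession() *Session {
	return &Session{provided: make(map[string]int)}
}

// BlockReceived records that a peer delivered a block belonging to this DAG.
func (s *Session) BlockReceived(peerID string) {
	s.provided[peerID]++
}

// Candidates returns the peers to ask first for further blocks of the same DAG,
// most productive first; an empty result means falling back to the DHT.
func (s *Session) Candidates() []string {
	out := make([]string, 0, len(s.provided))
	for p := range s.provided {
		out = append(out, p)
	}
	sort.Slice(out, func(i, j int) bool { return s.provided[out[i]] > s.provided[out[j]] })
	return out
}

func main() {
	s := NewSession()
	s.BlockReceived("QmPeerA")
	s.BlockReceived("QmPeerA")
	s.BlockReceived("QmPeerB")
	fmt.Println(s.Candidates()) // [QmPeerA QmPeerB]
}
```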

@Stebalien

It's not really either/or. Really, we should:

  • Optimize for latency when traversing deep/narrow DAGs (e.g., a blockchain/path). Lower latency means we learn about the next node faster.
  • Optimize for throughput when traversing a wide DAG in parallel (a toy sketch follows).
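
A toy illustration of that distinction (hypothetical names, just to make it concrete): when only one branch of the traversal is outstanding, the next CID is learned only after the current block arrives, so latency dominates; when many branches are outstanding, the fetch is bandwidth-bound.

```go
package main

import "fmt"

// rankBy is a hypothetical peer-ranking criterion.
type rankBy int

const (
	rankByLatency    rankBy = iota // deep/narrow DAG: the next link is known only once this block arrives
	rankByThroughput               // wide DAG fetched in parallel: keep the pipe full
)

// chooseRanking picks a criterion from the shape of the current traversal.
func chooseRanking(outstandingBranches int) rankBy {
	if outstandingBranches <= 1 {
		return rankByLatency
	}
	return rankByThroughput
}

func main() {
	fmt.Println(chooseRanking(1) == rankByLatency)     // blockchain-/path-like traversal
	fmt.Println(chooseRanking(32) == rankByThroughput) // wide, parallel traversal
}
```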

@yiannisbot (Collaborator)

That's great! I was not aware this was the intention.


There have been significant research efforts lately in the area of coded caching. The main concept was proposed in the 1960s in the form of error correction, targeting content delivery over wireless, lossy channels, and is known as Reed-Solomon error correction. More recently, with seminal works such as “Fundamental Limits of Caching”, Niesen et al. have proposed the use of coding to improve caching performance. In summary, the technique works as follows: if a file consists of 10 chunks and we store all 10 chunks on the same or different memories/nodes, then we need to retrieve those exact 10 chunks in order to reconstruct the file.

In contrast, according to coded caching theory, before storing the 10 chunks we encode the file using erasure codes. This results in some number of chunks x > 10, say 13 for the sake of illustration, so adding the codes produces more data than the original. However, when attempting to retrieve the original file, a user only needs to collect *any 10 of those 13 chunks* to be able to reconstruct it, without needing to get all 13. Although such an approach does not save bandwidth (we still need to retrieve 10 chunks whose total size equals that of the original file), it makes the network more resilient to nodes being unavailable. In other words, without coding, all 10 of the original peers that store the file's chunks have to be online and ready to deliver them in order to reconstruct the file, whereas with coded caching, any 10 out of the 13 peers need to be available and ready to provide their chunks. Blind replication of the original chunks will not provide the same benefit, as the number of peers would need to be much higher (at least 20 compared to 13) in order to operate with the same satisfaction ratio.
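
To make the 10-of-13 example concrete, here is a small sketch using the third-party klauspost/reedsolomon Go library (the library choice, shard counts, and file contents are illustrative assumptions, not something the text above prescribes): the file is split into 10 data shards, 3 parity shards are computed, any 3 shards are then dropped, and the original is still reconstructed.

```go
package main

import (
	"bytes"
	"fmt"
	"log"

	"github.com/klauspost/reedsolomon"
)

func main() {
	// 10 data shards + 3 parity shards: the "13 chunks, any 10 suffice" example.
	enc, err := reedsolomon.New(10, 3)
	if err != nil {
		log.Fatal(err)
	}

	original := bytes.Repeat([]byte("some file content "), 1000)

	// Split the file into 10 equally sized data shards, then compute the 3 parity shards.
	shards, err := enc.Split(original)
	if err != nil {
		log.Fatal(err)
	}
	if err := enc.Encode(shards); err != nil {
		log.Fatal(err)
	}

	// Simulate 3 of the 13 peers being unavailable: drop any 3 shards.
	shards[1], shards[6], shards[12] = nil, nil, nil

	// Any 10 remaining shards are enough to rebuild the missing ones...
	if err := enc.Reconstruct(shards); err != nil {
		log.Fatal(err)
	}

	// ...and to reassemble the original file (padding is trimmed by passing the original length).
	var out bytes.Buffer
	if err := enc.Join(&out, shards, len(original)); err != nil {
		log.Fatal(err)
	}
	fmt.Println("reconstructed OK:", bytes.Equal(out.Bytes(), original))
}
```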

@Stebalien

IIRC, if I erasure encode my file, I can't reconstruct a part of the file without having the minimum number of chunks. Is this correct?

If so, I'm not sure if this buys us anything. Peers will likely either have or not have all the chunks necessary to reconstruct a file; having one chunk will be highly correlated with having the rest.

TL;DR: chunk deletion is not random. Yes, disks can fail but that should be handled at a lower layer.

@yiannisbot (Collaborator)

> IIRC, if I erasure encode my file, I can't reconstruct a part of the file without having the minimum number of chunks. Is this correct?

Yes, it is.

> If so, I'm not sure if this buys us anything. Peers will likely either have or not have all the chunks necessary to reconstruct a file; having one chunk will be highly correlated with having the rest.

That's correct for the case of small files. But in the case of very large files, coded caching provides nice load-balancing properties, i.e., you don't keep someone's uplink saturated for hours to get some GBs worth of data. The replication you would need in order to achieve equal load-balancing without coding would be much higher, therefore resulting in inefficient use of (storage) resources.
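
(For rough, purely illustrative numbers: a 100 GB file split into ten 10 GB chunks can be erasure-coded into thirteen coded chunks held by thirteen different peers; a requester then fetches any ten of them in parallel, so no single peer uploads more than 10 GB and up to three peers can be offline or saturated at any time. The point above is that matching this with plain replication would require at least two copies of each original chunk, i.e. twenty stored chunks rather than thirteen.)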

@yiannisbot (Collaborator) left a comment

I've addressed @Stebalien's comments and committed a new version.

@daviddias (Contributor, Author) left a comment

Some additional comments. This is looking really solid; we can merge it once the last comments are addressed.


### Extra notes

[qri.io](https://qri.io/): a data transfer mechanism using IPFS components to keep track of blocks in a DAG using Manifest files (similar to BitTorrent magnet files) - https://github.com/qri-io/dag
@daviddias (Contributor, Author)

This should be listed as "one of the experiments within the IPFS Ecosystem"; it is a tool that uses IPFS and its APIs for faster syncs.

@daviddias merged commit 2a78af6 into master on Nov 26, 2019