Implement Packet buffer for sending #20

disarticulate · 2020-11-26T19:45:33Z

Most browsers currently have a limit for message size:

https://stackoverflow.com/questions/15435121/what-is-the-maximum-size-of-webrtc-data-channel-messages

My testing on Chrome gets the following error:
Attempting to send message of size 988606 which is larger than limit 262144

Although the spec is expected to be built into browsers, these arbitrary size limits result in no error message that I can see in the console. The above comes from running a debug version of Chrome.

When I tried to create my own 'sync' system before switching to try y-webrtc, I used protocol buffers to wrap updates, and hashed the data to keep them ordered/organized. I don't have any real knowledge about best practices, however.

dmonad · 2020-11-28T14:51:11Z

Hi @disarticulate ,

this is indeed a problem. Data channels in WebRTC really feel like an afterthought in many places.

One solution would be to write a wrapper around the WebRTC package "simple-peer" that will handle splitting up messages. Larger messages simply need to be split up if they exceed a certain size (I wonder if it is possible to get/overwrite the message-size limit).
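On reading the limit: browsers expose the negotiated value as `maxMessageSize` on the `RTCSctpTransport` that `RTCPeerConnection.sctp` returns. As far as I know the property is read-only, so it can be read but not overwritten. Below is a minimal sketch, assuming `pc` is an `RTCPeerConnection` and `channel` an `RTCDataChannel`; the helper names and the 16 KiB fallback are made up for illustration and are not part of simple-peer or y-webrtc.

```javascript
// Assumption: conservative default used when the real limit is unknown.
const FALLBACK_LIMIT = 16 * 1024

// Returns the negotiated message-size limit in bytes, or the fallback.
function getMaxMessageSize (pc) {
  // pc.sctp is null until SCTP has been negotiated.
  const limit = pc && pc.sctp ? pc.sctp.maxMessageSize : undefined
  // The limit may also be Infinity; this sketch stays conservative in that case.
  return Number.isFinite(limit) && limit > 0 ? limit : FALLBACK_LIMIT
}

// Sends `data` (a Uint8Array) or throws a visible error instead of failing silently.
function sendOrThrow (pc, channel, data) {
  const limit = getMaxMessageSize(pc)
  if (data.byteLength > limit) {
    throw new RangeError(
      `message of ${data.byteLength} bytes exceeds limit of ${limit} bytes`
    )
  }
  channel.send(data)
}
```

With a guard like this, an oversized update would at least surface as an exception in the regular console rather than only in a debug build.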

I wanted to write a wrapper around simple-peer anyway because different browsers often have trouble communicating with each other. Sometimes messages get lost although we use a reliable webrtc connection. Our wrapper around simple-peer should handle splitting up messages and making sure that no messages get lost (using a retry logic).

I imagine that we simply assign an increasing number to each message. Messages that are split have an additional increasing number that defines the part of the message.

This is how I would define the protocol. Internally, I'd probably simply encode this to Uint8Arrays using lib0/encoding. Protocol buffers is great, but it adds quite some overhead that I try to avoid (bundle size & mental complexity).

# Example of a "normal" message that is not split up
[normalMessageType, messageClock, ...message]

# Example of a split message
[splitMessageType, messageClock, numberOfMessageParts, partNumber, ...messagePart]

The peers would need to maintain a list of messages that they have not received yet. And of course, they would need to merge message parts when all parts have been received. For Yjs it is not necessary to apply messages in a certain order. Any order is fine. Messages just should not get lost.
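The framing above could be sketched roughly as follows. This is only an illustration: it uses plain JavaScript objects as frames instead of bytes encoded with lib0/encoding, the names (`toFrames`, `Reassembler`, `maxPartSize`) are hypothetical, and the retry logic for lost parts is left out.

```javascript
const NORMAL = 0
const SPLIT = 1

// Turns one message (Uint8Array) into one normal frame or several split frames.
function toFrames (messageClock, message, maxPartSize) {
  if (message.byteLength <= maxPartSize) {
    return [{ type: NORMAL, messageClock, payload: message }]
  }
  const numberOfMessageParts = Math.ceil(message.byteLength / maxPartSize)
  const frames = []
  for (let partNumber = 0; partNumber < numberOfMessageParts; partNumber++) {
    const start = partNumber * maxPartSize
    frames.push({
      type: SPLIT,
      messageClock,
      numberOfMessageParts,
      partNumber,
      // subarray clamps the end index, so the last part may be shorter
      payload: message.subarray(start, start + maxPartSize)
    })
  }
  return frames
}

// Collects parts per messageClock, in any arrival order.
class Reassembler {
  constructor () {
    this.pending = new Map() // messageClock -> array of parts, undefined = missing
  }

  // Returns the complete message once all parts have arrived, otherwise null.
  receive (frame) {
    if (frame.type === NORMAL) return frame.payload
    let parts = this.pending.get(frame.messageClock)
    if (parts === undefined) {
      parts = new Array(frame.numberOfMessageParts).fill(undefined)
      this.pending.set(frame.messageClock, parts)
    }
    parts[frame.partNumber] = frame.payload
    if (parts.some((p) => p === undefined)) return null
    this.pending.delete(frame.messageClock)
    const total = parts.reduce((n, p) => n + p.byteLength, 0)
    const message = new Uint8Array(total)
    let offset = 0
    for (const p of parts) {
      message.set(p, offset)
      offset += p.byteLength
    }
    return message
  }
}
```

The `pending` map doubles as the list of messages that are not yet complete, which is what a retry request would be built from.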

> When I tried to create my own 'sync' system before switching to try y-webrtc, I used protocol buffers to wrap updates, and hashed the data to keep them ordered/organized. I don't have any real knowledge about best practices, however.

One advantage of using Yjs/CRDTs is that you don't have to care about the order of messages. These messages simply have to arrive somehow at the other peers.

disarticulate · 2020-11-28T19:11:40Z

I looked around for some prior art, and this appears to be the only wrapper around simple peer that overcomes the issue:

https://github.com/disarticulate/simple-peer-files

The simple-peer-files/src/Meta.ts implements a similar protocol to what you describe

I forked it to see how small it could be bundled. With simple-peer as a peerDependency and without @feross/buffer, it came out to ~38 KB (compressed, I believe) and ~110 KB uncompressed. It looks like they're using some heavy streaming libraries, so I'm not sure how to interpret 'bundle size', but I'd guess a lot of that duplicates what you've done with lib0.

The other thought I had: with Yjs/CRDTs, is there any way to 'naturally' spit out smaller/chunked updates with some kind of flag? This would probably ruin the advantage of out-of-order updates, to the extent that you'd need to mutex-lock updates until a splitMessage has finished sending.

For now, I'm downsizing my documents, moving the media/large segmented parts into hashes, and seeing if simple-peer-files works well enough to do the heavy lifting and recombine everything on the other side.

dmonad · 2020-12-03T15:58:38Z

> i'm not sure how to interpret 'bundle size', but I'd guess a lot of that duplicates what you've done with lib0.

Yjs uses lib0/encoding anyway, so I'd like to avoid other encoding libraries if possible. It seems a lot of people are focused on protobuf ^^ yjs/yjs#262 - I explained my reasons for not using protobuf in Yjs there.

It seems that WebRTC doesn't always guarantee in-order delivery. So the new protocol should account for that. Simply describing the end of a message only works when the protocol guarantees in-order delivery.

> The other thought I had: with Yjs/CRDTs, is there any way to 'naturally' spit out smaller/chunked updates with some kind of flag? This would probably ruin the advantage of out-of-order updates, to the extent that you'd need to mutex-lock updates until a splitMessage has finished sending.

There is. You can basically split up Yjs documents into smaller update messages. But when you insert one huge JSON/binary blob in Yjs, the smallest update unit might still be too large for WebRTC. I don't think we can get around splitting messages.

disarticulate · 2020-12-04T00:47:26Z

My WebRTC buffer protocol was to:

1. Hash the data.
2. Chunk the data, then calculate the hashes.
3. Wrap each chunk in a protobuf with the packet # and metadata, particularly the final hash.
4. Receive packets in whatever order, then reassemble until the hash matches. So no 'technical' order was necessary; not knowing the numbers would make reassembly expensive, but not impossible.

Hashing was used because I semi-expect to have an insecure network and wanted my packets not to be modified, but right now it's just syncing device documents.
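A rough sketch of those steps, under a few assumptions: it runs on Node.js and uses the built-in `crypto` module with SHA-256 (a browser would need something like `crypto.subtle.digest` instead), plain objects stand in for the protobuf wrapper, and the names `toPackets` and `fromPackets` are made up for illustration. It is not the code I actually used.

```javascript
const { createHash } = require('crypto')

// Hex-encoded SHA-256 of a Uint8Array or Buffer.
const sha256 = (bytes) => createHash('sha256').update(bytes).digest('hex')

// Steps 1-3: hash the data, chunk it, hash each chunk, attach metadata.
function toPackets (data, chunkSize) {
  const finalHash = sha256(data)
  const count = Math.ceil(data.byteLength / chunkSize)
  const packets = []
  for (let index = 0; index < count; index++) {
    const chunk = data.subarray(index * chunkSize, (index + 1) * chunkSize)
    packets.push({ index, count, finalHash, chunkHash: sha256(chunk), chunk })
  }
  return packets
}

// Step 4: accept packets in any order; returns the data once complete and
// verified, null while chunks are missing, and throws on a hash mismatch.
function fromPackets (packets) {
  if (packets.length === 0) return null
  const { count, finalHash } = packets[0]
  const slots = new Array(count).fill(null)
  for (const p of packets) {
    if (sha256(p.chunk) !== p.chunkHash) {
      throw new Error(`chunk ${p.index} does not match its hash`)
    }
    slots[p.index] = p.chunk
  }
  if (slots.includes(null)) return null
  const data = Buffer.concat(slots)
  if (sha256(data) !== finalHash) throw new Error('final hash mismatch')
  return new Uint8Array(data)
}
```

Because each packet carries its index, reassembly is a simple slot fill. Note that a plain hash only detects corruption; against someone who can rewrite the packet together with its hash, it would need to be authenticated (e.g. signed or keyed).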

I think the problem is definitely WebRTC, but I could imagine a benefit to standard 'update sizes' via an intelligent chunking function within the core, as abstractly it seems that's what you're doing when you're moving updates left or right. A buffer's just a bunch of updates to the right. It's just that it loses the advantage while it's trying to do that update.

anyway, I'm deep into my application layer and cannot provide much other than presenting things I've found along the way.

holtwick · 2021-01-05T16:27:59Z

Hi, I would like to join the discussion with a question:
If I'd like to send a bigger file like an image, I guess it doesn't make sense to wrap that in a Y.Doc?

If that's true, what would be the best way to share such a file among peers? Usually I would send a request to a peer to send me the binary data using a DataChannel. Is that correct?

Can we extend y-webrtc to support exchanging additional data formats? Would it be possible to use the same encryption?

dmonad · 2021-01-06T12:31:55Z

In the current state, y-webrtc apparently can't handle large files (depending on the browser being used).

Managing this manually would be pretty hard because you need to coordinate where to get the file from. y-webrtc supports partially connected networks (not every client is connected to every other client).

Therefore, it might make sense to put the image in a subdocument. Then Yjs can handle syncing the image asynchronously. There should be close to no performance overhead if you store the image as a Uint8Array somewhere in Yjs.

dmonad · 2021-01-06T12:32:36Z

Another nice alternative is to use webtorrent (for large files).

disarticulate · 2021-01-06T12:50:10Z

If you use any array, you would need to chunk the transaction to ~16KB to maximize transmission, according to some testing I've seen. Past 64KB, certain browsers silently fail to send.

holtwick · 2021-01-06T13:05:58Z

Thanks, @dmonad and @disarticulate for the valuable feedback. I will test the solutions you mentioned once I get to the implementation of that feature in my project. I'll give feedback on the outcomes.

To summarize the solutions you proposed:

1. Yjs sub-document with data in Uint8Array format
2. Using webtorrent (smart load distribution)
3. Direct transmission with chunked data about 16KB each. Example code

I would add another solution, for my special use case, involving a stupid web server to upload the data once and clients fetching from there.

disarticulate · 2021-03-21T16:39:21Z

I created a monkey-patch hack for SimplePeer here:

https://github.com/disarticulate/y-webrtc/

I did the following:

1. Extended SimplePeer's class as SimplePeerExtended.js
2. Overwrote the import in y-webrtc.js to use the extended version
3. Created two Y.Docs, one for transmission (txDoc) and one for receiving (rxDoc)
4. Created an initial setup and sync transmissions for peers
   a. client1 syncs: txDoc -> rxDoc (one way)
   b. client2 syncs: txDoc -> rxDoc (one way)
5. send(chunk) -> queues data, creates more chunks with packets, and sends each packet into an array in the txDoc
6. txDoc.on('update') -> sends msg to sync
7. rxDoc is updated with msg
8. Upon receipt of all packets, this.push is triggered

It reuses Yjs and no outside packages. It may be a design guide to something more economical. Also, I believe the WebRTC spec doesn't guarantee order of transmission, so the CRDT algo does some work here. Otherwise we're just using the nice encoded dataset given by 'update'.

martinpengellyphillips · 2022-12-29T19:55:12Z

I just encountered this and took a while to determine the issue. What happened in my case is that syncing in Firefox worked, but syncing the same in Chrome suddenly started failing (having worked previously). I eventually narrowed it down to a size issue where a particularly large update was silently breaking y-webrtc for Chrome.

A few questions:

1. Is the related PR here still the best approach to work around this?
2. Is there anything I can do to help get a fix included in y-webrtc itself?
3. Can there be a more visible y-webrtc warning / error when this occurs?

Thanks!

disarticulate · 2023-08-23T20:17:57Z

@martinpengellyphillips here's the #25 pull request. I think some of the feedback is about better integration with @dmonad's approach and comments.

As far as I know, this is just how webrtc is going to handle things. Another solution would be to figure out how to ensure all updates using webrtc are already a max size before using the pipe.

andre-dietrich · 2023-11-03T08:24:04Z

> I created a monkey-patch hack for SimplePeer here:
>
> https://github.com/disarticulate/y-webrtc/
>
> I did the following:
>
> 1. Extended SimplePeer's class as SimplePeerExtended.js
> 2. Overwrote the import in y-webrtc.js to use the extended version
> 3. Created two Y.Docs, one for transmission (txDoc) and one for receiving (rxDoc)
> 4. Created an initial setup and sync transmissions for peers
>    a. client1 syncs: txDoc -> rxDoc (one way)
>    b. client2 syncs: txDoc -> rxDoc (one way)
> 5. send(chunk) -> queues data, creates more chunks with packets, and sends each packet into an array in the txDoc
> 6. txDoc.on('update') -> sends msg to sync
> 7. rxDoc is updated with msg
> 8. Upon receipt of all packets, this.push is triggered
>
> It reuses Yjs and no outside packages. It may be a design guide to something more economical. Also, I believe the WebRTC spec doesn't guarantee order of transmission, so the CRDT algo does some work here. Otherwise we're just using the nice encoded dataset given by 'update'.

@disarticulate ... Thanks for your efforts. I used your fix as an alternative WebRTC provider and it works like a charm; I tested it on different browsers and with images and even video files ...

dmonad mentioned this issue Dec 18, 2020: The content is different under different operating systems (can be reproduce in the monaco demo) yjs/y-monaco#6 (Open)

disarticulate mentioned this issue Jan 8, 2021: Implement a peer to peer mesh topology/routing #22 (Closed)

disarticulate mentioned this issue Mar 21, 2021: 'peers' event on on SignalingConn emits 'added' peer from peer's close event #23 (Open)

disarticulate linked a pull request May 21, 2021 that will close this issue: extend simple peer (not peerjs, oops) to handle buffered/packet transmission; add raw dependency w/MIT license #25 (Open)

datakurre mentioned this issue Apr 13, 2022: Collaborative WebRTC disconnects with big enough update jupyterlite/jupyterlite#598 (Closed)

This was referenced Jan 29, 2023: New issue, there are so many issues amark/gun#1307 (Closed); WebRTC: Browser to Browser libp2p/js-libp2p#1462 (Closed)

jeffrafter mentioned this issue Jul 25, 2023: Error when updating text field with long string #42 (Open)

arvinxx mentioned this issue Mar 14, 2024: ✨ feat: support sync data between different device lobehub/lobe-chat#1525 (Merged)

niklauslee mentioned this issue May 1, 2024: Real-time collaboration dgmjs/dgmjs#42 (Closed)
