
fix: compressor didn't chunkify big payload #3

Merged
merged 8 commits into master from g11tech/chunkcompress on Apr 26, 2022

Conversation

@g11tech commented Apr 14, 2022

As per the snappy frame encoding format, an uncompressed chunk can't be larger than UNCOMPRESSED_CHUNK_SIZE = 65536 bytes.
This PR updates the compressor accordingly, adds a test case covering the new behavior, and bumps the version.
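
For context, a minimal sketch of the chunking rule this PR enforces; UNCOMPRESSED_CHUNK_SIZE comes from the PR, while the generator helper below is purely illustrative:

const UNCOMPRESSED_CHUNK_SIZE = 65536 // max uncompressed bytes per snappy frame

// Yield consecutive slices of at most UNCOMPRESSED_CHUNK_SIZE bytes.
function * chunkify (payload) {
  for (let startFrom = 0; startFrom < payload.length; startFrom += UNCOMPRESSED_CHUNK_SIZE) {
    yield payload.slice(startFrom, startFrom + UNCOMPRESSED_CHUNK_SIZE)
  }
}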

@g11tech requested a review from a team as a code owner on April 14, 2022 17:55
@@ -52,18 +60,25 @@ CompressStream.prototype._uncompressed = function (chunk) {

CompressStream.prototype._transform = function (chunk, enc, callback) {
  var self = this
  async function compressChunks() {


Why async function?

@g11tech (Author) commented Apr 15, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The _transform API is to call callback once the transformation of the chunk completes, so I didn't want to hold up _transform.
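
For reference, this is the Node.js Transform contract being described, as a minimal standalone sketch (the pass-through stream is illustrative):

const { Transform } = require('stream')

const passthrough = new Transform({
  transform (chunk, enc, callback) {
    this.push(chunk) // emit output; push may be called several times per input chunk
    callback()       // signal this chunk is fully handled so the next one is delivered
  }
})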


But there is no await in this body, and compressChunks() is called without any handling, so errors will become unhandled rejections. Can you just make it

  function compressChunks() {

A member commented:

I guess the callback can error and cause an unhandled rejection.

Maybe the implementation can look like:

new Promise(() => {
  ...
  callback(...);
  ...
}).catch((e) => logUnexpectedError(e));

@g11tech (Author) commented:

Done, just console logged it, since nothing fancier is available in this lib.
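
A minimal sketch of the shape described in this thread, assuming a _compressed(chunk, compressed) helper that frames and pushes the compressed bytes (that helper name is an assumption; the Promise wrapper and console logging follow the discussion above):

CompressStream.prototype._transform = function (chunk, enc, callback) {
  var self = this
  // Run everything inside a Promise so anything thrown, including by
  // callback itself, lands in .catch() instead of an unhandled rejection.
  new Promise(function (resolve) {
    for (let startFrom = 0; startFrom < chunk.length; startFrom += UNCOMPRESSED_CHUNK_SIZE) {
      // Buffer.slice clamps the end index, so the last slice may be shorter.
      const bytesChunk = chunk.slice(startFrom, startFrom + UNCOMPRESSED_CHUNK_SIZE)
      self._compressed(bytesChunk, snappy.compressSync(bytesChunk)) // assumed helper
    }
    callback()
    resolve()
  }).catch(function (e) {
    console.error(e) // nothing fancier than the console is available in this lib
  })
}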

for (let startFrom = 0; startFrom < chunk.length; startFrom += UNCOMPRESSED_CHUNK_SIZE) {
  const endAt = startFrom + Math.min(chunk.length - startFrom, UNCOMPRESSED_CHUNK_SIZE);
  const bytesChunk = chunk.slice(startFrom, endAt);
  const compressed = snappy.compressSync(bytesChunk)


Why switch to the sync version? If we want to do that, we should do it in another PR and study the implications in depth.

@g11tech (Author) commented Apr 15, 2022


@dapplion the async version, where I chunk the bytes from outside (i.e. pre-chunk the data and call compressStream.write(chunk)) using the original flow, is about 2x slower than this approach, where the chunking happens inside _transform; in other words, the compressSync-based solution comes out ahead:

[benchmark screenshot; orig-snappy is the original, with data chunked at the compressStream.write level]

If I use the async version inside _transform with an await new Promise((resolve) => ...) wrapper to make sure the asynchronously compressed chunks are written in the correct serial order, it's 5-10% slower because of the overhead; if I don't care about the serial order in which chunks are written (which yields an incorrect stream), the async chunking version inside _transform is about 5% faster.

I think for now this is our best bet 🙂


Okay, sounds good to me. Can you commit a benchmark, though? It would inform us of the cost of compressing and uncompressing objects of 10 kB, 100 kB, and 1000 kB.
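
A rough sketch of the kind of benchmark being requested, assuming the package exposes createCompressStream as upstream snappy-stream does (the entry point and payload contents are assumptions):

const { createCompressStream } = require('snappy-stream') // assumed entry point

// Time compression of increasingly large payloads through the stream.
for (const size of [10e3, 100e3, 1000e3]) {
  const payload = Buffer.alloc(size, 0xab)
  const compress = createCompressStream()
  const start = process.hrtime.bigint()
  compress.resume() // drain the readable side so the transform keeps flowing
  compress.on('end', () => {
    const ms = Number(process.hrtime.bigint() - start) / 1e6
    console.log(`${size} bytes compressed in ${ms.toFixed(2)} ms`)
  })
  compress.end(payload)
}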

@mpetrunic (Member) commented Apr 15, 2022

I suggest merging #4 first and reverting the version bump here ^^

@mpetrunic changed the title from "Update the compressor to chunkify a big payload" to "fix: compressor didn't chunkify big payload" on Apr 15, 2022
@g11tech mentioned this pull request on Apr 22, 2022
*
*/

CompressStream.prototype._transform = function(chunk, enc, callback) {
  var self = this
A member commented:

I guess we don't need self anymore now

@wemeetagain (Member) left a comment:

LGTM

@wemeetagain merged commit a658af0 into master on Apr 26, 2022
@wemeetagain deleted the g11tech/chunkcompress branch on Apr 26, 2022 09:46