perf/feature: new serialization format for constraint systems #1119

gbotrel · 2024-04-29T02:57:17Z

Description

This PR uses integer compression techniques to serialize the parts of the constraint system that contain ids, offsets, etc., bypassing our current choice of cbor for those parts (cbor is still used for the rest).

As a benchmark, using lzss.DecompressionTestCircuit with parameters that land at 90M constraints (plonk), the serialized size goes from 9225Mb to 2238Mb, and we serialize (and, most importantly, deserialize) much faster:

benchmark                                      old ns/op        new ns/op      delta
BenchmarkSerialization/serialize_cbor-96       54962357901      3086254450     -94.38%
BenchmarkSerialization/serialize_cbor-96       41877637150      3117870148     -92.55%
BenchmarkSerialization/deserialize_cbor-96     100910397720     4267472009     -95.77%
BenchmarkSerialization/deserialize_cbor-96     117815108278     3776636214     -96.79%
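
To illustrate why integer compression helps on id and offset data, here is a minimal sketch using only the Go standard library. It is not the code from this PR: it swaps in plain delta + uvarint encoding (`encoding/binary`) for the intcomp-based scheme, and the names `encodeDeltas` and `decodeDeltas` are hypothetical. The idea is that ids that grow slowly produce small deltas, and small values take one or two bytes as uvarints instead of a fixed four.

```go
package main

import (
	"encoding/binary"
	"errors"
	"fmt"
	"math"
)

// encodeDeltas writes a length prefix followed by the difference between
// each id and the previous one, all as uvarints. Differences wrap modulo
// 2^32, so any sequence round-trips; only slowly growing ones shrink.
func encodeDeltas(ids []uint32) []byte {
	buf := make([]byte, 0, len(ids)+binary.MaxVarintLen64)
	buf = binary.AppendUvarint(buf, uint64(len(ids)))
	var prev uint32
	for _, id := range ids {
		buf = binary.AppendUvarint(buf, uint64(id-prev))
		prev = id
	}
	return buf
}

// decodeDeltas reverses encodeDeltas and rejects malformed input.
func decodeDeltas(buf []byte) ([]uint32, error) {
	n, k := binary.Uvarint(buf)
	if k <= 0 {
		return nil, errors.New("invalid length prefix")
	}
	buf = buf[k:]
	// Every encoded value occupies at least one byte.
	if n > uint64(len(buf)) {
		return nil, errors.New("length prefix larger than remaining input")
	}
	ids := make([]uint32, 0, n)
	var prev uint32
	for i := uint64(0); i < n; i++ {
		d, k := binary.Uvarint(buf)
		if k <= 0 || d > math.MaxUint32 {
			return nil, errors.New("invalid delta")
		}
		buf = buf[k:]
		prev += uint32(d)
		ids = append(ids, prev)
	}
	return ids, nil
}

func main() {
	ids := []uint32{3, 7, 7, 1000, 70000}
	enc := encodeDeltas(ids)
	dec, err := decodeDeltas(enc)
	if err != nil {
		panic(err)
	}
	fmt.Printf("fixed-width: %d bytes, delta+uvarint: %d bytes, decoded: %v\n",
		4*len(ids), len(enc), dec)
}
```

The actual gains in the PR come from the intcomp-based format and depend on the data; this sketch only shows the general principle.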

ivokub

Very nice optimizations! I think in general it is good, but in a perfect scenario I would try to figure out whether we can work with io.Writer and io.Reader instead of direct byte arrays, as imo we otherwise have many copies of very similar structures in memory (the constraint system itself, the bytes when serializing it, and also a byte buffer for internal steps such as serializing calldata). But because compression is applied to the uints, we still cannot get rid of all of them.

And I completely agree that it doesn't make sense for us to include additional compression in the implementation. It should be straightforward to add gzip etc. when the user is interested in it. Otherwise there are too many options (fast or good compression ...)
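
For a user who does want extra compression on top, a rough sketch of layering gzip from the outside could look like the following. It assumes the object being serialized implements `io.WriterTo` and `io.ReaderFrom`; `bytes.Buffer` stands in for it here, and the helper names `writeGzipped`, `readGzipped` and `roundTrip` are hypothetical.

```go
package main

import (
	"bytes"
	"compress/gzip"
	"fmt"
	"io"
)

// writeGzipped streams src through a gzip writer into dst.
func writeGzipped(dst io.Writer, src io.WriterTo) (int64, error) {
	zw := gzip.NewWriter(dst)
	n, err := src.WriteTo(zw)
	if err != nil {
		zw.Close()
		return n, err
	}
	// Close flushes the remaining compressed data and the gzip footer.
	return n, zw.Close()
}

// readGzipped decompresses src and feeds the result to dst.
func readGzipped(dst io.ReaderFrom, src io.Reader) (int64, error) {
	zr, err := gzip.NewReader(src)
	if err != nil {
		return 0, err
	}
	defer zr.Close()
	return dst.ReadFrom(zr)
}

// roundTrip compresses data, decompresses it again, and reports the
// restored bytes together with the compressed size.
func roundTrip(data []byte) ([]byte, int, error) {
	var compressed bytes.Buffer
	if _, err := writeGzipped(&compressed, bytes.NewBuffer(data)); err != nil {
		return nil, 0, err
	}
	size := compressed.Len()
	var restored bytes.Buffer
	if _, err := readGzipped(&restored, &compressed); err != nil {
		return nil, 0, err
	}
	return restored.Bytes(), size, nil
}

func main() {
	data := bytes.Repeat([]byte("constraint "), 1000)
	restored, size, err := roundTrip(data)
	if err != nil {
		panic(err)
	}
	fmt.Printf("raw: %d bytes, gzipped: %d bytes, equal: %v\n",
		len(data), size, bytes.Equal(data, restored))
}
```

Keeping this outside the library leaves the choice of codec and compression level to the caller, for example via `gzip.NewWriterLevel`.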

internal/generator/backend/template/representations/coeff.go.tmpl

constraint/marshal.go

internal/backend/ioutils/intcomp_test.go

ivokub · 2024-04-30T10:18:34Z

Yep, it makes sense that if we work with []byte then we can have parallel processing, which is more important than saving a few GB in settings where memory usage is in the tens of GBs anyway. Added the test corpus.

Good to merge from my side.
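
As an illustration of the []byte versus io.Writer trade-off discussed above, here is a minimal sketch of encoding chunks concurrently into separate byte slices and concatenating them in order. It is not the PR's implementation: `encodeChunk` and `encodeParallel` are hypothetical names, and plain uvarint encoding stands in for the real format. A single sequential io.Writer would not allow this, since each goroutine needs its own buffer.

```go
package main

import (
	"encoding/binary"
	"fmt"
	"sync"
)

// encodeChunk serializes one chunk as a sequence of uvarints.
func encodeChunk(vals []uint32) []byte {
	buf := make([]byte, 0, len(vals))
	for _, v := range vals {
		buf = binary.AppendUvarint(buf, uint64(v))
	}
	return buf
}

// encodeParallel encodes fixed-size chunks in separate goroutines, each
// into its own buffer, then concatenates the buffers in chunk order.
func encodeParallel(vals []uint32, chunkSize int) []byte {
	if chunkSize <= 0 {
		chunkSize = 1
	}
	nChunks := (len(vals) + chunkSize - 1) / chunkSize
	parts := make([][]byte, nChunks)
	var wg sync.WaitGroup
	for i := 0; i < nChunks; i++ {
		start := i * chunkSize
		end := start + chunkSize
		if end > len(vals) {
			end = len(vals)
		}
		wg.Add(1)
		go func(i int, chunk []uint32) {
			defer wg.Done()
			parts[i] = encodeChunk(chunk) // each goroutine writes only its own slot
		}(i, vals[start:end])
	}
	wg.Wait()

	total := 0
	for _, p := range parts {
		total += len(p)
	}
	out := make([]byte, 0, total)
	for _, p := range parts {
		out = append(out, p...)
	}
	return out
}

func main() {
	vals := []uint32{1, 300, 70000, 5, 9, 12, 4000000000}
	par := encodeParallel(vals, 3)
	seq := encodeChunk(vals)
	fmt.Printf("parallel: %d bytes, sequential: %d bytes, identical: %v\n",
		len(par), len(seq), string(par) == string(seq))
}
```

The cost is the one noted in the review: the per-chunk buffers and the final output both live in memory at the same time. A real implementation would also bound the number of goroutines, and parallel decoding would additionally need per-chunk offsets, which this sketch omits.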

gbotrel added 2 commits April 28, 2024 10:25

step1: start to explore using intcomp

68aa034

perf: partial binary serialization for constraint system

6c210d5

gbotrel marked this pull request as draft April 29, 2024 02:57

gbotrel added 5 commits April 29, 2024 09:26

test: regenerate test circuits

7e1a271

perf: make it parallel

e86061f

style: cosmetics edit

58a7356

feat: use uvarint for call data and regenerate regression test scs

ff9ce95

build: clean go.mod

2bf1dc5

gbotrel requested review from Tabaie and ivokub April 29, 2024 21:26

gbotrel marked this pull request as ready for review April 29, 2024 21:26

test: add fuzztest for intcomp

9a15e13

ivokub approved these changes Apr 29, 2024

test: add fuzzing seed corpus

30baf99

gbotrel merged commit f3c5cf3 into master Apr 30, 2024
7 checks passed

gbotrel deleted the perf/cbor_circuit branch April 30, 2024 13:05
