Implement `ADDMOD` #564

adria0 · 2022-06-10T17:00:28Z

Specs in https://github.com/privacy-scaling-explorations/zkevm-specs/blob/master/specs/opcode/08ADDMOD.md. It shares some code with MULMOD

CPerezz · 2022-06-20T11:32:47Z

So for the curiuos ones:

After checking the issue with @ed255 we saw that adding codegen-units=1 as it is in the bench profile solves the issue.

That basically tells us that the error is on the compilation pipeline of rustc. Which leads to a machine code that tries to access a wrong memory section.

We've obtained LLDB and gdb traces and will submit an issue into rustlang/rust

What that means for us?

Meanwhile the issue is not solved, we need to compile our code with codegen-units=1 (which will cause the CI to be significantly slower).
We do have the alternative which is #[ignore] all the bytecode tests that lead to this issue until the error in rust is solved and we have a new release for it.

We need to try one last thing which is try the latest nightly and stable toolchains and compile with them.

This might show that newer releases come with this issue solved and therefore we can ignore this (although we fill the issue anyway).

This shares a lot of similarities with rust-lang/rust#62896 and might have to do with a misscompilation that rustc is doing.

ed255

Overall looks good!
Please take a look at my comments.

The current concern right now for this PR is resolving the segfault, which seems unrelated to this PR.

zkevm-circuits/src/evm_circuit/util/math_gadget.rs

zkevm-circuits/src/evm_circuit/execution/addmod.rs

ed255 · 2022-06-20T13:03:19Z

In my current investigation of the segfault I made the hypothesis that the issue comes from a stack overflow.

Analyzing the segfault with gdb we see that it happens in the function probestack. The code in this function says:

// Our goal here is to touch each page between %rsp+8 and %rsp+8-%rax,
// ensuring that if any pages are unmapped we'll make a page fault.

I was puzzled because there was no report of stack overflow, only invalid memory. But maybe it's the case that a stack overflow is only reported for the main thread, and other threads just crash when accessing the memory outside of the stack?

To test this hypothesis I ran the test with an increased minimum stack for threads (default is 2MB, I use 16MB):

RUST_BACKTRACE=1 RUST_MIN_STACK=16777216 cargo test --release addmod_simple -- --nocapture

And the tests passes without crash!

CPerezz · 2022-06-20T13:08:48Z

In my current investigation of the segfault I made the hypothesis that the issue comes from a stack overflow.

Analyzing the segfault with gdb we see that it happens in the function probestack. The code in this function says:
// Our goal here is to touch each page between %rsp+8 and %rsp+8-%rax,
// ensuring that if any pages are unmapped we'll make a page fault.
I was puzzled because there was no report of stack overflow, only invalid memory. But maybe it's the case that a stack overflow is only reported for the main thread, and other threads just crash when accessing the memory outside of the stack?

To test this hypothesis I ran the test with an increased minimum stack for threads (default is 2MB, I use 16MB):
RUST_BACKTRACE=1 RUST_MIN_STACK=16777216 cargo test --release addmod_simple -- --nocapture
And the tests passes without crash!

Should we make a PR including this env var to .cargo/config.toml?

zkevm-circuits/src/evm_circuit/util/math_gadget.rs

adria0 · 2022-06-20T14:15:17Z

And the tests passes without crash!

Should we make a PR including this env var to .cargo/config.toml?

Awesome @ed255! Confirmed that also works with M1 :)

ed255 · 2022-06-20T14:31:45Z

zkevm-circuits/src/evm_circuit/execution/addmod.rs

+        self.muladd_d_n_r.assign(
+            region,
+            offset,
+            [d, n, a_reduced_plus_b, a_reduced_plus_b_overflow],


Suggested change

[d, n, a_reduced_plus_b, a_reduced_plus_b_overflow],

[d, n, a_reduced_plus_b_overflow. a_reduced_plus_b],

The overflow bit should come first

Ok, bad news, all tests passes both for

[d, n, a_reduced_plus_b, a_reduced_plus_b_overflow] and
[d, n, a_reduced_plus_b_overflow, a_reduced_plus_b]

🧐

oh... We need to figure out why! 🕵️‍♀️

I have a test case missing!

But the missing test is not related...

@davidnevadoc, the MulAdd512WordGadgets is ok with the following witness

MulAddWords512Gadget([1,10,12,0], Some(2))

could you check it? Maybe I'm wrong somewhere but I am not able to find it

I think I found the reason why this works when passing swapped d and e in the assign function of MulAddWords512Gadget.
On one hand, MulAddWords512Gadget only assigns to its internal witness which are carry_0, carry_1 and carry_2. And those values are calculated from a, b, c, d, e. If I remember correctly, for ADDMOD we only needed 257 bit arithmetic, but we're using MulAddWords512Gadget for convenience. This means that the possible values we will encounter for the carry witnesses is more limited (~~I think they are always 0 or 1~~). And it seems that swapping d and e (using values from ADDMOD), the calculated carry witnesses stay the same! Or at least that's the case with the tests.

I'm not sure if there's a case where swapping d and e in the assign function for MulAddWords512Gadget could change the carry witnesses.

Let's see what @davidnevadoc says, @ed255 !

I'm not sure if there's a case where swapping d and e in the assign function for MulAddWords512Gadget could change the carry witnesses.

I've tried to find some combination of inputs that calculates in different carry witnesses when swapping d and e, but I didn't find any. It's a bit tricky because at this point the inputs are derived from other calculations. So even after trying I'm not convinced that such inputs don't exist.

Anyway, I think we can merge this PR even if this is not yet resolved!

The reason for this is what @ed255 said plus the fact that we are using saturating_sub in assign.
Turns out that in this example the correct values for the 3 carry are 0. When d and e are swapped, the computation for these carries would result in an underflow and a panic. Instead of this, the use of saturating_sub gives a 0 (the correct value by chance).
The constraints pass because the rest of the cells (d and e included) are placed correctly.

zkevm-circuits/src/evm_circuit/execution/addmod.rs

ed255 · 2022-06-20T15:20:23Z

I've been debugging for a while the current code trying to figure out what is causing the high stack consumption that leads to the overflow (seen with the invalid memory segfault), and I found one of the culprits!
The type ExecutionConfig in src/evm_circuit/execution.rs has been growing with each gadget that we've added, and it currently uses 337 KiB of memory. This struct is nested in other structs that are used as return values in functions, which causes several functions in the call stack to use more than 337 KiB of stack memory, which easily adds up!

While increasing the RUST_MIN_STACK solves the issue, I think we should also be careful to not have big values in the stack, so I propose moving the ExecutionConfig to the heap like this:

--- a/zkevm-circuits/src/evm_circuit.rs
+++ b/zkevm-circuits/src/evm_circuit.rs
@@ -22,7 +22,7 @@ use witness::Block;
 pub struct EvmCircuit<F> {
     fixed_table: [Column<Fixed>; 4],
     byte_table: [Column<Fixed>; 1],
-    execution: ExecutionConfig<F>,
+    execution: Box<ExecutionConfig<F>>,
 }

 impl<F: Field> EvmCircuit<F> {
@@ -38,7 +38,7 @@ impl<F: Field> EvmCircuit<F> {
         let fixed_table = [(); 4].map(|_| meta.fixed_column());
         let byte_table = [(); 1].map(|_| meta.fixed_column());

-        let execution = ExecutionConfig::configure(
+        let execution = Box::new(ExecutionConfig::configure(
             meta,
             power_of_randomness,
             &fixed_table,
@@ -47,7 +47,7 @@ impl<F: Field> EvmCircuit<F> {
             rw_table,
             bytecode_table,
             block_table,
-        );
+        ));

         Self {
             fixed_table,

zkevm-circuits/src/evm_circuit/execution/addmod.rs

…ons/zkevm-circuits into feature/addmod

ed255

LGTM! Reviewing this took longer than expected!

davidnevadoc

LGTM! Good work 👍

davidnevadoc · 2022-06-21T10:53:48Z

zkevm-circuits/src/evm_circuit/execution/addmod.rs

+        self.muladd_d_n_r.assign(
+            region,
+            offset,
+            [d, n, a_reduced_plus_b, a_reduced_plus_b_overflow],


The reason for this is what @ed255 said plus the fact that we are using saturating_sub in assign.
Turns out that in this example the correct values for the 3 carry are 0. When d and e are swapped, the computation for these carries would result in an underflow and a panic. Instead of this, the use of saturating_sub gives a 0 (the correct value by chance).
The constraints pass because the rest of the cells (d and e included) are placed correctly.

github-actions bot added the crate-zkevm-circuits Issues related to the zkevm-circuits workspace member label Jun 10, 2022

Implement ADDMOD

1ca4427

adria0 force-pushed the feature/addmod branch from f89e3dc to 1ca4427 Compare June 10, 2022 17:12

adria0 requested a review from ed255 June 14, 2022 18:06

Merge branch 'main' into feature/addmod

5ad56ea

adria0 force-pushed the feature/addmod branch 3 times, most recently from a55976b to b83316d Compare June 20, 2022 08:53

ed255 reviewed Jun 20, 2022

View reviewed changes

merge fix

92f45d9

adria0 force-pushed the feature/addmod branch from b83316d to 92f45d9 Compare June 20, 2022 13:07

adria0 commented Jun 20, 2022

View reviewed changes

zkevm-circuits/src/evm_circuit/util/math_gadget.rs Outdated Show resolved Hide resolved

adria0 commented Jun 20, 2022

View reviewed changes

zkevm-circuits/src/evm_circuit/util/math_gadget.rs Show resolved Hide resolved

adria0 and others added 3 commits June 20, 2022 15:53

Apply suggestions from @ed255 code review

a70aaba

Merge branch 'main' into feature/addmod

fe85380

fmt

46994df

ed255 reviewed Jun 20, 2022

View reviewed changes

adria0 added 2 commits June 20, 2022 16:40

added RUST_MIN_STACK

b4cee20

fix order

13cd65a

ed255 reviewed Jun 20, 2022

View reviewed changes

zkevm-circuits/src/evm_circuit/execution/addmod.rs Outdated Show resolved Hide resolved

ed255 mentioned this pull request Jun 20, 2022

Box ExecutionConfig in EvmCircuit #583

Merged

adria0 commented Jun 20, 2022

View reviewed changes

zkevm-circuits/src/evm_circuit/execution/addmod.rs Outdated Show resolved Hide resolved

adria0 and others added 3 commits June 20, 2022 18:40

Add a_reduced + b requieres 512 bit. Improve tests.

4ff67ef

Add a_reduced + b requieres 512 bit. Improve tests.

e6ee7e5

Merge branch 'main' into feature/addmod

85d8db3

adria0 and others added 4 commits June 21, 2022 11:16

Merge branch 'feature/addmod' of github.com:privacy-scaling-explorati…

ed61005

…ons/zkevm-circuits into feature/addmod

remove stack set

f67abb0

make tests verbose on error

7e5a3f6

Merge branch 'main' into feature/addmod

7c24153

adria0 requested review from davidnevadoc, miha-stopar and ed255 June 21, 2022 10:09

ed255 approved these changes Jun 21, 2022

View reviewed changes

davidnevadoc approved these changes Jun 21, 2022

View reviewed changes

adria0 merged commit 50c5194 into main Jun 21, 2022

adria0 deleted the feature/addmod branch August 2, 2022 11:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement `ADDMOD` #564

Implement `ADDMOD` #564

adria0 commented Jun 10, 2022

CPerezz commented Jun 20, 2022

ed255 left a comment

ed255 commented Jun 20, 2022

CPerezz commented Jun 20, 2022

adria0 commented Jun 20, 2022

ed255 Jun 20, 2022

adria0 Jun 20, 2022

ed255 Jun 20, 2022

adria0 Jun 20, 2022

adria0 Jun 20, 2022

ed255 Jun 21, 2022 •

edited

Loading

adria0 Jun 21, 2022

ed255 Jun 21, 2022

davidnevadoc Jun 21, 2022

ed255 commented Jun 20, 2022

ed255 left a comment

davidnevadoc left a comment

davidnevadoc Jun 21, 2022

	[d, n, a_reduced_plus_b, a_reduced_plus_b_overflow],
	[d, n, a_reduced_plus_b_overflow. a_reduced_plus_b],

Implement ADDMOD #564

Implement ADDMOD #564

Conversation

adria0 commented Jun 10, 2022

CPerezz commented Jun 20, 2022

ed255 left a comment

Choose a reason for hiding this comment

ed255 commented Jun 20, 2022

CPerezz commented Jun 20, 2022

adria0 commented Jun 20, 2022

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ed255 Jun 21, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ed255 commented Jun 20, 2022

ed255 left a comment

Choose a reason for hiding this comment

davidnevadoc left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Implement `ADDMOD` #564

Implement `ADDMOD` #564

ed255 Jun 21, 2022 •

edited

Loading