feat(avm): kernel output opcodes #6416

Maddiaa0 · 2024-05-15T11:40:23Z

Overview

This pr implements:

emit note hash
note hash exists (only positive case)
nullifier exists (wip)
emit nullifier
l1 to l2 msg exists
emit unencrypted log
emit l2 to l1 msg
sload
sstore

Currently Not Implemented (TODO)

Negative tests have not been performed in this pr, and will be revisted - chore(avm): negative tests for kernel output opcodes #6468
Further kernel circuit changes will be required to fit this outputs format - chore(avm): refactor the public kernel public inputs to line up with the AVM outputs #6469
- Update check exists opcodes to return booleans that will be enforced in the kernels (done)
Start and end side effects counter must be constrained - feat(avm): constrain start and end side effects counter to line up #6471
The write offfsets for each opcode are not currently constrained to be within a certain range (e.g. max new nullifiers per tx) feat(avm): range constrain kernel output write_offsets to be less than MAX PER TX #6465
The write offsets are not constrained to start at 0 (boundary constraints) feat(avm): constrain start write offsets to be 0 #6467
PIILgen must be updated to create verifier checks for multiple public input columns - pr here feat: codegen for multiple public input columns powdr#61

…hod for proof

…columns

…pcode

…columns

… md/04-12-feat_example_caller_and_address_opcode

…pcode

…columns

AztecBot · 2024-05-15T15:07:10Z

Benchmark results

Metrics with a significant change:

protocol_circuit_simulation_time_in_ms (public-kernel-setup): 354 (-44%)
protocol_circuit_simulation_time_in_ms (public-kernel-teardown): 314 (-41%)
protocol_circuit_input_size_in_bytes (public-kernel-setup): 82,270 (-22%)
protocol_circuit_input_size_in_bytes (public-kernel-teardown): 82,979 (-21%)
protocol_circuit_output_size_in_bytes (public-kernel-setup): 68,469 (-21%)
protocol_circuit_output_size_in_bytes (public-kernel-teardown): 69,026 (-20%)
protocol_circuit_witness_generation_time_in_ms (base-parity): 765 (-37%)
protocol_circuit_witness_generation_time_in_ms (base-rollup): 5,426 (+147%)
protocol_circuit_witness_generation_time_in_ms (public-kernel-app-logic): 1,036 (-65%)
protocol_circuit_witness_generation_time_in_ms (root-parity): 166 (+144%)
protocol_circuit_witness_generation_time_in_ms (public-kernel-tail): 6,080 (-73%)
protocol_circuit_witness_generation_time_in_ms (root-rollup): 132 (+108%)
protocol_circuit_proving_time_in_ms (base-parity): 8,168 (+179%)
protocol_circuit_proving_time_in_ms (public-kernel-app-logic): 12,227 (-74%)
protocol_circuit_proving_time_in_ms (base-rollup): 132,636 (+79%)
protocol_circuit_proving_time_in_ms (root-parity): 114,652 (+158%)
protocol_circuit_proving_time_in_ms (public-kernel-tail): 38,783 (-75%)
protocol_circuit_proof_size_in_bytes (public-kernel-app-logic): 77,704 (-32%)

Detailed results

All benchmarks are run on txs on the Benchmarking contract on the repository. Each tx consists of a batch call to create_note and increment_balance, which guarantees that each tx has a private call, a nested private call, a public call, and a nested public call, as well as an emitted private note, an unencrypted log, and public storage read and write.

This benchmark source data is available in JSON format on S3 here.

Proof generation

Each column represents the number of threads used in proof generation.

Metric	1 threads	4 threads	16 threads	32 threads	64 threads
proof_construction_time_sha256	5,700	1,546	713	786	780

L2 block published to L1

Each column represents the number of txs on an L2 block published to L1.

Metric	8 txs	32 txs	64 txs
l1_rollup_calldata_size_in_bytes	1,412	1,412	1,412
l1_rollup_calldata_gas	9,464	9,476	9,476
l1_rollup_execution_gas	616,105	616,117	616,117
l2_block_processing_time_in_ms	1,293	4,823	9,558 (-1%)
l2_block_building_time_in_ms	44,521 (-1%)	176,481	352,937
l2_block_rollup_simulation_time_in_ms	44,351 (-1%)	175,804	351,689
l2_block_public_tx_process_time_in_ms	23,847 (-1%)	99,670 (-1%)	204,204 (-1%)

L2 chain processing

Each column represents the number of blocks on the L2 chain where each block has 16 txs.

Metric	3 blocks	5 blocks
node_history_sync_time_in_ms	9,478 (-1%)	14,509
node_database_size_in_bytes	14,491,728	21,373,008
pxe_database_size_in_bytes	18,071	29,868

Circuits stats

Stats on running time and I/O sizes collected for every kernel circuit run across all benchmarks.

Circuit	simulation_time_in_ms	witness_generation_time_in_ms	proving_time_in_ms	input_size_in_bytes	output_size_in_bytes	proof_size_in_bytes	num_public_inputs	size_in_gates
private-kernel-init	158 (-2%)	3,776 (+1%)	22,817	20,418 (-1%)	62,921 (-3%)	89,536	2,731	1,048,576
private-kernel-inner	607 (-2%)	4,340 (-2%)	45,944 (+7%)	90,048 (-2%)	62,597 (-3%)	89,536	2,731	2,097,152
private-kernel-tail	571 (+1%)	2,921 (-13%)	36,006 (-3%)	96,541	77,498	10,656	266	2,097,152
base-parity	7.16 (+9%)	⚠️ 765 (-37%)	⚠️ 8,168 (+179%)	128	64.0	2,208	2.00	131,072
root-parity	51.0 (+2%)	⚠️ 166 (+144%)	⚠️ 114,652 (+158%)	27,080	64.0	2,720	18.0	2,097,152
base-rollup	723 (-3%)	⚠️ 5,426 (+147%)	⚠️ 132,636 (+79%)	119,058	766 (+1%)	3,285 (-10%)	47.0	4,194,304
root-rollup	105 (-6%)	⚠️ 132 (+108%)	18,178 (-7%)	22,989 (-9%)	648 (+5%)	3,440	41.0	1,048,576
public-kernel-app-logic	510 (-2%)	⚠️ 1,036 (-65%)	⚠️ 12,227 (-74%)	103,785 (-1%)	85,392 (-1%)	⚠️ 77,704 (-32%)	3,520	2,097,152
public-kernel-tail	1,056 (-3%)	⚠️ 6,080 (-73%)	⚠️ 38,783 (-75%)	384,359 (-3%)	7,530	10,248 (-4%)	266	8,388,608
private-kernel-reset-small	594 (+1%)	2,182 (+4%)	45,391 (-1%)	120,733	64,614	89,536	2,731	2,097,152
private-kernel-ordering	685	N/A	N/A	213,464	34,764	N/A	N/A	N/A
public-kernel-setup	⚠️ 354 (-44%)	276	405	⚠️ 82,270 (-22%)	⚠️ 68,469 (-21%)	65,344	N/A	N/A
public-kernel-teardown	⚠️ 314 (-41%)	310	345	⚠️ 82,979 (-21%)	⚠️ 69,026 (-20%)	65,344	N/A	N/A
merge-rollup	29.1 (+1%)	114	2,469	16,534	756	3,104	N/A	N/A
private-kernel-tail-to-public	N/A	8,899 (+4%)	92,991 (+2%)	N/A	N/A	114,784	3,520	4,194,304

Stats on running time collected for app circuits

Function	input_size_in_bytes	output_size_in_bytes	witness_generation_time_in_ms	proof_size_in_bytes	proving_time_in_ms	size_in_gates	num_public_inputs
ContractClassRegisterer:register	1,344	9,944	466 (-1%)	N/A	N/A	N/A	N/A
ContractInstanceDeployer:deploy	1,408	9,944	41.9 (-1%)	N/A	N/A	N/A	N/A
MultiCallEntrypoint:entrypoint	1,920	9,944	1,437	N/A	N/A	N/A	N/A
SchnorrAccount:constructor	1,312	9,944	989	N/A	N/A	N/A	N/A
SchnorrAccount:entrypoint	2,304	9,944	2,091	16,768	51,429	2,097,152	457
Token:privately_mint_private_note	1,280	9,944	1,137	N/A	N/A	N/A	N/A
Token:transfer	1,376	9,944	4,058	16,768	52,979 (+15%)	2,097,152	457
Benchmarking:create_note	1,312	9,944	951	N/A	N/A	N/A	N/A
FPC:fee_entrypoint_public	1,344	9,944	219	N/A	N/A	N/A	N/A
SchnorrAccount:spend_private_authwit	1,280	9,944	77.3 (-1%)	N/A	N/A	N/A	N/A
Token:unshield	1,376	9,944	3,248	N/A	N/A	N/A	N/A
FPC:fee_entrypoint_private	1,376	9,944	4,014 (-1%)	N/A	N/A	N/A	N/A

Tree insertion stats

The duration to insert a fixed batch of leaves into each tree type.

Metric	1 leaves	16 leaves	64 leaves	128 leaves	512 leaves	1024 leaves	2048 leaves	4096 leaves	32 leaves
batch_insert_into_append_only_tree_16_depth_ms	10.5 (+1%)	17.1 (+1%)	N/A	N/A	N/A	N/A	N/A	N/A	N/A
batch_insert_into_append_only_tree_16_depth_hash_count	16.7	31.8	N/A	N/A	N/A	N/A	N/A	N/A	N/A
batch_insert_into_append_only_tree_16_depth_hash_ms	0.611 (+1%)	0.525 (+1%)	N/A	N/A	N/A	N/A	N/A	N/A	N/A
batch_insert_into_append_only_tree_32_depth_ms	N/A	N/A	48.8	76.0 (-2%)	245	474 (-1%)	928	1,837 (-1%)	N/A
batch_insert_into_append_only_tree_32_depth_hash_count	N/A	N/A	95.9	159	543	1,055	2,079	4,127	N/A
batch_insert_into_append_only_tree_32_depth_hash_ms	N/A	N/A	0.498	0.468 (-2%)	0.444	0.442 (-1%)	0.440	0.439	N/A
batch_insert_into_indexed_tree_20_depth_ms	N/A	N/A	59.3 (+1%)	112 (-1%)	355	697 (-1%)	1,383	2,761	N/A
batch_insert_into_indexed_tree_20_depth_hash_count	N/A	N/A	107 (+1%)	208	692	1,363	2,707	5,395	N/A
batch_insert_into_indexed_tree_20_depth_hash_ms	N/A	N/A	0.510	0.501 (-1%)	0.481	0.478 (-1%)	0.478	0.479	N/A
batch_insert_into_indexed_tree_40_depth_ms	N/A	N/A	N/A	N/A	N/A	N/A	N/A	N/A	62.9 (+1%)
batch_insert_into_indexed_tree_40_depth_hash_count	N/A	N/A	N/A	N/A	N/A	N/A	N/A	N/A	107
batch_insert_into_indexed_tree_40_depth_hash_ms	N/A	N/A	N/A	N/A	N/A	N/A	N/A	N/A	0.554

Miscellaneous

Transaction sizes based on how many contract classes are registered in the tx.

Metric	0 registered classes	1 registered classes
tx_size_in_bytes	82,355 (-2%)	758,659 (+14%)

Transaction size based on fee payment method

| Metric | |
| - | |

jeanmon

Good work! Please look at my feedback before merging.

jeanmon · 2024-05-21T11:42:07Z

barretenberg/cpp/pil/avm/avm_main.pil

+
+    // When we encounter a state writing opcode
+    // We increment the side effect counter by 1
+    KERNEL_OUTPUT_SELECTORS * (avm_kernel.side_effect_counter' - (avm_kernel.side_effect_counter + 1)) = 0;


Is this crucial to start at zero? (Probably, otherwise a malicious prover might choose a huge value and then you get overflow/wraparound. Not sure about what the kernel is expecting.)

If yes, then we should have a constraint to enforce the initial value to be zero. (maybe sthg like: first * avm_kernel.side_effect_counter =0 )

Side effect counter will be required to be constrained to the value of the start_side_effect_counter in the public inputs - this can be done with a copy constraint or via a lookup

jeanmon · 2024-05-21T11:43:14Z

barretenberg/cpp/pil/avm/avm_main.pil

+    // OUTPUTS LOOKUPS
+    // Constrain the value of kernel_out_sel to be the correct offset for the operation being performed
+    #[NOTE_HASH_KERNEL_OUTPUT]
+    sel_op_note_hash_exists * (avm_kernel.kernel_out_sel - (avm_kernel.START_NOTE_HASH_EXISTS_WRITE_OFFSET + avm_kernel.note_hash_exist_write_offset)) = 0;


avm_kernel.note_hash_exist_write_offset should be constrained to be initialized to zero. (maybe on first line?)

Same holds for all other offsets.

jeanmon · 2024-05-21T11:48:25Z

barretenberg/cpp/pil/avm/avm_kernel.pil

    pol commit kernel_sel;
+    pol commit kernel_out_sel;


The suffix "sel" makes us think it is a boolean. I think "offset" suffix might be better,

jeanmon · 2024-05-21T11:52:50Z

barretenberg/cpp/src/barretenberg/vm/avm_trace/avm_common.hpp

@@ -11,6 +11,17 @@ using Flavor = bb::AvmFlavor;
 using FF = Flavor::FF;
 using Row = bb::AvmFullRow<bb::fr>;

+// There are 4 public input columns, 1 for context inputs, and 3 for emitting side effects
+using VM_PUBLIC_INPUTS = std::tuple<std::array<FF, KERNEL_INPUTS_LENGTH>,   // Input: Kernel context inputs


The current notation in bb for type alias is camel case. We might need to rename "using VmPublicInputs = ..." to be consistent.

jeanmon · 2024-05-21T11:54:47Z

barretenberg/cpp/src/barretenberg/vm/avm_trace/avm_kernel_trace.cpp

    return result;
 }

+void AvmKernelTraceBuilder::perform_kernel_output_lookup(uint32_t write_offset, FF value, FF metadata)


value and metadata can be made "const &"

barretenberg/cpp/src/barretenberg/vm/tests/avm_kernel.test.cpp

barretenberg/cpp/src/barretenberg/vm/tests/helpers.test.cpp

Maddiaa0 added 30 commits April 4, 2024 10:57

avm_logderivative

8853132

temp: chall line up

abc3cba

fix: degree too low for lookup relations

8b88dcd

chore: rename validate trace proof to check circuit, make another met…

c8d9601

…hod for proof

chore: remove dangling code

c695905

Merge branch 'master' into md/04-03-avm_logderivative

40eeb8f

chore: further cleanup

619175b

chore: from powdr codegen

65c3159

temp

61d10f6

feat: bb support for public input columns

7e84550

Merge branch 'master' into md/04-11-feat_bb_support_for_public_input_…

c054aca

…columns

merge fixy

f38773f

chore: test structure

a4262dc

🧹

cf4bb86

use pilgen

12a9789

Merge branch 'master' into md/04-11-feat_bb_support_for_public_input_…

14d15cc

…columns

feat: example caller and address opcode

b0f8041

feat: generalise builder, move after review

d5813a2

Merge branch 'master' into md/04-12-feat_example_caller_and_address_o…

6265c61

…pcode

fix: add tests for all call context opcodes

59b221c

Merge branch 'master' into md/04-11-feat_bb_support_for_public_input_…

b613cf5

…columns

Merge branch 'md/04-11-feat_bb_support_for_public_input_columns' into…

81a2500

… md/04-12-feat_example_caller_and_address_opcode

chore: update pil comments

c934c61

fix: remove redundant comment

a8ceaf8

fix: some negative tests

af343db

temp

9388ecc

Merge branch 'master' into md/04-12-feat_example_caller_and_address_o…

9e7aad0

…pcode

chore: remove l1 gas - no longer exists

859a3df

chore: rearrange where relations live, based on review

b4a47b3

Merge branch 'master' into md/04-11-feat_bb_support_for_public_input_…

79cf849

…columns

Maddiaa0 added 2 commits May 16, 2024 14:31

fix: alter exists opcodes to all use fields

c1c4f21

fix: different in and out tags for exists opcodes

c88d5ef

This was referenced May 16, 2024

feat(avm): constrain start write offsets to be 0 #6467

Closed

chore(avm): negative tests for kernel output opcodes #6468

Open

Maddiaa0 added 4 commits May 16, 2024 15:18

feat: use pil generated public input columns

c5847b2

Merge branch 'master' into md/05-10-kernel_outputs

70a6e86

fix: dirty merge

f8e8349

fix: annotate todos

289c302

Maddiaa0 marked this pull request as ready for review May 16, 2024 19:48

Maddiaa0 requested review from jeanmon and IlyasRidhuan as code owners May 16, 2024 19:48

Maddiaa0 added 3 commits May 16, 2024 20:48

Merge branch 'master' into md/05-10-kernel_outputs

17640ef

fix: incorrect offset in emitNoteHash test

7e120ad

Merge branch 'master' into md/05-10-kernel_outputs

0dc2c58

jeanmon approved these changes May 21, 2024

View reviewed changes

Maddiaa0 added 5 commits May 23, 2024 10:47

fix: review

6b2bfd6

Merge branch 'master' into md/05-10-kernel_outputs

3e85832

fix: add more constants to constant gen

1922ccb

fix: add call_ptrs

9dca2e2

fmt

f9cf428

Maddiaa0 enabled auto-merge (squash) May 23, 2024 11:43

Maddiaa0 added 5 commits May 27, 2024 10:33

Merge branch 'master' into md/05-10-kernel_outputs

cb1f006

fix: share public inputs construction in executor

1bdb64f

fix: typo

17c216b

Merge branch 'master' into md/05-10-kernel_outputs

c714d7f

Merge branch 'master' into md/05-10-kernel_outputs

b95b838

Maddiaa0 merged commit 0281b8f into master May 28, 2024
85 checks passed

Maddiaa0 deleted the md/05-10-kernel_outputs branch May 28, 2024 09:20

AztecBot mentioned this pull request May 28, 2024

chore(master): Release 0.42.0 #6572

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(avm): kernel output opcodes #6416

feat(avm): kernel output opcodes #6416

Maddiaa0 commented May 15, 2024 •

edited

Loading

AztecBot commented May 15, 2024 •

edited

Loading

Proof generation

L2 block published to L1

L2 chain processing

Circuits stats

Tree insertion stats

Miscellaneous

jeanmon left a comment

jeanmon May 21, 2024

Maddiaa0 May 23, 2024

jeanmon May 21, 2024

jeanmon May 21, 2024

jeanmon May 21, 2024

jeanmon May 21, 2024

feat(avm): kernel output opcodes #6416

feat(avm): kernel output opcodes #6416

Conversation

Maddiaa0 commented May 15, 2024 • edited Loading

Overview

Currently Not Implemented (TODO)

AztecBot commented May 15, 2024 • edited Loading

Benchmark results

Proof generation

L2 block published to L1

L2 chain processing

Circuits stats

Tree insertion stats

Miscellaneous

jeanmon left a comment

Choose a reason for hiding this comment

jeanmon May 21, 2024

Choose a reason for hiding this comment

Maddiaa0 May 23, 2024

Choose a reason for hiding this comment

jeanmon May 21, 2024

Choose a reason for hiding this comment

jeanmon May 21, 2024

Choose a reason for hiding this comment

jeanmon May 21, 2024

Choose a reason for hiding this comment

jeanmon May 21, 2024

Choose a reason for hiding this comment

Maddiaa0 commented May 15, 2024 •

edited

Loading

AztecBot commented May 15, 2024 •

edited

Loading