feat: reject multiple setup paths #351

daejunpark · 2024-08-23T05:43:32Z

currently, if multiple setup paths exist, one of them is randomly selected and used for testing. since the existence of multiple setup paths are likely due to an incorrect setup(), this behavior makes debugging more challenging. to address this, now it will fail immediately when multiple setup paths are detected.

to be merged after #350

karmacoma-eth · 2024-08-23T22:46:44Z

tests/regression/test/Context.t.sol

+        // vm.assume(mode0 < 9);
+        // NOTE: explicitly branch over mode0, as an infeasible path with mode0 >= 9 may not be eliminated due to an extremely inefficient solver environment (e.g, github workflow)
+        mode0 = split_mode_up_to_9(mode0);
+
        _check_call0(mode0);


wait what? I don't understand what's going on here

this is not related to this pr. it's a temporary fix to a flaky test that i found.

details of the flaky test: when mode0 == 9, _check_call0() fails due to HalmosException (which is intentional). however, the goal of this test is to consider only success paths, so the assumption mode0 < 9 is provided at the beginning. however, the HalmosException path may still appear after path exploration, if the branching solver couldn't solve its infeasible path condition. this won't happen under normal circumstances, as the path condition mode0 < 9 && mode0 == 9 is obviously unsat. but in the github runner, the solver may timeout, leading to a test failure, due to the HalmosException path, even if it is infeasible. a proper fix is more involved, and will be implemented later.

got it, thank you

karmacoma-eth

I agree with the general direction, however:

we still have quite a few symbolic sources like balances, right? we might want to tighten these as much as possible
do we want this to be a hard break or more of a nudge? i.e. maybe allow a bypass with a CLI flag?

daejunpark · 2024-08-24T00:27:30Z

we still have quite a few symbolic sources like balances, right? we might want to tighten these as much as possible

balance is addressed in #352

do we want this to be a hard break or more of a nudge? i.e. maybe allow a bypass with a CLI flag?

we could add a bypass flag, or make this optional (e.g., --strict), later if this turns out to cause test failures too often.

daejunpark added 4 commits August 22, 2024 21:35

deprecated --bytecode

9f3e119

deprecated --reset-bytecode

a52ed73

feat: do not allow multiple setup paths

a7a2aac

test: fix flakiness of context tests for ci

b9df6fe

daejunpark requested a review from karmacoma-eth August 23, 2024 05:50

daejunpark mentioned this pull request Aug 23, 2024

halmos v2 planning #346

Open

17 tasks

karmacoma-eth reviewed Aug 23, 2024

View reviewed changes

karmacoma-eth approved these changes Aug 23, 2024

View reviewed changes

Merge branch 'main' into feat/fail-multiple-setups

125f22a

daejunpark merged commit 23f2140 into main Aug 24, 2024
57 checks passed

daejunpark deleted the feat/fail-multiple-setups branch August 24, 2024 00:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: reject multiple setup paths #351

feat: reject multiple setup paths #351

daejunpark commented Aug 23, 2024 •

edited

Loading

karmacoma-eth Aug 23, 2024

daejunpark Aug 24, 2024 •

edited

Loading

karmacoma-eth Aug 24, 2024

karmacoma-eth left a comment

daejunpark commented Aug 24, 2024

feat: reject multiple setup paths #351

feat: reject multiple setup paths #351

Conversation

daejunpark commented Aug 23, 2024 • edited Loading

karmacoma-eth Aug 23, 2024

Choose a reason for hiding this comment

daejunpark Aug 24, 2024 • edited Loading

Choose a reason for hiding this comment

karmacoma-eth Aug 24, 2024

Choose a reason for hiding this comment

karmacoma-eth left a comment

Choose a reason for hiding this comment

daejunpark commented Aug 24, 2024

daejunpark commented Aug 23, 2024 •

edited

Loading

daejunpark Aug 24, 2024 •

edited

Loading