*: test command v1.2 update #451
Conversation
Sorry for the long wait and long review :)
- First off, I have to address moving this command out of `alpha`. I think it sets us up for a problem. Either we: 1) make no breaking changes to these commands from here on out (e.g. changing flag names or defaults); 2) keep making breaking changes to flags etc. and respect semantic versioning, incrementing the major version every time we do so, which hampers our velocity; or 3) keep making minor breaking changes to these commands without respecting SemVer, shipping them in minor releases. That last option, I believe, would damage our reputation as a dev team, particularly if we break someone's app/DV/script because they blindly bumped to a new minor version and charon failed to start because it was passed a now-unknown flag. None of these three options sounds like a strategically good decision for us, given we aim to iterate on and improve this feature quickly. What do you think we should do: pick one of these options, or something else?
- I think on the run/prepare folder page, we should consider pasting sample output for each command. When someone is just browsing, it would help them understand the commands quickly without running them. If that made the page too long, it might be the moment to break each section into tabs: `Run with bash`, `Run with docker`, `Example Output`. Wdyt? (This could also go on the troubleshooting page, if we embed URIs in the console output so people land on the correct anchor tag to view their results and how to 'troubleshoot' them, i.e. compare them to what we imply a decent result looks like. That might be better than putting it in this guide? idk)
- I don't love that in these test commands we have `--endpoints`, `--beacon-endpoints`, `--beacon-node-endpoint`, and `--mev-beacon-node-endpoint`, all meaning one or more BN URLs, while on `charon run` we use `--beacon-node-endpoints`. Imo that's too many varieties. I think the prefixing is an interesting way to organise flags, but the fewer varieties of these flags, the fewer annoying user issues from people assuming the wrong flag name. Let's see how few we can cut it down to; imo I would try to get as close as possible to just `--beacon-node-endpoints` for all of them (see the first sketch at the end of this comment).
- We should also add these commands to the charon CLI reference page.
- For the MEV test:
  - When I attempted the load test first, it hung on my machine (I think it might have been due to the .io relay). Now, 4 of the 6 defaults fail on the `createmultipleblocks` test; it took 4m11s.
  - Do we need 6 digits of precision for milliseconds? Tbh as a human I probably need 0 digits beyond milliseconds; the variance is probably much larger than tenths or hundredths of a ms. Maybe the JSON could stay more precise. Fewer digits would make the output easier to read.
  - I fear a little how people might perceive the naming "mev relay load test"; it might sound a bit DoS-y. More thoughts on how we structure this command follow.
  - We should allow you to run the mev test the v1.1.2 way even if you don't have a BN handy (that, or we put an Obol BN / public default BN in there, but I remember we've discussed that we don't want to do that). I think many people don't have a BN easily available, and I don't want to hinder UX too much in that case; the simple test is likely 'good enough'.
  - What if we adjusted this command so that if you optionally pass the BN flag, we do the better test, and we don't call it a load test? E.g.: `--beacon-node-endpoints` If included, allows this command to ask relays for real MEV bids rather than simple status checks. Improves the realism of the test; increases the duration to a number of minutes.
  - Relatedly, asking for multiple blocks might smooth the result, but I think you said it increases test time a lot, because you need real matches between relays and proposers to get bids, and some are rare. I would be open to just 'load testing' by default if we're given a BN endpoint, but only asking for a single block, not the 3-block test; then, optionally, people could use --load-test-blocks to get longer, smoother data if they want to run the command for a few hours or something. (Imo I'd rename it to something like `--number-of-payloads` Increases the accuracy of the test by asking for multiple payloads. Increases test duration. (Default: 1).) See the second sketch at the end of this comment.
- Not in this PR, but so I don't forget, I want to suggest a tweak to the description of the `test all` command on the charon side:
  > Run test suites to evaluate your DV peers, your beacon node(s), your validator client, your MEV relays, your machine's performance and internet connectivity. Verify that Charon can validate effectively on the tested setup.
Happy to chat through this feedback synchronously if you want!
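
To make the flag-unification bullet concrete, here's a rough sketch of the surface I'd aim for. This is a proposal only; apart from `charon run --beacon-node-endpoints`, the flag spellings below are illustrative, not the current CLI:

```bash
# Proposal sketch: one plural flag name everywhere, matching charon run.
charon run \
  --beacon-node-endpoints="http://localhost:5052"

charon test all \
  --beacon-node-endpoints="http://localhost:5052,http://localhost:5053" \
  --mev-endpoints="https://boost-relay.flashbots.net"
```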
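
And for the MEV bullet, a sketch of the proposed optional-BN behaviour; `--number-of-payloads` is the hypothetical name suggested above, and the other flags are likewise illustrative:

```bash
# Proposal sketch. Without a beacon node, fall back to the simple
# (v1.1.2-style) relay connectivity checks:
charon test mev \
  --mev-endpoints="https://boost-relay.flashbots.net"

# With a beacon node, ask relays for real bids; the hypothetical
# --number-of-payloads (default 1) trades longer duration for smoother results:
charon test mev \
  --mev-endpoints="https://boost-relay.flashbots.net" \
  --beacon-node-endpoints="http://localhost:5052" \
  --number-of-payloads=3
```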
docs/run/prepare/test-command.md
--beacon-endpoints="https://ethereum-holesky-beacon-api.publicnode.com,https://ethereum-sepolia-beacon-api.publicnode.com" \
--beacon-load-test \
--mev-endpoints="https://0xac6e77dfe25ecd6110b8e780608cce0dab71fdd5ebea22a16c0205200f2f8e2e3ad3b71d3499c54ad14d6c21b41a37ae@boost-relay.flashbots.net" \
--mev-beacon-node-endpoint="https://ethereum-beacon-api.publicnode.com" \
Suggested change:
- --mev-beacon-node-endpoint="https://ethereum-beacon-api.publicnode.com" \
+ --mev-beacon-node-endpoints="https://ethereum-beacon-api.publicnode.com" \
Is this to get the proposer schedule, to ask the relay for a block? Can we just use the `--beacon-endpoints` flag if it is present, and do the older version of the mev test if we are given none at all?
> Is this to get the proposer schedule to ask the relay for a block?
It is used for two things:
- Getting the latest block, so we can use its hash when we query the MEV relay (the relay requires it).
- Fetching the proposers for an epoch, so we know which validators to query the MEV relay for.

Roughly, the two calls look like the sketch below.
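
A minimal illustration of those two calls using the standard Beacon API and the relay Builder API via curl. This is not charon's actual code; the beacon node URL and epoch are placeholders for the sketch:

```bash
BN="http://localhost:5052"   # placeholder beacon node URL

# 1) Latest block: its execution payload block hash is the parent_hash the relay expects.
PARENT_HASH=$(curl -s "$BN/eth/v2/beacon/blocks/head" \
  | jq -r '.data.message.body.execution_payload.block_hash')
echo "parent_hash: $PARENT_HASH"

# 2) Proposer duties for an epoch: which validator (pubkey) proposes in which slot.
EPOCH=123456   # placeholder epoch
curl -s "$BN/eth/v1/validator/duties/proposer/$EPOCH" | jq '.data[0]'

# With a slot and pubkey from those duties, the relay bid request is then:
#   GET <relay>/eth/v1/builder/header/{slot}/{parent_hash}/{pubkey}
```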
> can we just use the --beacon-endpoints flag if present

Yes, we can use it, and I assume you also want it for `test mev` and not only for `test all`. However, it will cause:
- Users being unable to test rare MEV with `test all`, as it will always get stuck finding a block.
- Misleading users, as we will make absolutely no use of multiple beacon nodes.
> Users being unable to test rare MEV with test all as it will always get stuck finding a block.

Hmmm, because we would try the 'good mev test' and thus not test the rare relays? I guess that's a reason not to join them, but I would also consider a flag on `test all` like `--mev-quick-test` or something to do the non-real way. I can't say that's necessarily easier than omitting a different flag, though.
> Misleading users as we will make absolutely no use of multiple beacon nodes.

If we do add `s`s to the `beacon-node-endpoint` flags in these places, we can create an issue to throw in the backlog about trying all beacon nodes with a pick list, like @gsora developed for the --fallback-beacon-nodes feature. Then, if someone takes deep offence at the misleading name, we can point to the issue and say sorry, WIP?
> If we do add `s`s to the `beacon-node-endpoint` flags in these places, we can create an issue to throw in the backlog about trying all beacon nodes with a pick list, like @gsora developed for the --fallback-beacon-nodes feature. Then, if someone takes deep offence at the misleading name, we can point to the issue and say sorry, WIP?

Okay, but... in the MEV tests we don't test or care about the BN's performance. How fast it delivers the payloads we ask for doesn't change the outcome of the MEV test at all; it tests purely the relay. And that is on purpose, to remove the bias that the BN might add. That said, I'm expecting any healthy BN with average performance to be able to answer the once-per-slot and once-per-epoch requests I send to it.
Made some suggestions; happy enough to meet you in the middle on the naming of the flags. Suggest we add back in the `alpha` if that's been changed on the charon code side at this point. :)
I've rebased the PR on main.
Summary
In version v1.2 we are improving the test command by a lot. This PR adds that to the docs.
Details
run/prepare/test-command.md
- Add the `charon test all` command
- Rename `performance` to `infra`
adv/troubleshooting/test-command.md
- New section that explains the reason for each test failing. A link to it will be added at the end of running `charon test <TEST>`.