
Horizon Prerequisites Benchmarking #4960

Open
urvisavla opened this issue Jul 13, 2023 · 7 comments

Comments

@urvisavla
Contributor

What problem does your feature solve?

We want to update the Horizon prerequisite documentation with the minimum hardware requirements for running Horizon. To do this, we need to perform benchmarking and testing similar to what we've conducted previously.

What would you like to see?

Determine the minimum specifications required for the Horizon compute instance and the Postgres database instance, with a focus on measuring the memory, CPU, disk space, and IOPS requirements for both components.
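One way to sample per-process memory during these benchmarks (a minimal sketch, assuming a Linux host where `/proc` is readable; the process-name lookup is hypothetical, not part of any existing tooling):

```python
# Minimal sketch: sample a process's resident memory on Linux by reading
# VmRSS from /proc/<pid>/status. Process lookup by name is hypothetical.
import os

def rss_mb(pid):
    """Return the resident set size of `pid` in MiB (Linux /proc only)."""
    with open(f"/proc/{pid}/status") as f:
        for line in f:
            if line.startswith("VmRSS:"):
                return int(line.split()[1]) / 1024  # reported in kB
    return 0.0

def find_pid(name):
    """Return the first PID whose command name matches `name` (e.g. "horizon")."""
    for entry in os.listdir("/proc"):
        if entry.isdigit():
            try:
                with open(f"/proc/{entry}/comm") as f:
                    if f.read().strip() == name:
                        return int(entry)
            except OSError:
                pass  # process exited between listdir and open
    return None

if __name__ == "__main__":
    print(f"self RSS: {rss_mb(os.getpid()):.1f} MiB")
```

Sampling this in a loop alongside `iostat`-style disk metrics would cover the memory side of the measurements above.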

What alternatives are there?

@mollykarcher
Contributor

Some unknowns to investigate here. Notably, how much API load we assume users will have. We could potentially use the existing goreplay setup, filtered/reduced depending on what we decide.

@sreuland
Contributor

@urvisavla , during verification of compute resources, I wanted to mention that it should include ENABLE_CAPTIVE_CORE=true and CAPTIVE_CORE_USE_DB=true; I think those are the defaults at this point. Captive core with on-disk db usage dramatically lowers the amount of RAM used by captive core: the current prerequisites in the docs mention 32GB of RAM required, but with on-disk usage that should be well under 8GB in almost all cases, if not lower - #4092 (comment)
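For reference, the settings mentioned above as environment variables (values per this comment; verify against the deployed Horizon version):

```shell
# Horizon captive core configuration referenced above.
# Reportedly the defaults at this point.
export ENABLE_CAPTIVE_CORE=true
export CAPTIVE_CORE_USE_DB=true   # keep captive core state on disk instead of in RAM
```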

@urvisavla
Contributor Author

> @urvisavla , during verification of compute resources, wanted to mention it should include ENABLE_CAPTIVE_CORE=true and CAPTIVE_CORE_USE_DB=true, I think those are the defaults at this point. Since, captive core with disk db usage will dramatically lower the amount of RAM used by captive, current pre-reqs in docs mention 32GB of ram required, but with on-disk usage, that should be well under 8GB in almost all cases if not lower - #4092 (comment)

@sreuland
We observed RAM usage on the ingestion instance (dev cluster) to remain below 8GB, usually hovering around 6GB. However, during state verification, the RAM usage spikes to 11GB and remains at that level for the entire duration of state verification. Meanwhile, the memory usage on the API instance (prod cluster) remains consistently below 3GB. I believe the main contributor to memory usage is the in-memory graph for path payments.

Considering these observations, and given that our recommendations are for an instance serving all functions (API + ingestion), 16GB of RAM should be adequate. I will update our documentation to reflect this recommendation.
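Related note: for operators who don't need the path-finding endpoints, Horizon has a flag to skip the in-memory order book graph, which should reduce that footprint (flag name per the Horizon operator docs; worth double-checking against the deployed version):

```shell
# Optional operator tweak: skip the in-memory path-finding graph
# if the /paths endpoints are not needed. Verify the flag name
# against your Horizon version before relying on it.
export DISABLE_PATH_FINDING=true
```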

@urvisavla
Contributor Author

Update:

  • Shared a document with the team detailing observations from EC2 and RDS instances in the dev and prod clusters.

  • Updated the hardware specifications, including CPU, memory, and disk, in our public docs within the partner-experience branch (to be merged to the main branch soon).

  • Unfortunately, couldn't obtain hardware benchmarks for running an API instance because there is no API traffic in either the staging or dev clusters.

  • Explored options like the goreplay tool for mirroring traffic, but it proved infeasible #2461.

  • Next steps:
    Explore developing a custom tool to simulate requests from prod (using logs from AWS) and replay them on the dev cluster. For that we'd want to use instances with specifications similar to what we plan to recommend in our public docs. Created an ops request for provisioning new instances.
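A rough sketch of what that custom replay tool could look like (hypothetical throughout: the log layout is assumed to be AWS ALB access logs, and the dev-cluster base URL is a placeholder):

```python
# Hypothetical replay-tool sketch: parse request paths out of AWS ALB access
# logs and re-issue them as GETs against a dev Horizon instance.
# The log field layout and the dev base URL below are assumptions.
import re
import urllib.request

# ALB logs quote the request as: "GET https://host:port/path?query HTTP/x.y"
REQUEST_RE = re.compile(r'"GET \S+?://[^/\s]+(/[^ "]*) HTTP')

def extract_paths(log_lines):
    """Return the path+query of every GET request found in the log lines."""
    return [m.group(1) for line in log_lines
            if (m := REQUEST_RE.search(line))]

def replay(paths, base_url):
    """Re-issue each captured GET against `base_url` (the dev cluster)."""
    for path in paths:
        try:
            with urllib.request.urlopen(base_url + path, timeout=10) as resp:
                print(resp.status, path)
        except Exception as exc:
            print("ERR", path, exc)

if __name__ == "__main__":
    sample = ('https 2023-09-01T00:00:00.000000Z app/dev-alb/abc '
              '"GET https://horizon.stellar.org:443/ledgers?limit=10 HTTP/2.0"')
    print(extract_paths([sample]))
    # replay(extract_paths([sample]), "http://horizon-dev.example.internal:8000")
```

Only GETs are replayed, since re-issuing non-idempotent requests against any environment would be unsafe.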

@sreuland
Contributor

> Created https://github.com/stellar/ops/issues/2536 request for provisioning new instances.

Hello @urvisavla , I left a comment for consideration of using k8s for provisioning the new instances rather than EC2:
https://github.com/stellar/ops/issues/2536#issuecomment-1728517511

@sreuland
Contributor

sreuland commented Oct 2, 2023

@urvisavla , you mentioned a performance benchmarks doc was shared, can it be linked or summarized here also? Thanks!

@urvisavla
Contributor Author

> @urvisavla , you mentioned a performance benchmarks doc was shared, can it be linked or summ'd here also? Thanks!

Sorry, I missed this earlier. Here is the doc.
