exp/lighthorizon: Create pubnet indices for the MVP endpoints. #4475

Closed
Tracked by #4571
Shaptic opened this issue Jul 21, 2022 · 4 comments
Shaptic (Contributor) commented Jul 21, 2022

Parent epic: #4317


Our target endpoint is:

GET /accounts/:id/transactions

So we need an index for each account that stores information about which ledgers the account was active in. Before we actually do this, there's an open question to answer: What kind of indices do we make?

  1. Should they be checkpoint-based or ledger-based? There are trade-offs either way.
  2. Or should we just create both? Then we can compare both the index sizes and the latency differences.
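
To make the trade-off concrete, here is a rough sketch of the two granularities. It is not the lighthorizon index format: `AccountIndex`, `checkpointOf`, and the plain `ledger / 64` bucketing are hypothetical simplifications, and the 64-ledger checkpoint interval is assumed from how Stellar history archives are laid out.

```go
package main

import "fmt"

// ledgersPerCheckpoint is an assumption: history archives publish a
// checkpoint every 64 ledgers.
const ledgersPerCheckpoint = 64

// AccountIndex is a hypothetical sparse bitmap for one account: bit i is
// set when the account was active in unit i (a ledger or a checkpoint).
type AccountIndex struct {
	words map[uint32]uint64 // word number -> 64 activity bits
}

func NewAccountIndex() *AccountIndex {
	return &AccountIndex{words: map[uint32]uint64{}}
}

// Set marks unit i as active.
func (ix *AccountIndex) Set(i uint32) {
	ix.words[i/64] |= uint64(1) << (i % 64)
}

// Has reports whether unit i is marked active.
func (ix *AccountIndex) Has(i uint32) bool {
	return ix.words[i/64]&(uint64(1)<<(i%64)) != 0
}

// Words returns the number of allocated 64-bit words, a crude size proxy.
func (ix *AccountIndex) Words() int { return len(ix.words) }

// checkpointOf maps a ledger sequence to a checkpoint bucket (simplified).
func checkpointOf(ledger uint32) uint32 { return ledger / ledgersPerCheckpoint }

func main() {
	active := []uint32{100, 101, 130, 5000} // made-up ledgers for one account
	byLedger, byCheckpoint := NewAccountIndex(), NewAccountIndex()
	for _, l := range active {
		byLedger.Set(l)
		byCheckpoint.Set(checkpointOf(l))
	}
	// Ledger 102 had no activity, but it shares a checkpoint with 100 and 101,
	// so the checkpoint index still sends a reader to that checkpoint.
	fmt.Println("ledger index has 102:", byLedger.Has(102))
	fmt.Println("checkpoint index has 102's checkpoint:", byCheckpoint.Has(checkpointOf(102)))
	fmt.Println("words used:", byLedger.Words(), "vs", byCheckpoint.Words())
}
```

The checkpoint-based index is smaller but coarser: a lookup then has to scan up to 64 ledgers per hit, which is the size-versus-latency comparison the second option would measure.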

The index builder is ready for parallel construction, but it might not be perfectly ready for AWS Batch. There is some preliminary work to get both steps (map and reduce) "dockerized" and running on Batch.
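
For readers unfamiliar with the two steps, a minimal sketch of the map/reduce shape follows. It is an illustration only, not the actual index builder: `Activity`, `PartialIndex`, `mapChunk`, and `reduce` are hypothetical names, and the account IDs are placeholders.

```go
package main

import (
	"fmt"
	"sort"
)

// Activity is a hypothetical observation: an account was active in a checkpoint.
type Activity struct {
	Account    string
	Checkpoint uint32
}

// PartialIndex maps an account to the set of checkpoints it was active in.
type PartialIndex map[string]map[uint32]struct{}

// mapChunk is the "map" step: one worker indexes only its own chunk of history.
func mapChunk(chunk []Activity) PartialIndex {
	out := PartialIndex{}
	for _, a := range chunk {
		if out[a.Account] == nil {
			out[a.Account] = map[uint32]struct{}{}
		}
		out[a.Account][a.Checkpoint] = struct{}{}
	}
	return out
}

// reduce is the "reduce" step: merge the workers' partial indices per account.
func reduce(parts ...PartialIndex) PartialIndex {
	merged := PartialIndex{}
	for _, p := range parts {
		for account, set := range p {
			if merged[account] == nil {
				merged[account] = map[uint32]struct{}{}
			}
			for cp := range set {
				merged[account][cp] = struct{}{}
			}
		}
	}
	return merged
}

// checkpoints returns an account's checkpoints in ascending order.
func checkpoints(ix PartialIndex, account string) []uint32 {
	out := make([]uint32, 0, len(ix[account]))
	for cp := range ix[account] {
		out = append(out, cp)
	}
	sort.Slice(out, func(i, j int) bool { return out[i] < out[j] })
	return out
}

func main() {
	a := mapChunk([]Activity{{"alice", 10}, {"bob", 10}})
	b := mapChunk([]Activity{{"alice", 11}, {"alice", 12}})
	fmt.Println(checkpoints(reduce(a, b), "alice"))
}
```

Because each map worker touches a disjoint chunk, the map step parallelizes trivially; the reduce step is where partial results for the same account have to be brought together.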

Shaptic changed the title from "Create pubnet indices for our target endpoints." to "exp/lighthorizon: Create pubnet indices for the MVP endpoints." on Jul 21, 2022
Shaptic mentioned this issue Jul 21, 2022 (7 tasks)
sreuland (Contributor) commented Aug 12, 2022

I'm helping by taking over the last step here: running a ledger range in the cloud to confirm that the indexes get built. Initially I'll build 07/01 - 07/31 to confirm.

sreuland self-assigned this Aug 12, 2022
sreuland added a commit to sreuland/go that referenced this issue Aug 13, 2022
sreuland (Contributor) commented

@Shaptic, I'm looking into a k8s Job as a short-term alternative to AWS Batch for the batch processing. The cluster needs some adjustment to support JOB_COMPLETION_INDEX, per https://github.com/stellar/ops/issues/1790

sreuland added a commit to sreuland/go that referenced this issue Aug 16, 2022
sreuland added a commit to sreuland/go that referenced this issue Aug 16, 2022
sreuland added a commit to sreuland/go that referenced this issue Aug 17, 2022
sreuland added a commit to sreuland/go that referenced this issue Aug 28, 2022
sreuland added a commit to sreuland/go that referenced this issue Aug 30, 2022
sreuland (Contributor) commented

Status update: the map/reduce jobs are running on k8s, and I've identified slow performance in the reduce jobs when they run against the S3 index. Next step: add logging to reduce that captures the time spent in the sections making S3 calls, to identify I/O rates, then determine the next optimizations for reduce based on the findings.

sreuland (Contributor) commented

After discussion with @Shaptic, the partial index data loaded onto S3 will suffice for the MVP criteria. I'm therefore moving this ticket to done and have carved out a new ticket, #4566, for follow-up on reduce performance.
