Efficient Querier Pagination #6191

aaronc · 2020-05-11T19:48:24Z

Summary

It has been brought to my attention that the current approach to pagination is inefficient because it still requires the full result set to be retrieved by the querier so it will slow down under really huge result sets.

Proposal

@alpe proposed two approaches here: regen-network/cosmos-modules#63 (review) which I am quoting below

Option A

For the list query pagination we would need some additional query payload

Limit uint32 - max number of entries, defaults to 20(?) when empty

After bytes - last element id of the previous result set; not included in the new resultset; can be nil for first page; may need to by type agnostic

Result:

Data repeated Model(???) - result set

HasMore bool

Proof Proof(?)

Option B

Make it easier for clients:

Limit uint32 - max number of entries, defaults to 20(?) when empty

Cursor bytes - same idea as After but not type agnostic. Clients would never build it from the result set; can be nil for first page

Result:

Data repeated Model(???) - result set

Cursor bytes - can be nil when last page is returned

HasMore bool - required as there may only be a first page

Proof Proof(?)

I like option B a bit more now as we would drop the object ID type in the requests and let the client simple return what would be the offset for the next page. All logic, encoding is server side only.

Approach B (adding a type agnostic Cursor field) may be the better approach going forward and would be the approach I recommend as a starting point. Approach A, however, might be easier to adapt to existing query methods without changing the result types (i.e. for #6163).

I would propose this be done from the start for gRPC queriers (#5894).

I would not recommend changing existing querier methods unless clients complain of UX problems in order to not break backwards compatibility of existing clients during the migration to protobuf.

The text was updated successfully, but these errors were encountered:

jgimeno · 2020-05-12T16:35:37Z

@aaronc even type A will change the result type, since we need a way to return the after []byte type. Am I wrong?

The only issue I see maybe is the way pagination will work in CLI command line, maybe is a little bit annoying to provide the last element ID as a param.

aaronc · 2020-05-12T18:14:37Z

@aaronc even type A will change the result type, since we need a way to return the after []byte type. Am I wrong?

Well ideally that is included as the key of the last item in the result set. But the cleaner way would probably be to include it directly in the result set as Cursor or something.

jgimeno · 2020-05-13T13:52:48Z

Ok! What we do then about this, I can provide an example on how to do case B in one PR for a specific case, and if we like implement it for all, or we wait to gRPC?

aaronc · 2020-05-13T16:31:36Z

@jgimeno I think the most bang for the buck will be doing together with gRPC. I have an init PR for bank #5882. I need to update it anyway and can add it there as a starting point. I would also like to update ADR 021 to mention this. Have you read that?

jgimeno · 2020-05-14T13:51:06Z

Gonna read it. This is the PR your are using right? #5953

aaronc · 2020-05-14T13:56:39Z

Yes, that's the init PR

aaronc · 2020-05-14T17:40:22Z

Here's what I'm thinking, just take Page []byte in the request and return NextPage []byte in the response if there are more pages (nil otherwise).

Ex:

message BalancesRequest {
  bytes address = 1;
  bytes page = 2;
  uint32 limit = 3;
}

message BalancesResponse {
  repeated cosmos_sdk.v1.Coin balances = 1;
  bytes next_page = 2;
}

jgimeno · 2020-05-15T15:25:51Z

I see, and I think is a good approach, I wonder how user friendly would be this from the point of view of the command line.

Is not the same --page 1 that --page 0x1234567890ABCD but I can't come with other idea that does not need to do a traversal of the full set.

alexanderbez · 2020-05-15T15:47:59Z

One way would be to support both -- efficient and lazy (current) pagination:

message BalancesRequest {
  bytes address = 1;
  uint32 page = 2;
  bytes page_key = 3;
  uint32 limit = 4;
}

The idea is that a client can either provide page or page_key, where the former would do lazy pagination and the latter would do efficient pagination. In the presence of both, we can either error or pick the efficient path.

aaronc · 2020-05-15T16:00:24Z

Hmm... I'd rather not complicate things by supporting both options on the querier level. Most usage I imagine will actually be programmatic.

I can't see any other use case beside the CLI, and if we really want to support the simple --page 2 syntax, the client can just inefficiently iterate through result sets itself until it gets to the right page.

jgimeno · 2020-05-15T16:02:23Z

Yeah, maybe we can just accept this tradeoff.

jgimeno · 2020-05-15T16:09:04Z

Sorry, I am just thinking, but I see with the approach suggested other problem with the REST.

This approach only allows pagination in a webapp in the style of:

<- prev next ->

Since we always need at least the first call to get the next page key.

So exact paginations on the style of:

1 2 3 4 5 6 ... etc wont be possible without many tricks.

This is just some thoughts, I am not advocating against, just to bring them to the table and have them in mind.

alexanderbez · 2020-05-15T16:10:09Z

Why is this a tradeoff or complication? It's pretty trivial (if/else check). Clients that don't want to pass around a byte/hex key should not have to.

jgimeno · 2020-05-15T16:12:20Z

I think what @aaronc says is that from the point of view of the API having two types of pagination is not optimal, not because of the coding complexity but from API design.

alexanderbez · 2020-05-15T16:16:55Z

Mhhh, I respectfully disagree. Can you please explain how it's non-optimal? The client can choose to use which ever method it wants. One value is omitted over the other.

jgimeno · 2020-05-15T16:24:42Z

I understand you perfectly, with non optimal I wanted to say that is less simple than having only one way of pagination, which from a client perspective it just makes it a little bit harder but just looking at

uint32 page = 2;
bytes page_key = 3;

to understand the difference of both.

Don't get me wrong @alexanderbez! I think is not a bad idea to have both, I did not think about it and I think it helps to bring the --page 1 use case.

Whatever we decide is ok for me. :D

aaronc · 2020-05-15T16:30:00Z

Maybe it's the right thing to do, I just think it would be simpler if we could avoid it. It's added complexity for every request type and every querier method. I'll see if I can find a simpler way... but maybe there isn't.

alexanderbez · 2020-05-15T16:31:53Z

ACK whatever you guys wish to go with...

aaronc · 2020-05-15T20:07:03Z

Did a quick spike on this with for #5975

I think this approach will address both concerns:

https://github.com/cosmos/cosmos-sdk/pull/5953/files#diff-bc1bef9ecc3076635102b2d152b5449c

https://github.com/cosmos/cosmos-sdk/pull/5953/files#diff-8ca04691fa5aa1d8ede405ec840d9f21

Using page numbers can be made more efficient if the iteration only unmarshals the values that are actually going to be returned. The Paginate function linked above just skips unneeded results until it gets to the requested page.

alpe · 2020-05-19T13:11:51Z

IMHO we should focus on one way to query and keep the api simple and self explaining. Either cursor or pages. The cursor is more efficient on the server side as we would not have to read and skip data. It is also more correct with data being inserted while iterating. An insert before the cursor would not have any effect on the result set and after the cursor would just be appended.
On the other hand pages was used before and people would not be forced to a new UX.

Maybe we can learn more about the common client use cases.

aaronc · 2020-06-10T14:21:25Z

So I'd like us to come to a decision on this either way.

My current proposal as represented in #6353 (specifically https://github.com/cosmos/cosmos-sdk/pull/6353/files#diff-bc1bef9ecc3076635102b2d152b5449c) is to handle both keys as @alpe has suggested or standard limit/offset. I don't think it's that complex to handle both and I think it would account for both:

users that want the most efficient pagination, and
users that want to show search paginated search results on a web page like google.

I understand @alpe's concerns about just wanting to support one approach, but at least on the server side we can abstract over the complexity with helper functions. In #6353, I have a helper that works over PrefixStores that should work for most of the cases we have in the SDK.

If we had to choose between one of those approaches, I would advocate for @alpe's more efficient key approach. But if we don't want to have the opinionated either/or battle, I present #6353 as a middle ground.

A few details on the approach in #6353:

Each request type has a PageRequest page field and each response type has a PageResponse page field
Either key or offset can be used together with limit. I am choosing offset rather than "page number" because limit/offset is the more common terminology in general.
When offset is used, count_total can also be set and this will cause the PageResponse to include the total number of results (for web pages that want to show the total number of results and total number of pages).
PageResponse always returns next_key so that clients can switch to the more efficient key method at any point.

Whether we choose this approach or settle on supporting just one of offset/pagenum or key, I'd prefer to settle on something rather quickly because other than this, gRPC module queriers are unblocked. If we decide on something, I think #6353 can be completed rather quickly and then each gRPC querier will get pagination from the start.

anilcse · 2020-06-10T18:28:09Z

My current proposal as represented in #6353 (specifically https://github.com/cosmos/cosmos-sdk/pull/6353/files#diff-bc1bef9ecc3076635102b2d152b5449c) is to handle both keys as @alpe has suggested or standard limit/offset. I don't think it's that complex to handle both and I think it would account for both:

+1

#6353 seems to be a better approach considering all these points. Querier is blocked on this and we should move quickly. I would advocate to support both for now and handle the complexity with helper function on server side.

We are planning to resume #6353 and start working on querier stuff.

aaronc · 2020-06-10T18:42:50Z

Any other opinions here? @alexanderbez @jgimeno ?

It we're fine with the dual-approach #6353, it would be great to green-light it and get it moving.

alexanderbez · 2020-06-10T18:50:04Z

My opinion still stands from my previous comment -- as long as we can support both methods 👍

aaronc · 2020-06-10T18:57:10Z

Okay great, seems like we're on the same page. Let's move forward with the dual approach in #6353 and we can discuss any details in that PR.

anilcse · 2020-06-17T08:15:19Z

@aaronc we might need to use page instead of offset. Otherwise the default behaviour to use key over offset (when both are not set) is not possible as offset's default value is 0 and is still a valid offset value. We should not use key by default if the offset is 0

One workaround can be to use end value as offset + limit - 1 inside paginate (https://github.com/cosmos/cosmos-sdk/pull/6452/files#diff-8ca04691fa5aa1d8ede405ec840d9f21R58), but that doesn't look like a standard approach. Any thoughts?

aaronc · 2020-06-17T14:13:38Z

I don’t understand what the problem is. If offset and key are both unset the behavior should be identical. If key is set and offset is zero we use key. There is no ambiguity. If both offset and key are set there should be a validation error.

anilcse · 2020-06-17T16:18:07Z

Okay, as discussed over chat, we'll be using offset=1 for first page records

Edit:
Sorry about this. It's just a change to the end value when using offset

anilcse · 2020-06-17T16:48:30Z

@aaronc @alexanderbez @clevinson I suggest we enforce a maxLimit for pagination. If the limit is not set, we will use the maxLimit. Also, we allow users to set limit only under this maxLimit.

We can have a hard coded value of 25 for maxLimit

Any thouthts?

alexanderbez · 2020-06-17T17:16:05Z

I'm in favor of a max limit, although idk if 25 is the ideal limit. Why not just keep it at 100?

aaronc · 2020-06-17T17:28:06Z

100 makes sense to me

anilcse · 2020-06-17T17:29:37Z

Sure, we'll keep the max limit to 100

anilcse · 2020-06-25T07:55:51Z

@aaronc how do we apply this for custom filtered querying. Lets say for proposals we have proposer, status and depositor filers. If we use the current paginate, we would get results <= limit though we (might) have more results.

I think we can fix this by returning miss or hit/found, a bool value from onResult().
https://github.com/cosmos/cosmos-sdk/pull/6452/files#diff-8ca04691fa5aa1d8ede405ec840d9f21R49
The counter will increase only if it is a hit

But we'll have a problem with offset and it can't be page_number * limit for clients, we might need to add nextOffset but that will look bad.

/cc @alexanderbez @jgimeno @clevinson @alpe

aaronc · 2020-07-01T17:33:31Z

Addressed in #6452 and #6514

aaronc added the S:proposed label May 11, 2020

aaronc mentioned this issue May 11, 2020

add pagination to x/staking queries #6163

Merged

aaronc added T: Performance Performance improvements T: Client UX REST labels May 11, 2020

aaronc mentioned this issue May 15, 2020

Query Protobuf Migration #5921

Closed

43 tasks

This was referenced Jun 4, 2020

Add basic x/bank gRPC query service #6343

Merged

Add pagination types and helper func #6353

Closed

clevinson added this to the v0.39 milestone Jun 11, 2020

sahith-narahari mentioned this issue Jun 16, 2020

Add pagination #6452

Merged

8 tasks

anilcse mentioned this issue Jun 25, 2020

Add filtered pagination #6514

Merged

8 tasks

aaronc closed this as completed Jul 1, 2020

Efficient Querier Pagination #6191

Efficient Querier Pagination #6191

Comments

aaronc commented May 11, 2020 • edited Loading

Summary

Proposal

Option A

Result:

Option B

Result:

jgimeno commented May 12, 2020

aaronc commented May 12, 2020

jgimeno commented May 13, 2020

aaronc commented May 13, 2020 • edited Loading

jgimeno commented May 14, 2020 • edited Loading

aaronc commented May 14, 2020

aaronc commented May 14, 2020

jgimeno commented May 15, 2020

alexanderbez commented May 15, 2020 • edited Loading

aaronc commented May 15, 2020

jgimeno commented May 15, 2020

jgimeno commented May 15, 2020 • edited Loading

alexanderbez commented May 15, 2020

jgimeno commented May 15, 2020

alexanderbez commented May 15, 2020

jgimeno commented May 15, 2020

aaronc commented May 15, 2020

alexanderbez commented May 15, 2020

aaronc commented May 15, 2020

alpe commented May 19, 2020

aaronc commented Jun 10, 2020 • edited Loading

anilcse commented Jun 10, 2020

aaronc commented Jun 10, 2020 • edited Loading

alexanderbez commented Jun 10, 2020

aaronc commented Jun 10, 2020

anilcse commented Jun 17, 2020 • edited Loading

aaronc commented Jun 17, 2020 • edited Loading

anilcse commented Jun 17, 2020 • edited Loading

anilcse commented Jun 17, 2020

alexanderbez commented Jun 17, 2020 • edited Loading

aaronc commented Jun 17, 2020

anilcse commented Jun 17, 2020

anilcse commented Jun 25, 2020 • edited Loading

aaronc commented Jul 1, 2020

aaronc commented May 11, 2020 •

edited

Loading

aaronc commented May 13, 2020 •

edited

Loading

jgimeno commented May 14, 2020 •

edited

Loading

alexanderbez commented May 15, 2020 •

edited

Loading

jgimeno commented May 15, 2020 •

edited

Loading

aaronc commented Jun 10, 2020 •

edited

Loading

aaronc commented Jun 10, 2020 •

edited

Loading

anilcse commented Jun 17, 2020 •

edited

Loading

aaronc commented Jun 17, 2020 •

edited

Loading

anilcse commented Jun 17, 2020 •

edited

Loading

alexanderbez commented Jun 17, 2020 •

edited

Loading

anilcse commented Jun 25, 2020 •

edited

Loading