docs: Spec on current cachekv implementation #13977

dangush · 2022-11-22T20:04:22Z

Description

Contributes to: #12986

Adds documentation of the current CacheKVStore implementation, as per phase 1 of the plan outlined in #12986 to improve the SDK's storage layer.

Author Checklist

All items are required. Please add a note to the item if the item is not applicable and
please add links to any relevant follow up issues.

I have...

included the correct docs: prefix in the PR title
targeted the correct branch (see PR Targeting)
provided a link to the relevant issue or specification
followed the documentation writing guidelines
reviewed "Files changed" and left comments if necessary
confirmed all CI checks have passed

Reviewers Checklist

All items are required. Please add a note if the item is not applicable and please add
your handle next to the items reviewed if you only reviewed selected items.

I have...

confirmed the correct docs: prefix in the PR title
confirmed all author checklist items have been addressed
confirmed that this PR only changes documentation
reviewed content for consistency
reviewed content for thoroughness
reviewed content for spelling and grammar
tested instructions (if applicable)

tac0turtle

Thank you for this document!!

In a follow-up PR, we should document the concurrency assumptions of the cachekv store. Currently, it seems that both the memdb it uses and the cachekv store itself hold a mutex, which could lead to unforeseen issues.

thpani

Hi @dangush!

Cool work, especially on describing the iterator! CacheKV is probably the most complex part of the store module.

I've left some comments below, where I think the current explanation can be improved.

In general, @angbrav and I have also been diving into the store module and started writing down our understanding. If you intend to add more content, it would be good to sync!

thpani · 2022-11-24T17:57:08Z

store/cachekv/README.md

+* Allow iteration over contiguous spans of keys
+* Act as a cache, so we don't repeat I/O to disk for reads we've already done
+  * Note: We actually fail to achieve this for iteration right now
+  * Note: Need to consider this getting too large and dropping some cached reads


Could you explain what you mean here? In contrast to the inter-block cache, there is no upper bound on the cache size in a CacheKV.

I believe @ValarDragon (who wrote this part) is referring to runtime issues that can be mitigated by bounding the cache. For example, the cost of iterating over a key range of any size is currently tied to the overall size of the cache. Ideally, iteration would run in time proportional to the size of the range, but bounding the cache size may also be needed, or is at least worth considering.

Hmmm, but the current use of CacheKV to scope transactions in memory until writing back to the underlying IAVL doesn't really allow for bounding cache size, right?

Not that I'm aware of, no.
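
To make the point about bounding concrete, here is a minimal sketch of a write-back cache over a parent store. It is an illustration under stated assumptions, not the SDK's implementation: the `store` type, its fields, and `newStore` are hypothetical names, and plain maps stand in for the real stores. Buffered writes (the `dirty` keys) have to stay in memory until `Write` is called, so only clean cached reads could ever be dropped early.

```go
package main

import "fmt"

// store is a hypothetical write-back cache layered over a parent map.
// It is a simplified stand-in for illustration, not the SDK's CacheKVStore.
type store struct {
	parent map[string]string
	cache  map[string]string // cached reads and buffered writes, no size limit
	dirty  map[string]bool   // keys that must be written back to the parent
}

func newStore(parent map[string]string) *store {
	return &store{
		parent: parent,
		cache:  map[string]string{},
		dirty:  map[string]bool{},
	}
}

// Get serves from the cache first and caches any value read from the parent.
func (s *store) Get(k string) (string, bool) {
	if v, ok := s.cache[k]; ok {
		return v, true
	}
	v, ok := s.parent[k]
	if ok {
		s.cache[k] = v // clean entry: could be evicted without losing data
	}
	return v, ok
}

// Set buffers the write in memory; the parent is not touched yet.
func (s *store) Set(k, v string) {
	s.cache[k] = v
	s.dirty[k] = true // dirty entry: evicting it would lose the write
}

// Write flushes all dirty entries to the parent and resets the cache.
func (s *store) Write() {
	for k := range s.dirty {
		s.parent[k] = s.cache[k]
	}
	s.cache = map[string]string{}
	s.dirty = map[string]bool{}
}

func main() {
	parent := map[string]string{"a": "1"}
	s := newStore(parent)

	s.Get("a")      // cached read
	s.Set("b", "2") // buffered write
	s.Set("c", "3") // buffered write
	fmt.Println(len(s.cache), len(parent)) // prints 3 1

	s.Write()
	fmt.Println(len(s.cache), len(parent)) // prints 0 3
}
```

In this sketch the cache grows with every distinct key touched between calls to `Write`, which is why a hard size bound conflicts with using the store to scope a transaction in memory.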

thpani · 2022-11-25T11:17:04Z

store/cachekv/README.md

+
+## Iteration
+
+Efficient iteration over keys in `KVStore` is important for generating Merkle range proofs. Iteration over `CacheKVStore` requires producing all key-value pairs from the underlying `KVStore` while taking into account updated values from the cache. 


We should say here that iterators range over a key interval [start, end), as it becomes important below.
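
As a rough illustration of iterating over `[start, end)` while taking cached updates into account, here is a minimal sketch. It is not the SDK's iterator: the `entry` type and the `iterate` function are hypothetical, plain maps stand in for the parent `KVStore` and the cache, and keys are collected and sorted up front instead of being merged lazily.

```go
package main

import (
	"fmt"
	"sort"
)

// entry is a hypothetical cache record; deleted marks a key removed in the cache.
type entry struct {
	value   string
	deleted bool
}

// iterate returns the key-value pairs visible through the cache for keys in
// the half-open interval [start, end), in ascending key order.
func iterate(parent map[string]string, cache map[string]entry, start, end string) [][2]string {
	inRange := func(k string) bool { return k >= start && k < end }

	// Collect candidate keys from both layers, without duplicates.
	seen := map[string]bool{}
	for k := range parent {
		if inRange(k) {
			seen[k] = true
		}
	}
	for k := range cache {
		if inRange(k) {
			seen[k] = true
		}
	}
	keys := make([]string, 0, len(seen))
	for k := range seen {
		keys = append(keys, k)
	}
	sort.Strings(keys)

	var out [][2]string
	for _, k := range keys {
		if e, ok := cache[k]; ok {
			// The cache takes precedence over the parent.
			if e.deleted {
				continue // deleted in the cache: hide the parent's value
			}
			out = append(out, [2]string{k, e.value})
			continue
		}
		out = append(out, [2]string{k, parent[k]})
	}
	return out
}

func main() {
	parent := map[string]string{"a": "1", "b": "2", "c": "3", "d": "4"}
	cache := map[string]entry{
		"b":  {value: "20"},   // overwritten in the cache
		"bb": {value: "new"},  // exists only in the cache
		"c":  {deleted: true}, // deleted in the cache
	}
	// "a" lies before start, and "d" is excluded because end is exclusive.
	fmt.Println(iterate(parent, cache, "b", "d")) // prints [[b 20] [bb new]]
}
```

The example shows the three cases the merged view has to handle inside the interval: an overwritten key, a key that exists only in the cache, and a key deleted in the cache.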

tac0turtle · 2022-12-16T03:19:45Z

@thpani @angbrav is it okay if we merge this and then tackle the needed changes in a follow-up PR?

thpani · 2022-12-16T09:07:06Z

@tac0turtle I would like to resolve #13977 (comment) first, it might cause confusion if it's merged as-is.

The remaining comments should be easy to address. We could do a follow-up PR, but they should also be fairly easy to address right here.

Also, keep in mind that we need to sync this with the changes of #13881.

angbrav · 2022-12-16T09:33:36Z

@tac0turtle I'd rather resolve the simple issues in this PR and the more complex ones in a different one. But I am also fine with the alternative.

By simple I mean:

More complex ones (a different PR):

* sync this with the changes of #13881 (perf: optimize iteration on nested cache context)
* are nested iterators safe?
* do we need a mutex?

alexanderbez · 2022-12-16T15:29:46Z

Not that I'm aware of, no.

yihuang · 2022-12-17T17:09:29Z

FYI, I just did a relatively big refactoring on cachekv: #14350

tac0turtle · 2022-12-28T10:11:23Z

Merging this; let's handle the changes in a follow-up PR.

docs on current cachekv implementation

dde2af0

github-actions bot added the C:Store label Nov 22, 2022

ValarDragon reviewed Nov 22, 2022

dangush added 3 commits November 22, 2022 16:11

added suggested change, some minor edits

3122e7d

Merge remote-tracking branch 'upstream/main' into dangush/12986-cachekv-spec

9e0cb04

formatting tweaks

6b1ab7e

dangush marked this pull request as ready for review November 24, 2022 01:29

dangush requested a review from a team as a code owner November 24, 2022 01:29

Merge branch 'main' into dangush/12986-cachekv-spec

b6f94f2

tac0turtle assigned angbrav and alexanderbez Nov 24, 2022

tac0turtle approved these changes Nov 24, 2022

thpani reviewed Nov 25, 2022

dangush and others added 3 commits November 30, 2022 19:06

Added feedback from @thpani

6bcacbc

Merge branch 'main' into dangush/12986-cachekv-spec

21e9075

Merge branch 'main' into dangush/12986-cachekv-spec

1fed024

alexanderbez approved these changes Dec 16, 2022

tac0turtle and others added 2 commits December 28, 2022 11:10

Update store/cachekv/README.md

fdaade6

Co-authored-by: Aleksandr Bezobchuk <alexanderbez@users.noreply.github.com>

Merge branch 'main' into dangush/12986-cachekv-spec

fd45412

tac0turtle enabled auto-merge (squash) December 28, 2022 10:11

tac0turtle merged commit 741f4ae into cosmos:main Dec 28, 2022
