Improve IdentityStore Invalidate performance #27184
Conversation
// bucketLocalAlias.
var memDBLocalAlias *identity.Alias
for i, localAlias := range memDBLocalAliases {
	if localAlias.ID == bucketLocalAlias.ID {
Discussed out of band the possibility of creating a new MemDBMapIDToEntityByBucketKeyInTxn (yuck). It would be really nice if we could build up a map[ID]identity.Entity and a map[ID]identity.Alias and use those to trim duplicates by direct lookup, thus replacing the inner loop.
I think this could potentially be a significant improvement in clarity and performance (though the current patch has already proven a significant improvement over the prior implementation).
Since this code has already been tested and provides the results we want, we can defer this change as follow-up work after the Code Freeze.
Looks good to me @marcboudreau! I know we've done significant testing and validation of the improvement, which helps us have confidence in the change.
The comment wording nit and the map optimization we discussed can be addressed as future work, so I'll go ahead and approve!
edit: It looks like we need a godoc to pass the linter check. Aside from that CI appears to be happy.
// We've considered the use of github.com/google/go-cmp here,
// but opted for sticking with reflect.DeepEqual because go-cmp
// is intended for testing and is able to panic in some
// situations.
// We've considered the use of github.com/google/go-cmp here,
// but opted for sticking with reflect.DeepEqual because go-cmp
// is intended for testing and is able to panic in some
// situations.
// Though DeepEqual relies on == equality for underlying comparison,
// this is perfectly safe for all compared fields. Timestamps are all in
// unix epoch time and embedded structs contain no `.Equals`.
* improve identitystore invalidate performance
* add changelog
* adding test to cover invalidation of entity bucket keys within IdentityStore
* minor clean ups
* adding tests
* add missing godoc for tests
…/1.16.x (#27230)

* Improve IdentityStore Invalidate performance (#27184)
* improve identitystore invalidate performance
* add changelog
* adding test to cover invalidation of entity bucket keys within IdentityStore
* minor clean ups
* adding tests
* add missing godoc for tests
* fix incorrect merge resolution

Co-authored-by: Marc Boudreau <marc.boudreau@hashicorp.com>
The Invalidate method of the IdentityStore struct was using a simplistic algorithm to synchronize the MemDB records (entities, groups, local entity aliases) with those from the storage bucket. This algorithm resulted in a large number of MemDB operations within a single transaction whenever the storage bucket contained a large number of records. That volume of operations forced MemDB to use a much slower comparer function, which caused the Invalidate function to take a long time to complete and could cause the node to fall so far behind in processing the WALs sent over by the primary cluster that the replication state would transition to merkle-sync.

The simplistic approach consisted of deleting everything from MemDB that was associated with the invalidated storage bucket and re-inserting those resources using the state contained in the storage bucket. Since invalidations usually occur to signal that a single resource has changed, been added, or been deleted, a lot of unnecessary work (deleting and re-adding) was being done whenever a large number of unchanged resources also existed in the storage bucket.
These changes replace the simplistic approach for the handling of entities and local entity aliases, since those are the resources most likely to exist in the large numbers where this problem occurs.

The new approach compares the contents of the invalidated storage bucket with the set of resources in MemDB associated with that storage bucket. Resources that match in both systems are left alone, and only the differences are rectified in MemDB.