Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

test(store/snapshot): fix flaky test #20078

Merged
merged 9 commits into from
Apr 26, 2024

Conversation

EmilGeorgiev
Copy link
Contributor

@EmilGeorgiev EmilGeorgiev commented Apr 17, 2024

Description

Closes: #19834

Fix flaky test in storage/snapshot


Author Checklist

All items are required. Please add a note to the item if the item is not applicable and
please add links to any relevant follow up issues.

I have...

  • [x ] included the correct type prefix in the PR title
  • [ x] confirmed ! in the type prefix if API or client breaking change
  • [x ] targeted the correct branch (see PR Targeting)
  • [ x] provided a link to the relevant issue or specification
  • [ x] reviewed "Files changed" and left comments if necessary
  • [ x] included the necessary unit and integration tests
  • [ x] added a changelog entry to CHANGELOG.md
  • [ x] updated the relevant documentation or specification, including comments for documenting Go code
  • [x ] confirmed all CI checks have passed

Reviewers Checklist

All items are required. Please add a note if the item is not applicable and please add
your handle next to the items reviewed if you only reviewed selected items.

I have...

  • confirmed the correct type prefix in the PR title
  • confirmed all author checklist items have been addressed
  • reviewed state machine logic, API design and naming, documentation is accurate, tests and test coverage

Summary by CodeRabbit

  • Tests
    • Enhanced the store saving test to support parallel execution and handle concurrent save operations more effectively.
    • Introduced a mock structure to improve testing capabilities.

@EmilGeorgiev EmilGeorgiev requested a review from a team as a code owner April 17, 2024 20:06
Copy link
Contributor

coderabbitai bot commented Apr 17, 2024

Walkthrough

The update involves enhancing the TestStore_Save function in the Cosmos SDK to support parallel testing and better manage concurrent operations. Changes include removing unnecessary imports, adding parallel execution capabilities, and introducing a new mock structure for improved test reliability.

Changes

File Path Change Summary
store/snapshots/store_test.go - Removed sync import
- Added t.Parallel()
- Modified concurrent save logic
- Added ReadCloserMock struct

Assessment against linked issues

Objective Addressed Explanation
Identify and resolve flakiness in tests involving concurrent operations [#19834]
Ensure no errors occur due to file access in concurrent scenarios [#19834] The PR focuses on internal logic adjustments and might not address file access directly.
Investigate and prevent panics due to non-existent files in tests [#19834] Changes do not explicitly handle file creation or existence checks.

Possibly related issues


Thank you for using CodeRabbit. We offer it for free to the OSS community and would appreciate your support in helping us grow. If you find it useful, would you consider giving us a shout-out on your favorite social media?

Share
Tips

Chat

There are 3 ways to chat with CodeRabbit:

  • Review comments: Directly reply to a review comment made by CodeRabbit. Example:
    • I pushed a fix in commit <commit_id>.
    • Generate unit testing code for this file.
    • Open a follow-up GitHub issue for this discussion.
  • Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
    • @coderabbitai generate unit testing code for this file.
    • @coderabbitai modularize this function.
  • PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
    • @coderabbitai generate interesting stats about this repository and render them as a table.
    • @coderabbitai show all the console.log statements in this repository.
    • @coderabbitai read src/utils.ts and generate unit testing code.
    • @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (invoked as PR comments)

  • @coderabbitai pause to pause the reviews on a PR.
  • @coderabbitai resume to resume the paused reviews.
  • @coderabbitai review to trigger a review. This is useful when automatic reviews are disabled for the repository.
  • @coderabbitai resolve resolve all the CodeRabbit review comments.
  • @coderabbitai help to get help.

Additionally, you can add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.

CodeRabbit Configration File (.coderabbit.yaml)

  • You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
  • Please see the configuration documentation for more information.
  • If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Documentation and Community

  • Visit our Documentation for detailed information on how to use CodeRabbit.
  • Join our Discord Community to get help, request features, and share feedback.
  • Follow us on X/Twitter for updates and announcements.

Copy link
Contributor

@alpe alpe left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Good start on the flaky test! Concurrency is always hard to catch in tests. 🪡
With your changes, the store.Save is called sequential, now. It does fix the flakyness but may not be the original intend.

store/snapshots/store_test.go Outdated Show resolved Hide resolved
Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 1

_, err = store.Save(7, 2, makeChunks(nil))
require.Error(t, err)
_, err = store.Save(8, 1, makeChunks(nil))
require.NoError(t, err)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

the case of different height should succeed is missed

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done

Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 1

Out of diff range and nitpick comments (1)
store/snapshots/store_test.go (1)

323-351: The concurrency test scenario is well-implemented using a wait group and an atomic counter. However, consider adding a test case for different heights to ensure that concurrent saves at different heights are handled correctly.

if i < n {
_, err = store.Save(7, 1, ch)
} else {
_, err = store.Save(8, 1, makeChunks(nil))
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

good intend but this condition on n does not make it very readable, IMHO.
Either you can cover the second test case in separate code block or you refactor towards the height as argument. For example:

	var (
		wgStart, wgDone sync.WaitGroup
		mu              sync.Mutex
		gotErrHeights   []uint64
	)
	srcHeights := []uint64{7, 7, 7, 8, 9}
	wgStart.Add(len(srcHeights))
	wgDone.Add(len(srcHeights))
	for _, h := range srcHeights {
		ch = make(chan io.ReadCloser, 1)
		ch <- &ReadCloserMock{} // does not block on a buffered channel
		close(ch)
		go func(height uint64) {
			wgStart.Done()
			wgStart.Wait() // wait for all routines started
			if _, err = store.Save(height, 1, ch); err != nil {
				mu.Lock()
				gotErrHeights = append(gotErrHeights, height)
				mu.Unlock()
			}
			wgDone.Done()
		}(h)
	}
	wgDone.Wait() // wait for all routines completed
	assert.Equal(t, []uint64{7, 7}, gotErrHeights)

This approach has the benefit that you can identify the failing height rather than relying on the count. WDYT?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

great point

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

fixed

Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 1

Comment on lines +349 to +357
type ReadCloserMock struct {
}

func (r ReadCloserMock) Read(p []byte) (n int, err error) {
return len(p), io.EOF
}

func (r ReadCloserMock) Close() error {
return nil
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The ReadCloserMock is used for testing but currently returns a fixed response. Consider enhancing this mock to return configurable data to increase the flexibility and realism of the tests.

- func (r ReadCloserMock) Read(p []byte) (n int, err error) {
-     return len(p), io.EOF
+ func (r *ReadCloserMock) Read(p []byte) (n int, err error) {
+     if len(r.data) > 0 {
+         n = copy(p, r.data)
+         r.data = r.data[n:]
+         return n, nil
+     }
+     return 0, io.EOF
+ }
+
+ func NewReadCloserMock(data []byte) *ReadCloserMock {
+     return &ReadCloserMock{data: data}
+ }

Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation.

Suggested change
type ReadCloserMock struct {
}
func (r ReadCloserMock) Read(p []byte) (n int, err error) {
return len(p), io.EOF
}
func (r ReadCloserMock) Close() error {
return nil
type ReadCloserMock struct {
data []byte
}
func (r *ReadCloserMock) Read(p []byte) (n int, err error) {
if len(r.data) > 0 {
n = copy(p, r.data)
r.data = r.data[n:]
return n, nil
}
return 0, io.EOF
}
func (r ReadCloserMock) Close() error {
return nil
}
func NewReadCloserMock(data []byte) *ReadCloserMock {
return &ReadCloserMock{data: data}
}

Copy link
Contributor

@alpe alpe left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the updates. Good job to trace this down 🏅

@cool-develope cool-develope added this pull request to the merge queue Apr 26, 2024
Merged via the queue into cosmos:main with commit 560e6fb Apr 26, 2024
58 of 59 checks passed
meetrick pushed a commit to meetrick/cosmos-sdk that referenced this pull request Apr 29, 2024
Co-authored-by: cool-developer <51834436+cool-develope@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Flaky store test
3 participants