refactor: sharky cleanup and simplification of code #3500
base: master
Conversation
Force-pushed from 1f6576a to ec5fe69 (Compare)
pkg/sharky/store.go
Outdated
s.wg.Add(1)
go func() {
	defer sh.slots.wg.Done()
	defer s.wg.Done()
	sh.process()
Currently a new goroutine is started for each shard instance; would it be possible to avoid these goroutines? Are they really necessary?
All that sh.process() does is essentially two operations:
slot := sh.slots.Next()
sh.slots.Use(slot)
Would it be possible to just call these two functions directly (store.go:136) instead of reading from a channel? Probably one more utility function would be needed to get the shard, but overall the code might be simpler.
good idea actually :)
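A minimal sketch of the suggested direct-call approach, assuming hypothetical names (write, nextShard, shard.write) that are not taken from the actual code:

// Hypothetical simplified write path: instead of each shard running its own
// goroutine and handing out slots over a channel, the store picks a shard
// and reserves the next free slot with two direct calls.
func (s *Store) write(buf []byte) (Location, error) {
	sh := s.nextShard()        // hypothetical helper returning the shard to write into
	slot := sh.slots.Next()    // next free slot index in this shard
	sh.slots.Use(slot)         // mark the slot as taken
	return sh.write(buf, slot) // hypothetical: write the blob into the reserved slot
}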
sl.size = uint32(len(sl.data) * 8)
sl.head = sl.next(0)
return err
sl.data = data
Should the statement
sl.data = data
be guarded with a mutex?
No, this function is only used during bootup, so no mutex is needed.
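For context, a hedged sketch of what such a bootup-time load might look like; the names file, data, size, head and next are assumptions based on the diff above, not the actual implementation:

// load reads the free-slot bitmap from disk during bootup. It runs before any
// other goroutine touches the slots, which is why no mutex is taken here.
func (sl *slots) load() error {
	data, err := io.ReadAll(sl.file) // sl.file: hypothetical handle to the on-disk bitmap
	if err != nil {
		return err
	}
	sl.data = data
	sl.size = uint32(len(sl.data) * 8) // one bit per slot, so 8 slots per byte
	sl.head = sl.next(0)               // first free slot, scanning from index 0
	return nil
}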
This sharky implementation does not guarantee which shard and location is returned next. As an outcome this could lead to 1) probably more cache invalidation (at the HDD/SSD level); and 2) fragmented data. Would it make sense to have sharky always return the first available location in a consistent order? That way disk IO would probably hit the same sector(s) on disk (probably giving a cache hit).
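A free-standing illustration of the "first available location in consistent order" idea (not the sharky implementation): scanning the free-slot bitmap from the start always yields the lowest free index, which keeps new writes clustered near the beginning of the shard file.

// firstFree returns the lowest free slot index in a bitmap where a set bit
// means "free", or -1 when every slot is taken.
func firstFree(bitmap []byte) int {
	for i, b := range bitmap {
		if b == 0 {
			continue // all 8 slots covered by this byte are in use
		}
		for j := 0; j < 8; j++ {
			if b&(1<<j) != 0 {
				return i*8 + j
			}
		}
	}
	return -1
}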
@@ -44,12 +48,11 @@ type Store struct {
// - maxDataSize - positive integer representing the maximum blob size to be stored
func New(basedir fs.FS, shardCnt int, maxDataSize int) (*Store, error) {
Is the store ever writing data down to disk (closing and saving shards)?
It asks the shards to ask their slots to save and close.
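A rough sketch of that save-and-close chain, with hypothetical method names (Close, close, save) and errors.Join used purely for illustration:

// Close asks every shard to close; each shard in turn asks its slots to
// persist their state before the underlying file is closed.
func (s *Store) Close() error {
	var errs error
	for _, sh := range s.shards {
		// hypothetical: sh.close() calls sh.slots.save() and then closes the shard file
		if err := sh.close(); err != nil {
			errs = errors.Join(errs, err)
		}
	}
	return errs
}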
@@ -28,12 +27,17 @@ var (
// - read prioritisation over writing
// - free slots allow write
type Store struct {
This file should be tested.
Order of writes does not guarantee order of reads, however. We cannot tell which chunk will be retrieved in the future, so it does not matter in which shard we place them.
Force-pushed from 0a9a83e to 2e4f076 (Compare)
Consider this again after the major localstore releases.
Checklist
Description
Write operations now follow a basic round-robin strategy, distributing requests across the shards in circular order.
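A minimal, self-contained sketch of that round-robin selection (illustrative only; the type and field names here are not taken from the sharky code):

package main

import (
	"fmt"
	"sync/atomic"
)

// roundRobin hands out shard indices in circular order, so consecutive
// write requests are spread evenly across all shards.
type roundRobin struct {
	next     uint32
	shardCnt uint32
}

func (r *roundRobin) nextShard() uint32 {
	// AddUint32 returns the incremented value; subtract 1 so indices start at 0.
	return (atomic.AddUint32(&r.next, 1) - 1) % r.shardCnt
}

func main() {
	r := &roundRobin{shardCnt: 4}
	for i := 0; i < 8; i++ {
		fmt.Print(r.nextShard(), " ") // prints: 0 1 2 3 0 1 2 3
	}
	fmt.Println()
}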
Open API Spec Version Changes (if applicable)
Motivation and Context (Optional)
Related Issue (Optional)
Screenshots (if appropriate):