
storage: concurrent disk image write. #2395

Open · wants to merge 5 commits into base: main
Conversation

@iwanbk (Member) commented Aug 13, 2024

Write the image concurrently to speed it up compared to the previous sequential write.

Description

Change the image DiskWrite from sequential to concurrent to make it faster. A sketch of the idea follows.
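
The approach splits the image into fixed-size sections and writes each one in its own goroutine. A minimal sketch of the idea, assuming illustrative values for the chunk size and worker count (not the actual zos values):

```go
package storage

import (
	"io"
	"os"

	"golang.org/x/sync/errgroup"
)

// diskWrite copies a disk image from source into file in concurrent chunks.
// chunkSize and workers are illustrative assumptions, not the actual zos values.
func diskWrite(file *os.File, source io.ReaderAt, size int64) error {
	const workers = 10               // 5-10 goroutines performed best in the benchmark below
	const chunkSize int64 = 64 << 20 // 64 MiB per section (assumed)

	group := new(errgroup.Group)
	group.SetLimit(workers)

	for start := int64(0); start < size; start += chunkSize {
		start := start // capture for the closure (pre-Go 1.22 loop semantics)
		length := chunkSize
		if start+length > size {
			length = size - start
		}
		group.Go(func() error {
			// each goroutine writes its own non-overlapping region of the file
			wr := io.NewOffsetWriter(file, start)
			rd := io.NewSectionReader(source, start, length)
			_, err := io.Copy(wr, rd)
			return err
		})
	}
	// errgroup: no point in continuing if one of the goroutines fails
	return group.Wait()
}
```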

Changes

  • change to concurrent write

Related Issues

Fixes:

Checklist

  • Tests included -> manual test
  • Build pass
  • Documentation
  • Code format and docstring

@muhamadazmy (Member) left a comment


In general this looks good, but I would love to see a benchmark comparing the performance of the two approaches.

@iwanbk (Member, Author) commented Aug 13, 2024

@muhamadazmy
The benchmark is already in the linked issue. I'm including it here as well, since this is indeed the better place for it.

I've tried to parallelize the io.Copy in this environment:

  • zos node: in Indonesia, running on my old 2016 PC under QEMU (Indonesia is close to Australia)
  • os image: nixos

the results:

  • original io.Copy: 1 hour (much worse than the reported 20 mins :))
  • 5 goroutines: 20 mins
  • 10 goroutines: 20 mins
  • 15 goroutines: 24 mins

@muhamadazmy (Member)

That's cool.

One thing I have to add: flists with raw images are obsolete and should not be used anymore. Zos still supports them for backwards compatibility only, but new workloads definitely should not be allowed to use this kind of image.

@muhamadazmy (Member)

Forgot to say: make sure to clean up the cache between benchmark runs. rfs caches the downloaded content in zos-cache, which means the second run will always go faster than the first since it doesn't have to download the image again.

@iwanbk (Member, Author) commented Aug 13, 2024

> Forgot to say: make sure to clean up the cache between benchmark runs. rfs caches the downloaded content in zos-cache, which means the second run will always go faster than the first since it doesn't have to download the image again.

Sure. It only took ~2 mins with the cache.

@iwanbk (Member, Author) commented Aug 13, 2024

Actually, I not only deleted the cache but also deleted all the QEMU CoW disks, because I don't know how to delete just the cache. I guess it is under /var/cache/modules/flistd?

```
var/cache/modules/flistd # ls
cache       flist       log         mountpoint  pid         ro
```

pkg/storage/disk.go
```go
// the sequential write is slow because the data source is from the remote server.
var (
	// use errgroup because there is no point in continuing if one of the goroutines failed
	group = new(errgroup.Group)
```
Collaborator:

should be usable on the zero value or with a context if required

@iwanbk (Member, Author):

> should be usable on the zero value

Could you elaborate more on this?

> or with a context if required

Not possible for now:

  • the func doesn't take a context
  • io.Copy itself doesn't take a context

I think making it context-aware will need a separate ticket.
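
For reference, what the reviewer likely means: errgroup's zero value is ready to use without a constructor, and errgroup.WithContext is the form to use when cancellation is needed. A tiny sketch:

```go
package storage

import (
	"context"

	"golang.org/x/sync/errgroup"
)

// the zero value works as-is; new(errgroup.Group) is equivalent
func zeroValue() error {
	var group errgroup.Group
	group.Go(func() error { return nil })
	return group.Wait()
}

// WithContext derives a ctx that is cancelled as soon as any goroutine
// returns a non-nil error, so workers can stop early
func withContext(parent context.Context) error {
	group, ctx := errgroup.WithContext(parent)
	group.Go(func() error { return ctx.Err() })
	return group.Wait()
}
```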

@iwanbk (Member, Author):

> I think making it context-aware will need a separate ticket.

DiskWrite is handled by storaged and called by provisiond through zbus, so a plain Go context will not really work here. But of course we can start with a context inside storaged.

As a side note, I noticed that DiskWrite keeps going even though provisiond has already cancelled the deployment/contract. So we already have a real cancellation issue over zbus.
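
For the record, if a context ever gets threaded into storaged, the usual workaround for io.Copy not accepting one is a reader wrapper that checks the context on every Read. A sketch under that assumption (names here are illustrative):

```go
package storage

import (
	"context"
	"io"
)

// ctxReader makes an io.Copy abort early by failing Read once ctx is cancelled.
type ctxReader struct {
	ctx context.Context
	r   io.Reader
}

func (c *ctxReader) Read(p []byte) (int, error) {
	if err := c.ctx.Err(); err != nil {
		return 0, err
	}
	return c.r.Read(p)
}

// copyCtx is io.Copy with best-effort cancellation between Read calls.
func copyCtx(ctx context.Context, dst io.Writer, src io.Reader) (int64, error) {
	return io.Copy(dst, &ctxReader{ctx: ctx, r: src})
}
```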

Collaborator:

Can you please mention the cancellation problem as a known issue at the top of the PR, so it can easily be found later on?

@iwanbk (Member, Author):

> Can you please mention the cancellation problem as a known issue at the top of the PR, so it can easily be found later on?

Maybe create a separate issue instead, because it is not really specific to this PR.

```go
	}
	wr := io.NewOffsetWriter(file, start)
	rd := io.NewSectionReader(source, start, len)
	_, err = io.Copy(wr, rd)
```
Collaborator:

I believe using io.CopyBuffer could avoid temporary buffers: https://pkg.go.dev/io#CopyBuffer

Collaborator:

Also, is there no need to call file.Sync?

@iwanbk (Member, Author):

> I believe using io.CopyBuffer could avoid temporary buffers: https://pkg.go.dev/io#CopyBuffer

The difference between io.Copy and io.CopyBuffer is that CopyBuffer lets us choose our own buffer; io.Copy also uses a buffer internally. The slowness here comes from downloading the file from the hub, so I don't think the buffer will make a big difference here.

@iwanbk (Member, Author):

> Also, is there no need to call file.Sync?

I don't think so; we're copying to rfs anyway. CMIIW.

Collaborator:

io.CopyBuffer uses a buffer you provide, instead of io.Copy allocating one on every call (it's still an allocation to save, imo).

@iwanbk (Member, Author):

> instead of io.Copy allocating one on every call

No, the buffer used by io.Copy is also allocated only once per call. See https://github.com/golang/go/blob/master/src/io/io.go#L417-L427

If what you mean is creating a global buffer outside the DiskWrite func, then it would be a lot of work for very small benefit.
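
For completeness, the io.CopyBuffer variant under discussion would look roughly like this, as a drop-in replacement for the group.Go body in the diff above. Note the buffer has to be per-goroutine, since sharing one buffer across concurrent copies is unsafe; the 1 MiB size is an assumption:

```go
group.Go(func() error {
	wr := io.NewOffsetWriter(file, start)
	rd := io.NewSectionReader(source, start, len)
	// one reusable buffer per goroutine; a single global buffer would
	// not work here because the copies run concurrently
	buf := make([]byte, 1<<20)
	_, err := io.CopyBuffer(wr, rd, buf)
	return err
})
```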

@xmonader (Collaborator)

All in all this looks good, but I also wonder whether that code path should be enabled in the case of HDD-only nodes?

@iwanbk (Member, Author) commented Aug 13, 2024

> All in all this looks good, but I also wonder whether that code path should be enabled in the case of HDD-only nodes?

Can you elaborate more on this? Is it because an HDD-only node will be slow? And what is provisiond's behavior regarding this?

@xmonader (Collaborator)

> All in all this looks good, but I also wonder whether that code path should be enabled in the case of HDD-only nodes?
>
> Can you elaborate more on this? Is it because an HDD-only node will be slow? And what is provisiond's behavior regarding this?

zos right now also supports HDD nodes. I believe the sequential nature of HDDs means the performance would be impacted (made worse) by the concurrency.

@xmonader (Collaborator) commented Aug 13, 2024

On another note, I'd also add the concept of retries to the code if possible.
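
A sketch of what per-chunk retries could look like if added at this layer; writeChunkWithRetry is a hypothetical helper, and the attempt count and backoff are assumptions:

```go
package storage

import (
	"io"
	"os"
	"time"
)

// writeChunkWithRetry retries a single section copy a few times before
// giving up. Hypothetical helper; attempts and backoff are assumed values.
func writeChunkWithRetry(file *os.File, source io.ReaderAt, start, length int64) error {
	const attempts = 3
	var err error
	for i := 0; i < attempts; i++ {
		// recreate the writer/reader so a retry rewrites the chunk from its start
		wr := io.NewOffsetWriter(file, start)
		rd := io.NewSectionReader(source, start, length)
		if _, err = io.Copy(wr, rd); err == nil {
			return nil
		}
		time.Sleep(time.Duration(i+1) * time.Second) // linear backoff (assumed)
	}
	return err
}
```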

@iwanbk (Member, Author) commented Aug 13, 2024

> zos right now also supports HDD nodes. I believe the sequential nature of HDDs means the performance would be impacted (made worse) by the concurrency.

This is true if we write to a regular file, but that is not the case here:

  1. The destination is rfs, and rfs doesn't store the image in a single regular file.
  2. Also be aware that the slowness is on the download side (from the hub), not on the write side to the disk image.

As a side note, the current code favors SSD disks.

@iwanbk (Member, Author) commented Aug 13, 2024

> On another note, I'd also add the concept of retries to the code if possible.

Why not handle it inside rfs?

@iwanbk (Member, Author) commented Aug 16, 2024

> zos right now also supports HDD nodes. I believe the sequential nature of HDDs means the performance would be impacted (made worse) by the concurrency.

I assume that won't happen, because the slowness of the remote rfs will mask it. But since it is hard to test, I think disabling it on HDD-only nodes is safer.

Fixed in caf83ac @xmonader
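
The fix in caf83ac isn't shown in this thread, but gating the worker count on disk type could look roughly like this; isSSD is a hypothetical flag, not the actual zos check:

```go
// inside the write path, before spawning workers:
workers := 10
if !isSSD { // hypothetical flag: fall back to a single sequential writer on HDD-only nodes
	workers = 1
}
group := new(errgroup.Group)
group.SetLimit(workers)
```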
