Simplifies readFrom and WriteTo, and improves their performance. #142

lemire · 2023-10-12T15:54:56Z

On one benchmark, this PR doubles the performance. With this PR, we reach 1.7 GB/s (reading and writing to memory). This is for reading and writing, so that the bandwidth is 3.4 GB/s which is not great, but... Note that this for in-memory... if you write and read to a disk... it is going to be slower.

Before...

$ go test -bench  BenchmarkBitsetReadWrite -benchmem
goos: darwin
goarch: arm64
BenchmarkBitsetReadWrite-8         69720             16075 ns/op              16 B/op          2 allocs/op
PASS
ok      _/Users/dlemire/CVS/github/bitset       1.533s

After...

$ go test -bench  BenchmarkBitsetReadWrite -benchmem
goos: darwin
goarch: arm64
BenchmarkBitsetReadWrite-8        151460              7396 ns/op            2064 B/op          4 allocs/op
PASS
ok      _/Users/dlemire/CVS/github/bitset       1.321s

Fixes #141

lemire · 2023-10-12T15:57:38Z

@thanhpk @klauspost @omerfirmak @king526 : Please review/comment.

I really don't understand why our code is so convoluted when what I wrote is just... obvious? Is there a catch? Why do we have all this complexity in the first place ?

thanhpk · 2023-10-12T16:16:42Z

@lemire you are trading space for time, it would alloc a big chunk of memory, crashing some programs if the bitset is big enough

We already discussed this
#103

omerfirmak · 2023-10-12T16:28:33Z

@lemire you are trading space for time

-benchmem to see the GC hit you are taking. We, at Nethermind, don't depend on bitset anymore (due to #134). But I would rather keep the GC at minimum.

lemire · 2023-10-12T19:43:56Z

@thanhpk

it would alloc a big chunk of memory, crashing some programs if the bitset is big enough

I don't think it would. The overhead memory usage should still effectively constant:

func readUint64Array(reader io.Reader, data []uint64) error {
	length := len(data)
	bufferSize := 128
	buffer := make([]byte, bufferSize*int(wordBytes))
	for i := 0; i < length; i += bufferSize {
		end := i + bufferSize
		if end > length {
			end = length
			buffer = buffer[:wordBytes*uint(end-i)]
		}
		chunk := data[i:end]
		if _, err := io.ReadFull(reader, buffer); err != nil {
			return err
		}
		for i := range chunk {
			chunk[i] = uint64(binaryOrder.Uint64(buffer[8*i:]))
		}
	}
	return nil
}

func writeUint64Array(writer io.Writer, data []uint64) error {
	bufferSize := 128
	buffer := make([]byte, bufferSize*int(wordBytes))
	for i := 0; i < len(data); i += bufferSize {
		end := i + bufferSize
		if end > len(data) {
			end = len(data)
			buffer = buffer[:wordBytes*uint(end-i)]
		}
		chunk := data[i:end]
		for i, x := range chunk {
			binaryOrder.PutUint64(buffer[8*i:], x)
		}
		_, err := writer.Write(buffer)
		if err != nil {
			return err
		}
	}
	return nil
}

@omerfirmak

I would rather keep the GC at minimum.

Yeah. It seems that binary.Read and binary.Write are allocating functions. But we can just reuse the buffer, see my code above.

thanhpk · 2023-10-13T01:10:54Z

@lemire I agree

Simplifies readFrom and WriteTo, and improves their performance.

5df2e1c

tweak

6c9ae5a

tweak

9a5bfb7

lemire merged commit 2cc58bd into master Oct 13, 2023
30 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Simplifies readFrom and WriteTo, and improves their performance. #142

Simplifies readFrom and WriteTo, and improves their performance. #142

lemire commented Oct 12, 2023 •

edited

Loading

lemire commented Oct 12, 2023

thanhpk commented Oct 12, 2023 •

edited

Loading

omerfirmak commented Oct 12, 2023 •

edited

Loading

lemire commented Oct 12, 2023 •

edited

Loading

thanhpk commented Oct 13, 2023

Simplifies readFrom and WriteTo, and improves their performance. #142

Simplifies readFrom and WriteTo, and improves their performance. #142

Conversation

lemire commented Oct 12, 2023 • edited Loading

lemire commented Oct 12, 2023

thanhpk commented Oct 12, 2023 • edited Loading

omerfirmak commented Oct 12, 2023 • edited Loading

lemire commented Oct 12, 2023 • edited Loading

thanhpk commented Oct 13, 2023

lemire commented Oct 12, 2023 •

edited

Loading

thanhpk commented Oct 12, 2023 •

edited

Loading

omerfirmak commented Oct 12, 2023 •

edited

Loading

lemire commented Oct 12, 2023 •

edited

Loading