
[WIP] resurrecting in place ntoh #112

Closed · wants to merge 4 commits

Conversation

@aminnj (Member) commented Sep 18, 2021

Resurrection of #101 (which means some copy-pasting from @Moelf ;) ). Basically, we add an inplace::Bool to interped_data which toggles between ntoh.(reinterpret(...)) (a copy) and an in-place variant, both returning the same type, so it's still type stable.

However, with this small modification basketarray()'s return type now depends on the boolean. This can probably be fixed with a barrier function that discards the Ref{Vector{UInt8}} when called with inplace=false (the default).
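
Roughly, the barrier idea would look something like this (illustrative names, not the actual UnROOT API; this only sketches the type-stability trick, not the in-place decode itself). The inner function always returns (data, ref); dispatching on Val(inplace) gives each outer method a concrete return type, and the copy path simply drops the Ref:

decode(rawdata::Vector{UInt8}, ::Type{T}) where T =
    (ntoh.(reinterpret(T, rawdata)), Ref(rawdata))

# dispatch on Val(inplace) so each method has a concrete return type
getdata(rawdata, ::Type{T}, ::Val{true}) where T = decode(rawdata, T)          # keep the Ref
getdata(rawdata, ::Type{T}, ::Val{false}) where T = first(decode(rawdata, T))  # discard it

raw = collect(reinterpret(UInt8, hton.(Int32[1, 2, 3])))
@assert getdata(raw, Int32, Val(false)) == Int32[1, 2, 3]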

Unit tests all pass.

julia> using UnROOT

julia> const t = LazyTree(ROOTFile("uncompressed_Run2012BC_DoubleMuParked_Muons.root"),"Events");

julia> function bar(t)
           for evt in t
               _ = evt.Muon_pt
           end
           nothing
       end

before

julia> @btime sum(t.nMuon) seconds=10
  204.417 ms (6078 allocations: 469.98 MiB)
0x0000000008e67ad8

julia> @btime bar(t) seconds=10
  712.140 ms (14304 allocations: 1.82 GiB)

after

julia> @btime sum(t.nMuon) seconds=10
  205.316 ms (6780 allocations: 235.24 MiB)
0x0000000008e67ad8

julia> @btime bar(t) seconds=10
  725.626 ms (16241 allocations: 1.26 GiB)

Performance is pretty much the same but with less total allocated memory. If I profile bar(t), I see there's still a materialization. I thought this should be in place??

[screenshot: profile of bar(t) showing the materialization]


@Moelf (Member) commented Sep 18, 2021

I know it probably doesn't get much better, but this is very messy :(

@Moelf (Member) commented Sep 18, 2021

actually:

#before
(14304 allocations: 1.82 GiB)
#after
(16241 allocations: 1.26 GiB)

I think this means we shouldn't do this; the increased number of allocations probably caused the slowdown. And the difference isn't as big as the 2x in the nMuon case.

@aminnj (Member, Author) commented Sep 18, 2021

I agree it’s not pretty. I’ll keep this as an open WIP for now since at least it works. If we get inspiration for making it better and truly in place (?), that would be great :)

@tamasgal (Member) commented

I also think that this needs a bit more time ;) let's keep this floating around...

@aminnj (Member, Author) commented Oct 18, 2021

Brainstorming a bit more: what about a thin wrapper type? Instead of returning a Ref{Vector{UInt8}} everywhere and storing it in the LazyBranch to keep the data underlying unsafe_wrap from being GCed, we could define

struct Wrapper{T} <: AbstractVector{T}
    x::Vector{T}
    ref::Ref{Vector{UInt8}}
end
Base.@propagate_inbounds Base.getindex(w::Wrapper{T}, ind::Int) where T = w.x[ind]
Base.size(w::Wrapper) = size(w.x)

And then with minimal changes, we eliminate the materialization associated with reinterpret.

     function LazyBranch(f::ROOTFile, b::Union{TBranch,TBranchElement})
         T, J = auto_T_JaggT(f, b; customstructs=f.customstructs)
         T = (T === Vector{Bool} ? BitVector : T)
-        _buffer = T[]
+        _buffer = Wrapper{eltype(T)}([], Ref(UInt8[]))
         if J != Nojagg

 function interped_data(rawdata, rawoffsets, ::Type{T}, ::Type{J}) where {T, J<:JaggType}
     if J === Nojagg
-        return ntoh.(reinterpret(T, rawdata))
+        p = convert(Ptr{eltype(T)}, pointer(rawdata))
+        w = unsafe_wrap(Array, p, length(rawdata) ÷ sizeof(eltype(T)))
+        w .= ntoh.(w)
+        return Wrapper(w, Ref(rawdata))
julia> @btime sum(tf.nMuon) # before
  306.325 ms (1794 allocations: 469.67 MiB)
julia> @btime sum(tf.nMuon) # after
  206.946 ms (1716 allocations: 234.91 MiB)
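
To make the in-place path above concrete, a minimal standalone version (assuming the Wrapper type defined earlier; not the exact UnROOT code) would be:

rawdata = collect(reinterpret(UInt8, hton.(Int32[10, 20, 30])))
p = convert(Ptr{Int32}, pointer(rawdata))
w = unsafe_wrap(Array, p, length(rawdata) ÷ sizeof(Int32))
w .= ntoh.(w)                        # byte-swap in place, no second buffer
wrapped = Wrapper(w, Ref(rawdata))   # the Ref roots rawdata while the wrapper lives
@assert wrapped == Int32[10, 20, 30]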

I think it's not too hard to generalize to VoV since it can wrap any AbstractVector. I.e.,

julia> using ArraysOfArrays
julia> w = Wrapper([1,2,3,4], Ref(UInt8[]));
julia> VectorOfVectors(w, [1,2,5])
2-element VectorOfVectors{Int64, UnROOT.Wrapper{Int64}, Vector{Int64}, Vector{Tuple{}}}:
 [1]
 [2, 3, 4]
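
Sketching that generalization (interped_jagged and elem_lengths are made-up names for illustration; the elem_ptr convention matches the example above):

using ArraysOfArrays

# hypothetical jagged decode: elem_lengths gives the per-event element counts
function interped_jagged(rawdata::Vector{UInt8}, elem_lengths, ::Type{T}) where T
    p = convert(Ptr{T}, pointer(rawdata))
    w = unsafe_wrap(Array, p, length(rawdata) ÷ sizeof(T))
    w .= ntoh.(w)                                   # same in-place byte swap as above
    elem_ptr = vcat(1, cumsum(elem_lengths) .+ 1)   # e.g. lengths [1, 3] -> [1, 2, 5]
    return VectorOfVectors(Wrapper(w, Ref(rawdata)), elem_ptr)
end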

@Moelf (Member) commented Dec 2, 2021

I wonder if @oschulz has any opinion. In fact, it would be nice to have a variation of VectorOfVectors that has a field for a Ref to hold the original byte array...


@oschulz (Member) commented Dec 2, 2021

I'm not sure I've understood all the implications here - are you looking for something like a VectorOfVectors wrapped around an UnsafeArray?

@Moelf (Member) commented Dec 2, 2021

The problem is that we have an original byte array A::Vector{UInt8} and we want to wrap a VoV around it, except we want to "cast" it to, for example, B::Vector{Int32}. But if you cast it via unsafe_wrap(), GC thinks A is no longer needed, collects it, and you get bad data in VoV(B).
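
A minimal (hypothetical) reproduction of the hazard:

function bad_cast()
    A = collect(reinterpret(UInt8, hton.(Int32[1, 2, 3])))
    B = unsafe_wrap(Array, convert(Ptr{Int32}, pointer(A)), 3)
    return B  # A becomes unreachable here, so GC may reclaim the memory B aliases
end
# holding A alive alongside B (e.g. via the Ref field in the Wrapper above) avoids this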

@oschulz (Member) commented Dec 2, 2021

But why can't we use ReinterpretArray if we have to keep the reference to A around anyway?

@Moelf (Member) commented Dec 2, 2021

because ReinterpretArray is slow on several counts:

  1. A .= ntoh.(A) (10x slower)
  2. everything the user will later do, such as looping over the branch
  3. it's just generally ugly/annoying to deal with ReinterpretArray, considering we already have many wrappers
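
For point 1, one way to see the comparison (a BenchmarkTools sketch; the exact factor depends on the Julia version, see the follow-up below):

using BenchmarkTools

raw = rand(UInt8, 4_000_000)
r = reinterpret(UInt32, raw)  # lazy ReinterpretArray view of the bytes
v = collect(r)                # materialized Vector{UInt32} for comparison

@btime $r .= ntoh.($r);  # byte swap through the ReinterpretArray wrapper
@btime $v .= ntoh.($v);  # byte swap through a plain Vector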

@oschulz (Member) commented Dec 3, 2021

> A .= ntoh.(A) (10x slower)

I wonder why ReinterpretArray is performing that much worse, compared to our custom wrappers. Well, probably can't be helped right now.

In that case I would recommend using a VectorOfVectors around a custom wrapper - would that be workable? I don't think creating a special VoV type would increase performance, and it would generate more code to maintain long term.

@Moelf (Member) commented Feb 12, 2022

JuliaLang/julia#42227 (comment)

on Julia master, y .= ntoh.(y) is no longer slow when y is a ReinterpretArray

@Moelf (Member) commented Feb 12, 2022

[screenshot: benchmark showing reduced allocations after the change]

Sadly, despite the reduced allocations, it doesn't seem to improve timing by a lot in microbenchmarks.

@aminnj (Member, Author) commented Feb 13, 2022

It might be that zlib decompression + the calculation are washing out the true benchmark.
