
Remove debugging slow assert statement #7221

Merged · 8 commits merged into pydata:main on Oct 28, 2022

Conversation

@hmaarrfk
Contributor

We've been trying to understand why our code is slow. One part of the problem is that we use xarray.Datasets almost like dictionaries for our data. The following pattern is quite common for us:

import xarray as xr
dataset = xr.Dataset()
dataset['a'] = 1
dataset['b'] = 2

However, benchmarks made it obvious that xarray's merge_core method was causing a lot of the slowdown.
main branch:
[plot: time to add one variable (ms) vs. number of existing variables, main branch]

With this merge request:
[plot: the same benchmark with this merge request]

from tqdm import tqdm
import xarray as xr
from time import perf_counter
import numpy as np

N = 1000

# Everybody is lazy loading now, so let's force modules to get instantiated
dummy_dataset = xr.Dataset()
dummy_dataset['a'] = 1
dummy_dataset['b'] = 1
del dummy_dataset

time_elapsed = np.zeros(N)
dataset = xr.Dataset()

for i in tqdm(range(N)):
    time_start = perf_counter()
    dataset[f"var{i}"] = i
    time_end = perf_counter()
    time_elapsed[i] = time_end - time_start
    
    
# %%
from matplotlib import pyplot as plt

plt.plot(np.arange(N), time_elapsed * 1E3, label='Time to add one variable')
plt.xlabel("Number of existing variables")
plt.ylabel("Time to add a variable (ms)")
plt.ylim([0, 50])
plt.grid(True)
plt.legend()
  • Closes #xxxx
  • Tests added
  • User visible changes (including notable bug fixes) are documented in whats-new.rst
  • New functions/methods are listed in api.rst

@max-sixty
Collaborator

Gosh, that's quite dramatic! Impressive find, @hmaarrfk. (Out of interest, how did you find this?)

I can see how that's quadratic when looping like that. I wonder whether using .assign(var1=1, var2=2, ...) has the same behavior?
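A minimal sketch of that comparison (a hypothetical harness, not from this PR; the assumption is that building the mapping up front triggers a single merge instead of one merge per variable):

import xarray as xr
from time import perf_counter

N = 1000

# One .assign call with a pre-built mapping: a single merge for N variables,
# instead of N merges when assigning keys one at a time.
t0 = perf_counter()
dataset = xr.Dataset().assign({f"var{i}": i for i in range(N)})
print(f".assign of {N} variables took {perf_counter() - t0:.3f}s")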

Would be interesting to see whether this was covered by our existing asv benchmarks. Would be a good benchmark to add if we don't have one already.

@hmaarrfk
Contributor Author

Out of interest, how did you find this?

Spyder profiler

github-actions bot added the run-benchmark (run the ASV benchmark workflow) and topic-performance labels Oct 26, 2022
@hmaarrfk
Contributor Author

Would be interesting to see whether this was covered by our existing asv benchmarks.

I wasn't able to find something that really benchmarked "large" datasets.

Would be a good benchmark to add if we don't have one already.

Added one.
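A rough sketch of what such a benchmark can look like (approximate and hypothetical, not the exact code added here; the benchmark that landed is merge.DatasetAddVariable.time_variable_insertion, visible in the results further down):

import xarray as xr

class DatasetAddVariable:
    # parametrize over the number of variables already in the dataset
    param_names = ["existing_elements"]
    params = [[0, 10, 100, 1000]]

    def setup(self, existing_elements):
        # build a dataset that already holds `existing_elements` variables
        self.dataset = xr.Dataset({f"var{i}": i for i in range(existing_elements)})

    def time_variable_insertion(self, existing_elements):
        # the operation under test: adding one more variable
        self.dataset["new_var"] = 0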

@hmaarrfk
Contributor Author

:/ Not fun, the benchmark is failing. Not sure why.

@hmaarrfk
Contributor Author

I'm somewhat confused; I can run the benchmark locally:

[  1.80%] ··· dataset_creation.Creation.time_dataset_creation                                                    4.37±0s

@Illviljan
Contributor

Error: [ 75.90%] ··· dataset_creation.Creation.time_dataset_creation             failed
[ 75.90%] ···· asv: benchmark timed out (timeout 60.0s)

Maybe 1000 loops is too much. Start with 100 maybe? We still want these benchmarks to be decently fast in the CI.
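For reference, asv also lets a benchmark raise its own time limit via a class attribute, so keeping N=1000 is possible if the CI budget allows it (a sketch assuming the usual asv conventions):

class Creation:
    # per-benchmark timeout in seconds; asv's default is 60
    timeout = 120

    def time_dataset_creation(self):
        ...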

@Illviljan
Contributor

Illviljan commented Oct 26, 2022

I like large datasets as well. I seem to remember getting caught in similar places when creating my datasets. I think I solved it by using Variable instead; does doing something like this improve the performance for you?

import xarray as xr
dataset = xr.Dataset()
dataset['a'] = xr.Variable(dims="time", data=[1])
dataset['b'] = xr.Variable(dims="time", data=[2])
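A quick way to check both variants side by side (a hypothetical timing harness, names mine):

import xarray as xr
from time import perf_counter

def time_inserts(make_value, n=1000):
    # time n insertions into a fresh Dataset, building each value with make_value
    ds = xr.Dataset()
    t0 = perf_counter()
    for i in range(n):
        ds[f"var{i}"] = make_value(i)
    return perf_counter() - t0

print("scalar:  ", time_inserts(lambda i: i))
print("Variable:", time_inserts(lambda i: xr.Variable(dims="time", data=[i])))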

@Illviljan
Contributor

Now the asv finishes, at least! Could you make a separate PR for the asv? I don't think it gets run when comparing against the main branch.

@hmaarrfk
Contributor Author

Ok, I'll want to rethink them.

I know it looks like quadratic time, but I really would like to test N=1000, and I have an idea.

@hmaarrfk mentioned this pull request Oct 26, 2022
@hmaarrfk
Contributor Author

hmaarrfk commented Oct 26, 2022

I know it is not directly comparable, but I was really curious what plain dictionary insertion costs, to understand whether my comparison was fair:

from tqdm import tqdm
import xarray as xr
from time import perf_counter
import numpy as np

N = 1000

# Everybody is lazy loading now, so let's force modules to get instantiated
dummy_dataset = xr.Dataset()
dummy_dataset['a'] = 1
dummy_dataset['b'] = 1
del dummy_dataset

time_elapsed = np.zeros(N)
# dataset = xr.Dataset()
dataset = {}

for i in tqdm(range(N)):
# for i in range(N):
    time_start = perf_counter()
    dataset[f"var{i}"] = i
    time_end = perf_counter()
    time_elapsed[i] = time_end - time_start
    
    
# %%
from matplotlib import pyplot as plt

plt.plot(np.arange(N), time_elapsed * 1E6, label='Time to add one variable')
plt.xlabel("Number of existing variables")
plt.ylabel("Time to add a variable (µs)")
plt.ylim([0, 10])
plt.title("Dictionary insertion")
plt.grid(True)
plt.legend()

[plot: "Dictionary insertion", time to add one variable (µs) vs. number of existing keys]

I think xarray gives me three orders of magnitude of "thinking" benefit, so I'll take it!

python --version
Python 3.9.13

@dcherian requested a review from benbovy on October 27, 2022
github-actions bot removed the run-benchmark and topic-performance labels Oct 27, 2022
@Illviljan
Contributor

       before           after         ratio
     [c000690c]       [24753f1f]
-     3.17±0.02ms      1.94±0.01ms     0.61  merge.DatasetAddVariable.time_variable_insertion(100)
-        81.5±2ms       17.0±0.2ms     0.21  merge.DatasetAddVariable.time_variable_insertion(1000)

SOME BENCHMARKS HAVE CHANGED SIGNIFICANTLY.
PERFORMANCE INCREASED.

Nice improvements. :)

I haven't fully understood why we had that code, though.

@benbovy
Member

benbovy commented Oct 27, 2022

Thanks @hmaarrfk!

I haven't fully understood why we had that code, though.

Me neither. I don't remember ever seeing this assertion error raised while refactoring things. Any idea @shoyer?

@max-sixty added the plan to merge (final call for comments) label Oct 27, 2022
@shoyer
Member

shoyer commented Oct 28, 2022

I no longer remember why I added these checks, but I certainly did not expect to see this sort of performance penalty!
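For intuition (an illustrative sketch, not the actual assertion this PR removed): a sanity check that scans every existing variable on each insertion turns N insertions into O(N^2) total work, which matches the quadratic curve in the benchmark above.

# Illustrative only: a hypothetical O(n) debug assert inside an insertion
# path, not the code removed by this PR.
def insert(store, key, value):
    store[key] = value  # the insertion itself is O(1)
    # the sanity check walks all existing entries: O(n) per call,
    # so N insertions cost O(N**2) overall
    assert all(isinstance(k, str) for k in store), "non-string key"

store = {}
for i in range(1000):
    insert(store, f"var{i}", i)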

@max-sixty merged commit 040816a into pydata:main on Oct 28, 2022