improve to_zarr doc about chunking #4048

apatlpo · 2020-05-08T16:43:09Z

follows automatic chunking of zarr archive #4046
Passes isort -rc . && black . && mypy . && flake8

I'm not sure the last point is really necessary for this PR, is it?

pep8speaks · 2020-05-08T16:43:13Z

Hello @apatlpo! Thanks for updating this PR. We checked the lines you've touched for PEP 8 issues, and found:

There are currently no PEP 8 issues detected in this Pull Request. Cheers! 🍻

Comment last updated at 2020-05-20 13:37:26 UTC

shoyer · 2020-05-10T07:16:42Z

xarray/core/dataset.py

+        zarr may automatically chunk a DataArray if it is not chunked or
+        if `chunks` is not set to -1 in its `enconding` attribute or in the
+        argument of the `encoding` parameter.


Can we instead state what does happen if a variable is already chunked or chunks can be found in encoding?

Something like

If chunks are found in the encoding argument or attribute corresponding to any DataArray, those chunks are used.

Otherwise, if the DataArray is already a dask array, it is written with those chunks.

Finally, if not other chunks are found, Zarr uses its own heuristics to choose automatic chunk sizes.

I closely followed your suggestion.
Note that I am not sure about the docstring style for a list-like note like this one.

apatlpo · 2020-05-15T18:38:20Z

if anybody has a clue on how to fix tests this would be welcome.
thx

rabernat · 2020-05-15T18:45:57Z

There are two sphinx warnings in building the docs
https://dev.azure.com/xarray/xarray/_build/results?buildId=2819&view=logs&j=7e620c85-24a8-5ffa-8b1f-642bc9b1fc36&t=68484831-0a19-5145-bfe9-6309e5f7691d&l=280

xarray/core/dataset.py:docstring of xarray.Dataset.to_zarr:44: WARNING: Definition list ends without a blank line; unexpected unindent.
xarray/core/dataset.py:docstring of xarray.Dataset.to_zarr:46: WARNING: Unexpected indentation.

These seem related to your changes. I would try to get the docs building locally without any warnings.

keewis · 2020-05-15T18:58:43Z

the flake8 issues are fixed in #4057, so you can merge master into your branch to fix them in this branch, too.

apatlpo · 2020-05-20T13:07:25Z

ok, tests are passing.

keewis · 2020-05-20T13:25:24Z

they do, but could you remove the saved_on_disk.nc file that was accidentally added?

Also, the items are not a list despite being formatted like one:

is that intentional?

apatlpo · 2020-05-20T13:39:25Z

argh ... saved_on_disk.nc has been removed.

I was unfortunately not able to create a proper list without breaking tests and have exhausted my time trying to figure it out.
If you know how to do that I am more than happy to follow your advice.

shoyer · 2020-05-20T18:55:08Z

I think it's OK that it's not formatted as a list. It's still pretty readable

shoyer · 2020-05-20T18:55:38Z

thanks @apatlpo !

Update dataset.py

7e3d301

attempt at improving the doc formulation

5c376ca

shoyer reviewed May 10, 2020

View reviewed changes

Aurélien Ponte added 2 commits May 12, 2020 20:40

update to_zarr docstring

8e30027

minor style update

ffd2f57

Aurélien Ponte added 2 commits May 20, 2020 06:53

Merge branch 'master' into to_zarr_doc

3e5ad9f

seems to fix doc compilation locally

f94de3b

apatlpo force-pushed the to_zarr_doc branch from 4ebdc1a to f94de3b Compare May 20, 2020 12:29

delete saved_on_disk.nc

271fc37

shoyer merged commit 484d1ce into pydata:master May 20, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

improve to_zarr doc about chunking #4048

improve to_zarr doc about chunking #4048

apatlpo commented May 8, 2020 •

edited

Loading

pep8speaks commented May 8, 2020 •

edited

Loading

shoyer May 10, 2020

apatlpo May 12, 2020

apatlpo commented May 15, 2020

rabernat commented May 15, 2020

keewis commented May 15, 2020 •

edited

Loading

apatlpo commented May 20, 2020

keewis commented May 20, 2020

apatlpo commented May 20, 2020

shoyer commented May 20, 2020

shoyer commented May 20, 2020

improve to_zarr doc about chunking #4048

improve to_zarr doc about chunking #4048

Conversation

apatlpo commented May 8, 2020 • edited Loading

pep8speaks commented May 8, 2020 • edited Loading

Comment last updated at 2020-05-20 13:37:26 UTC

shoyer May 10, 2020

Choose a reason for hiding this comment

apatlpo May 12, 2020

Choose a reason for hiding this comment

apatlpo commented May 15, 2020

rabernat commented May 15, 2020

keewis commented May 15, 2020 • edited Loading

apatlpo commented May 20, 2020

keewis commented May 20, 2020

apatlpo commented May 20, 2020

shoyer commented May 20, 2020

shoyer commented May 20, 2020

apatlpo commented May 8, 2020 •

edited

Loading

pep8speaks commented May 8, 2020 •

edited

Loading

keewis commented May 15, 2020 •

edited

Loading