-
-
Notifications
You must be signed in to change notification settings - Fork 1.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add drop_duplicates for dims #5239
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Actually -- there is still one unresolved design concern here.
If you drop duplicates over multiple variables at once, what dimensions should the result have? In particular -- should it have the original dimensions, or should all dimensions involved be combined into one?
The latter might seem a little crazy now, but would make more sense once we allow dropping over multi-dimensional variables.
If we really want to avoid any possible controversy here, it might be best to stick to only supporting one variable for now, too.
The docs are failing with:
|
For me this case supports consistent behavior over any number of dims! Which would mean rejecting the "stack only if you supply exactly ndims" proposal. Completely fine with restricting to one dim for the moment and seeing how that goes (it's always possible to pass Were we going with |
Co-authored-by: keewis <keewis@users.noreply.github.com>
the docs build can probably be fixed by adding newlines before the section headers ("Parameters" / "Returns") |
@ahuang11 we discussed this on the core team call. People are excited to merge this, and appreciate you bearing with the changes. I suggested that we narrow this even further to one dimension and merge, so we can benefit from this now and consider additions from there. Would that be OK with you? |
Sure.
What's the reasoning for a single dimension?
…On Wed, May 12, 2021, 11:34 AM Maximilian Roos ***@***.***> wrote:
@ahuang11 <https://github.com/ahuang11> we discussed this on the core
team call. People are excited to merge this, and appreciate you bearing
with the changes.
I suggested that we narrow this even further to one dimension and merge,
so we can benefit from this now and consider additions from there. Would
that be OK with you?
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#5239 (comment)>, or
unsubscribe
<https://github.com/notifications/unsubscribe-auth/ADU7FFSS2IQ5QP7BYG4OCWLTNKUXXANCNFSM4353OVNA>
.
|
@shoyer could I confirm that limiting to a single dimension satisfies this? You say "multiple variables" here. I mentioned a single dimension on the call but maybe wasn't clear.
|
Dont understand this
|
Co-authored-by: Maximilian Roos <5635139+max-sixty@users.noreply.github.com>
Co-authored-by: Maximilian Roos <5635139+max-sixty@users.noreply.github.com>
Co-authored-by: Maximilian Roos <5635139+max-sixty@users.noreply.github.com>
* upstream/master: Update release guide (pydata#5274) combine keep_attrs and combine_attrs in apply_ufunc (pydata#5041) Explained what a deprecation cycle is (pydata#5289) Code cleanup (pydata#5234) FacetGrid docstrings (pydata#5293) Add whats new for dataset interpolation with non-numerics (pydata#5297) Allow dataset interpolation with different datatypes (pydata#5008)
Ruined #5089 with reverting so remaking the PR for just dims
pre-commit run --all-files
whats-new.rst
api.rst