add hkls attribute rs.DataSet #286

kmdalton · 2024-12-06T21:39:23Z

This PR makes a .hkls attribute for rs.DataSet instances. The method has a setter to assign new Miller indices. It retains backward compatibility with the get_hkls method.

The use case here is for being able to set new miller indices using an expression like

ds.hkls = hkls

where hkls can either be a dataset containing the keys 'H', 'K', and 'L' or it could be a numpy array. This saves the user having to reset and set the index.

for more information, see https://pre-commit.ci

codecov-commenter · 2024-12-06T21:58:38Z

Codecov Report

Attention: Patch coverage is 70.83333% with 7 lines in your changes missing coverage. Please review.

Project coverage is 88.78%. Comparing base (01286d8) to head (a4fb023).
Report is 1 commits behind head on main.

Files with missing lines	Patch %	Lines
reciprocalspaceship/dataset.py	70.83%	7 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #286      +/-   ##
==========================================
- Coverage   88.98%   88.78%   -0.20%     
==========================================
  Files          40       40              
  Lines        2841     2854      +13     
==========================================
+ Hits         2528     2534       +6     
- Misses        313      320       +7

Flag	Coverage Δ
unittests	`88.78% <70.83%> (-0.20%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

JBGreisman · 2024-12-18T15:02:20Z

I think the concept of this makes a lot of sense. I think having DataSet.hkls as an attribute is an intuitive accessor for this sort of info, and I agree with maintaining the get_hkls() method for compatibility. I still have to do a bit of a dive into this to see if there are any gotchas/corner cases that seem problematic.

Some questions that will be easy to answer but I'm just writing here as part of my stream of consciousness:

Does this work correctly for both range-indexed and HKL-indexed DataSets (yes! this is explicitly tested)
What about if there are extra columns in the index?
What happens if the provided HKLs do not have the same number of rows as the DataSet? Do they get NaN-padded?

From what I can tell, this all seems fine, though

JBGreisman

Just a few small comments -- overall, looks good

reciprocalspaceship/dataset.py

JBGreisman · 2024-12-18T15:09:17Z

reciprocalspaceship/dataset.py

+        return hkl
+
+    def get_hkls(self):
+        """For backwards compatibility retain the get_hkls method in addition to the dataset.hkls attribute"""


I think this docstring should be more of a method description, rather than a rationale for keeping the method

Okay, I changed it. Let me know if you are okay with the update.

JBGreisman · 2024-12-18T15:11:56Z

tests/test_dataset.py

+    if range_index:
+        ds = ds.reset_index()
+
+    hmax = 20


We have two pytest fixtures (hkls and dataset_hkl) that could probably support this test, rather than rolling your own. If the other two are insufficient, maybe we should modify them so that this can be used elsewhere as well?

i don't think these fixtures are appropriate, because the hkls in this case should be matched to the length of the dataset. it might make sense to use them to test the case where the dataset length and hkl length differ.

maybe we could match these hkls to those in data_merged and/or data_unmerged?

add hkls attribute rs.DataSet

1c1d607

kmdalton requested a review from JBGreisman December 6, 2024 21:39

[pre-commit.ci] auto fixes from pre-commit.com hooks

8082e20

for more information, see https://pre-commit.ci

JBGreisman reviewed Dec 18, 2024

View reviewed changes

kmdalton added 2 commits December 18, 2024 10:35

fix whitespace

dcafb5b

update docstring

a4fb023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add hkls attribute rs.DataSet #286

add hkls attribute rs.DataSet #286

kmdalton commented Dec 6, 2024 •

edited

Loading

codecov-commenter commented Dec 6, 2024 •

edited

Loading

JBGreisman commented Dec 18, 2024

JBGreisman left a comment

JBGreisman Dec 18, 2024

kmdalton Dec 18, 2024

JBGreisman Dec 18, 2024

kmdalton Dec 18, 2024

kmdalton Dec 18, 2024

add hkls attribute rs.DataSet #286

Are you sure you want to change the base?

add hkls attribute rs.DataSet #286

Conversation

kmdalton commented Dec 6, 2024 • edited Loading

codecov-commenter commented Dec 6, 2024 • edited Loading

Codecov Report

JBGreisman commented Dec 18, 2024

JBGreisman left a comment

Choose a reason for hiding this comment

JBGreisman Dec 18, 2024

Choose a reason for hiding this comment

kmdalton Dec 18, 2024

Choose a reason for hiding this comment

JBGreisman Dec 18, 2024

Choose a reason for hiding this comment

kmdalton Dec 18, 2024

Choose a reason for hiding this comment

kmdalton Dec 18, 2024

Choose a reason for hiding this comment

kmdalton commented Dec 6, 2024 •

edited

Loading

codecov-commenter commented Dec 6, 2024 •

edited

Loading