NXdata linking of signal #205

domna · 2024-03-28T07:15:39Z

This enables the option to describe where the signal data of an NXdata comes from. It's a similar concept to AXISNAME_depends. It's just a proposal open for discussion, and I'm not entirely sure this is the right way to do it, so I welcome your feedback @lukaspie and @rettigl. General feedback from @FAIRmat-NFDI/areab is also welcome.

Different options could be:

- Additionally, or instead, provide a depends field for the NXdata group to denote that it is linked from somewhere else. This is essentially a shortcut for writing out AXISNAME_depends + DATA_depends if they belong to the same target group.
- Allow an array or a different mechanism here to also describe composite data, e.g., if we combine two detectors and need to denote which axis comes from where. There we'd need an additional mechanism to tie the axis name to the data depends.

This is coming from a recent discussion with Anders Hahlin and Anders Frisk. They are trying to combine multiple detector readings into the top-level data (e.g., for spin-resolved measurements).

lukaspie · 2024-04-03T12:21:45Z

I like the overall idea and agree that this is something that is needed. I am also not sure what would be the best approach.

> Additionally, or instead, provide a depends field for the NXdata group to denote that it is linked from somewhere else. This is essentially a shortcut for writing out AXISNAME_depends + DATA_depends if they belong to the same target group.

In general, I have the feeling that NeXus more often uses a depends_on field or a @depends_on attribute to denote such concepts, instead of ELEMENT_depends. So maybe we could just use @depends_on attributes:

NXdata:
  \@depends_on(NX_CHAR):
    doc: |
      Points to the path of a field defining the data on which the ``DATA`` field depends.
  [...]
  AXISNAME(NX_NUMBER):
    \@depends_on(NX_CHAR):
      doc: |
        Points to the path of a field defining the axis on which the ``AXISNAME`` axis depends.

Then these attributes would be associated with the group/field anyway, so there would be no need for the uppercase notation.

In any case, I would argue it should be the same for the NXdata group and the corresponding AXISNAME fields. That is, if we keep AXISNAME_depends, we should also use DATA_depends (as @domna originally suggested). This would also be in line with having @AXISNAME_indices as an immediate child of the NXdata group.

tomio13 · 2024-04-03T12:30:06Z

I would vote for a generic @depends_on attribute, because it is a clean approach, and we do not have to specify what we mean for every element.

domna · 2024-04-05T14:34:27Z

> In any case, I would argue it should be the same for the NXdata group and the corresponding AXISNAME fields. That is, if we keep AXISNAME_depends, we should also use DATA_depends (as @domna originally suggested). This would also be in line with having @AXISNAME_indices as an immediate child of the NXdata group.

I agree, and I think we cannot fully get rid of AXISNAME_depends. If we only allow a top-level @depends, we can only reference full NXdata groups. For the axes in particular, however, we want to gather them from different transformations, i.e., explicitly not from a single NXdata group. Therefore, my suggestion is to add DATA_depends and additionally allow a top-level depends that references a whole NXdata group, i.e., a convenience for writing out all the depends if they come from the same group. Alternatively, we just omit the latter.
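To illustrate the "convenience" idea, here is a minimal Python sketch of how a top-level depends could be expanded into per-field entries. The function name `expand_depends`, its parameters, and the example paths are hypothetical and not part of the proposal; the sketch also assumes that the source group stores each field under the same name.

```python
from typing import Dict, Iterable, Optional


def expand_depends(
    field_names: Iterable[str],
    group_depends: Optional[str] = None,
    field_depends: Optional[Dict[str, str]] = None,
) -> Dict[str, str]:
    """Return one source path per field.

    group_depends: path of a whole source group (the top-level shortcut).
    field_depends: explicit per-field paths; these win over the shortcut.
    """
    explicit = dict(field_depends or {})
    resolved: Dict[str, str] = {}
    for name in field_names:
        if name in explicit:
            resolved[name] = explicit[name]
        elif group_depends is not None:
            # Assumption: the source group stores the field under the same name.
            resolved[name] = f"{group_depends.rstrip('/')}/{name}"
    return resolved


# Everything comes from one group: the shortcut alone is enough.
same_group = expand_depends(
    ["data", "energy"], group_depends="/entry/instrument/detector"
)

# One axis comes from elsewhere: an explicit per-field entry overrides the shortcut.
mixed = expand_depends(
    ["data", "energy", "angle"],
    group_depends="/entry/instrument/detector",
    field_depends={"angle": "/entry/sample/transformations/polar"},
)
```

The second call shows why the per-field entries cannot be dropped: as soon as one axis comes from a different place, the top-level entry alone can no longer express it.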

However, I was also thinking about the name _depends in general. To me it reads as if the axis values themselves depend on another field, meaning that the values somehow have to be stacked on top of the other field. What we actually do is reference/link something.

rettigl · 2024-04-08T13:28:57Z

> I agree and I think we cannot fully get rid of the AXISNAME_depends.

I think with Lukas' suggestion to just make these attributes of the AXISNAME and DATA fields, we can get rid of them. I would suggest not re-using @depends_on, because as far as I know it is reserved for NXtransformations. We could call it, e.g., @reference or @source.

domna · 2024-04-08T14:42:58Z

> > I agree and I think we cannot fully get rid of the AXISNAME_depends.
>
> I think with Lukas' suggestion to just make these attributes of the AXISNAME and DATA fields, we can get rid of them. I would suggest not re-using @depends_on, because as far as I know it is reserved for NXtransformations. We could call it, e.g., @reference or @source.

Yes, I just meant that we cannot remove the depends associated with AXISNAME and DATA (whether as _depends fields or as attributes) and only use the top-level depends.

I like @reference. So the proposal based on Lukas' suggestion would be:

NXdata:
  \@reference(NX_CHAR):
    doc: |
      Points to the path of a field defining the data to which this NXdata group refers.
  [...]
  AXISNAME(NX_NUMBER):
    \@reference(NX_CHAR):
      doc: |
        Points to the path of a field defining the axis to which this ``AXISNAME`` axis refers.
  DATA(NX_NUMBER):
    \@reference(NX_CHAR):
      doc: |
        Points to the path of a field defining the data to which the ``DATA`` field refers.

We could also propose @reference as a general concept for whenever an array is taken or linked from somewhere else, as we do this quite a few times in the appdefs. That way users always know where the data comes from, since HDF5 linking is not really visible when viewing a file.
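As a rough illustration of how a reader could use such attributes, here is a minimal Python sketch that collects the `reference` attribute of every field in an NXdata group and follows it to the source. To stay self-contained it models the file with two small stand-in classes (`Dataset` and `Group`, both hypothetical) instead of a real HDF5 library, and the paths in the example are made up.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional, Union


@dataclass
class Dataset:
    """Stand-in for an HDF5 dataset: values plus attributes."""
    values: list
    attrs: Dict[str, str] = field(default_factory=dict)


@dataclass
class Group:
    """Stand-in for an HDF5 group: named children plus attributes."""
    children: Dict[str, Union["Group", Dataset]] = field(default_factory=dict)
    attrs: Dict[str, str] = field(default_factory=dict)


def lookup(root: Group, path: str) -> Union[Group, Dataset]:
    """Walk an absolute path such as '/entry/instrument/detector/data'."""
    node: Union[Group, Dataset] = root
    for part in path.strip("/").split("/"):
        if not isinstance(node, Group) or part not in node.children:
            raise KeyError(f"no object at {path!r}")
        node = node.children[part]
    return node


def referenced_paths(nxdata: Group) -> Dict[str, Optional[str]]:
    """Map each field of an NXdata group to the path in its `reference` attribute."""
    return {
        name: child.attrs.get("reference")
        for name, child in nxdata.children.items()
        if isinstance(child, Dataset)
    }


# Hypothetical layout: the NXdata fields were taken from a detector group.
detector = Group(children={
    "data": Dataset([1, 2, 3]),
    "energy": Dataset([0.1, 0.2, 0.3]),
})
nxdata = Group(children={
    "data": Dataset([1, 2, 3], attrs={"reference": "/entry/instrument/detector/data"}),
    "energy": Dataset([0.1, 0.2, 0.3], attrs={"reference": "/entry/instrument/detector/energy"}),
})
root = Group(children={"entry": Group(children={
    "instrument": Group(children={"detector": detector}),
    "data": nxdata,
})})

paths = referenced_paths(lookup(root, "/entry/data"))
source = lookup(root, paths["data"])
```

Because the attribute is plain text, the origin of each field stays visible in any viewer, even when the underlying link itself is not shown.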

mkuehbach · 2024-04-09T07:25:05Z

I agree that it makes sense to have an explicit way to define that a concept A is connected to another concept B, in addition to the explicit parent/child relationships that come with the concept tree defined by a base class or appdef.

My argument against using @depends_on (although conceptually it makes sense) has always been that the symbol "depends_on" should be reserved for NXtransformations. In this spirit I support "reference". I can also see an argument for "refers_to" or, and this I would find better: why use only one symbol at all?
The point is that the more of these explicit decorations we add (which I am not against), the closer we get to an implementation of how one would make statements in, e.g., RDF. So why not also use, e.g., "has_a", "is_a", "is_equivalent_to" as @has_a-style symbols in NeXus? These could then also be used in the NeXusOntology.
The only reason I am hesitant to go down this route is that we could alternatively formulate these references and additional connections between concepts directly with accepted semantic web technology, so we would be watering down our message.

tomio13 · 2024-04-09T12:20:56Z

One thing I am not sure I understand: why is @depends_on reserved for only one base class? My point was: why not use it generally when the meaning is quite clear?
As long as we express that the current element needs the definition of a previous one, I think it is a clear definition.
I am also fine with using another synonym, but the point I am pressing here is: pick one, and make it usable in all cases. Perhaps this could imply a class binding as well, that is, an NXdata refers back to data only (numeric array), an NXtransformations to another NXtransformations, etc.
I find defining many synonyms with all their special cases confusing and difficult to keep up with (I know, I am not clever enough for that...)
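A minimal Python sketch of what such a class binding could look like as a check, assuming a single link attribute and a small rule table. The table `ALLOWED_TARGET`, the kind labels, and the example paths are all hypothetical and only illustrate the idea.

```python
from typing import Dict, List

# Hypothetical rule table: kind of the referring object -> kind it may point to.
ALLOWED_TARGET: Dict[str, str] = {
    "NXdata field": "numeric array",
    "NXtransformations": "NXtransformations",
}


def binding_errors(links: Dict[str, str], kinds: Dict[str, str]) -> List[str]:
    """Return one message per link whose target kind breaks the rule table.

    links: path of the referring object -> path of its target.
    kinds: path -> kind label for every object involved.
    """
    errors = []
    for source, target in links.items():
        expected = ALLOWED_TARGET.get(kinds.get(source, ""))
        actual = kinds.get(target)
        if expected is None:
            errors.append(f"{source}: no rule for this kind of object")
        elif actual != expected:
            errors.append(f"{source}: expected {expected}, got {actual}")
    return errors


kinds = {
    "/entry/data/data": "NXdata field",
    "/entry/instrument/detector/data": "numeric array",
    "/entry/sample/transformations": "NXtransformations",
}
good = binding_errors({"/entry/data/data": "/entry/instrument/detector/data"}, kinds)
bad = binding_errors({"/entry/data/data": "/entry/sample/transformations"}, kinds)
```

With one attribute name and one rule table, new cases only need a new table entry rather than a new synonym.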

domna · 2024-04-09T15:07:43Z

> The point is that the more of these explicit decorations we add (which I am not against), the closer we get to an implementation of how one would make statements in, e.g., RDF. So why not also use, e.g., "has_a", "is_a", "is_equivalent_to" as @has_a-style symbols in NeXus? These could then also be used in the NeXusOntology.

I agree with you that these terms are very useful for our description work.
However, in this case I would say that the concept of @reference is slightly different. It basically refers to a dataset from somewhere else when I create an entity of data. Something like same_as feels similar but, for me, lies more on the conceptual level, e.g., I want to say that NXsample/temperature is the same as NXmanipulator/sample_holder/temperature and we just create synonyms for the same data. This is always fixed at the appdef level. In contrast, the reference is a dynamic link, which can be especially useful in the NXdata case where we leave it up to the users to construct the data themselves.

> One thing I am not sure I understand: why is @depends_on reserved for only one base class? My point was: why not use it generally when the meaning is quite clear?

I agree with you, and I think that @depends_on is also useful in other contexts. However, here I think the term "depends" (in whatever form) shouldn't be used in the first place, as these values don't depend on, i.e., stack on, the previous one. Rather, they reference the value from somewhere else.

rettigl

LGTM. We should, however, also change the respective appdefs that use these concepts (NXmpes, NXmpes_arpes, NXxps (?)).

base_classes/nyaml/NXdata.yaml

lukaspie · 2024-07-03T10:34:55Z

> LGTM. We should, however, also change the respective appdefs that use these concepts (NXmpes, NXmpes_arpes, NXxps (?)).

I rebased this branch and then used the @reference attribute in NXmpes and NXmpes_arpes. For NXxps, I would handle it in #249.

IMO, we can merge this PR now.

rettigl

Apart from minor comments, LGTM. The respective example config files also need to be adapted (and maybe the readers/converters).

contributed_definitions/nyaml/NXmpes.yaml

addressed already

* Add depends for data field
* Use `@reference`
* use reference attribute in NXmpes and NXmpes_arpes
* change text style
* fix duplicated in reference attribute

---------

Co-authored-by: Lukas Pielsticker <50139597+lukaspie@users.noreply.github.com>

# Conflicts:
#   base_classes/nyaml/NXdata.yaml
#   contributed_definitions/NXdata_mpes.nxdl.xml
#   contributed_definitions/NXmpes.nxdl.xml
#   contributed_definitions/NXmpes_arpes.nxdl.xml
#   contributed_definitions/nyaml/NXdata_mpes.yaml
#   contributed_definitions/nyaml/NXmpes.yaml
#   contributed_definitions/nyaml/NXmpes_arpes.yaml

domna requested review from rettigl and lukaspie March 28, 2024 07:15

rettigl approved these changes Jun 13, 2024

sanbrock previously requested changes Jun 18, 2024

base_classes/nyaml/NXdata.yaml

lukaspie force-pushed the nxdata-depends branch from e160e8d to 15b6dd5 Compare July 3, 2024 09:57

lukaspie approved these changes Jul 3, 2024

rettigl approved these changes Jul 4, 2024

contributed_definitions/nyaml/NXmpes.yaml

rettigl mentioned this pull request Jul 13, 2024

Update example to use NXmpes_arpes OpenCOMPES/specsanalyzer#44

Merged

domna and others added 5 commits July 15, 2024 17:41

Add depends for data field

ca6cbc9

Use @reference

8b419b3

use reference attribute in NXmpes and NXmpes_arpes

2d0fcd7

change text style

653dd26

fix duplicated in reference attribute

6db20ea

domna force-pushed the nxdata-depends branch from b04ce84 to 6db20ea Compare July 15, 2024 15:42

domna merged commit a60bac2 into fairmat Jul 15, 2024
6 checks passed

domna deleted the nxdata-depends branch July 15, 2024 15:49
