Allow user to store non-openPMD information #115

RemiLehe · 2015-12-04T18:22:54Z

Today I had a request from a prospective user of openPMD: can the user store other datasets in the HDF5-openPMD files, which are not meant to be read by the openPMD parser / viewer ? (for instance because they don't fall either in the category mesh or or in the category particle)

From my point of view, the best way to do this is to store this data outside of the basePath (in this case the parser will not try to read it).

@ax3l : Does that make sense ? If yes, I think it would be good to add it as a side-remark in the standard.

The text was updated successfully, but these errors were encountered:

ax3l · 2015-12-04T21:12:31Z

Absolutely, they can store it everywhere as long as it does not collide with reserved attributes. The standard is not exclusive and only defines a minimal required markup :) That is also the way how new extensions (might) develop: from best practices of existing additional attributes & records. Regarding other data sets (records): if they are stored in the base path + particle/mesh path they need to fulfill the requirements of those locations. Else just add them to an other non-reserved location inside the base path and it will be fine since we don't search outside for standardized formatting.

ax3l · 2015-12-05T21:13:37Z

Does that make sense ? If yes, I think it would be good to add it as a side-remark in the standard.

Make absolutely sense and good point and we should make this more clear in the standard.
I marked it as a revision change (patch level) which means it can be added in e.g., a 1.0.1 release (as you already marked!).

RemiLehe · 2015-12-07T18:11:17Z

Ok, great! I'll do a corresponding PR in the next few days.

ax3l · 2015-12-10T12:40:20Z

An other and additional/orthogonal approach to allow non-openPMD information inside basePath too: if one wants to avoid parsing of additional records, in detail

directories
data sets

(since additional attributes do not harm the parsers), we could also provide a prefix that is ignored by the parsers. Lets say their names must start with "+".
But maybe that is messy.

Would allow to experiment with new additional information, e.g., irregular mesh geometries, inside basePath.

Currently, only attributes can be freely added at any place (well, it's recommended to name them comment). Groups and data sets are restricted inside those paths.

Close #115: Allow user to store non-openPMD information

ax3l · 2017-11-24T14:07:27Z

implemented in 1.0.1

DavidSagan · 2019-03-03T22:20:31Z

@ax3l

Currently, only attributes can be freely added at any place (well, it's recommended to name them comment). Groups and data sets are restricted inside those paths.

I would propose that that it should be allowed for groups and data sets to be freely added. I can imagine situations where, for example, a person wants to add per-particle data and the restriction that this has to be put outside of the group that holds the particle data makes things very messy. Certainly we could mandate that such added groups or data sets be marked as extra. For example using a "+" prefix as you suggested. I think this is a fairly clean solution.

franzpoeschel · 2022-09-21T17:07:18Z

Alternative (and maybe more radical) suggestion:

Allow custom group hierarchies with custom datasets and custom attributes inside every iteration
Treat meshes and particles as keywords, reserved to openPMD (to be more precise: whatever is defined in meshesPath and particlesPath)
Inside these paths, the typical openPMD hierarchy applies, and all data should follow strictly the openPMD standard

The fundamental idea would be that an openPMD dataset cannot only (1) be augmented by custom hierarchies (i.e. have the classical openPMD hierarchy, and other stuff around it that the API ignores), but instead that (2) an openPMD is a custom hierarchy with the classical openPMD structure embedded into it at any place.
Instead of ignoring custom hierarchies, openPMD could then benefit from and interact with them.

Example:
Mesh refinement currently works via the naming of the meshes. Alternatively, one could do:

/data/0/refined_mesh_levels/0/meshes/E
/data/0/refined_mesh_levels/0/meshes/B
/data/0/refined_mesh_levels/1/meshes/E
/data/0/refined_mesh_levels/1/meshes/B
/data/0/refined_mesh_levels/2/meshes/E
/data/0/refined_mesh_levels/2/meshes/B
+++++++ ––––––––––––––––––––– ++++++++
standard        custom        standard

/data/0/simulation_internal/some_checkpointing_info
+++++++ –––––––––––––––––––––––––––––––––––––––––––
standard                  custom

Codes such as for example PIConGPU can put their internal datasets (e.g. PIConGPU_id_provider) anywhere in that hierarchy, and it would be ignored instead of cluttering the openPMD dataset.

Ideally, if done correctly, this would mean that a single dataset can use several standards at the same time, such as mixing Nexus with openPMD.

Downside: No huge change for the standard, but a rather large change for implementations. Readers would need to be updated to find openPMD structures throughout the datasets.

ax3l · 2022-10-12T18:04:18Z

That sounds useful and would be equivalent to relaxing meshes path from values like meshesPath="meshes/" to regexes such as meshesPath=".*meshes/ (or the hard-coding of this exact regex in the standard).

I am not sure if we will not need an "exclude this from parsing" nonetheless via an attribute on custom groups/variables - without it we would keep things definitely fully separate besides sharing an iteration/snapshot id (if that works then that is fine).

franzpoeschel · 2023-04-24T17:16:52Z

For the HELPMI project, I drew up some visualizations of the proposed addition.

openPMD currently:

Proposed extension:

That sounds useful and would be equivalent to relaxing meshes path from values like meshesPath="meshes/" to regexes such as meshesPath=".*meshes/ (or the hard-coding of this exact regex in the standard).

Using a regex is one of the options, yes. Another (more restricted) option would be a list of paths.
Even though it's redundant, I would even suggest a list of patterns, as that is a common workflow in file managing software?

I am not sure if we will not need an "exclude this from parsing" nonetheless via an attribute on custom groups/variables - without it we would keep things definitely fully separate besides sharing an iteration/snapshot id (if that works then that is fine).

Using exclude patterns is a common enough pattern in a lot of software (rsync, git ignore, backup software, …), so, I'm fine with using that.
I don't understand what you mean by "without it we would keep things definitely fully separate besides sharing an iteration/snapshot id"?

ax3l · 2023-04-25T18:06:31Z

Sounds great. Designing as lists of patterns/paths is a good idea.

The last comment was simply: yes, I think we need an exclude pattern, too (as in rsync, git ignore, backup software, ...).

franzpoeschel · 2023-06-05T17:28:35Z

Real-life WIP example from PIConGPU: Checkpointing information is stored under picongpu_internal/, the RNGProvider is a field inside that group (normal openPMD markup), idProvider contains two non-openPMD datasets.

  float     /data/1000/fields/B/x                                      {64, 64, 64}                                                                                                                                                          
  float     /data/1000/fields/B/y                                      {64, 64, 64}                                                                                                                                                          
  float     /data/1000/fields/B/z                                      {64, 64, 64}                                                                                                                                                          
  float     /data/1000/fields/Convolutional PML B/xy                   {1, 1, 198144}                                                                                                                                                        
  float     /data/1000/fields/Convolutional PML B/xz                   {1, 1, 198144}                                                                                                                                                        
  float     /data/1000/fields/Convolutional PML B/yx                   {1, 1, 198144}                                                                                                                                                        
  float     /data/1000/fields/Convolutional PML B/yz                   {1, 1, 198144}                                                                                                                                                        
  float     /data/1000/fields/Convolutional PML B/zx                   {1, 1, 198144}                                                                                                                                                        
  float     /data/1000/fields/Convolutional PML B/zy                   {1, 1, 198144}                                                                                                                                                        
  float     /data/1000/fields/Convolutional PML E/xy                   {1, 1, 198144}                                                                                                                                                        
  float     /data/1000/fields/Convolutional PML E/xz                   {1, 1, 198144}                                                                                                                                                        
  float     /data/1000/fields/Convolutional PML E/yx                   {1, 1, 198144}                                 
  float     /data/1000/fields/Convolutional PML E/yz                   {1, 1, 198144}                                 
  float     /data/1000/fields/Convolutional PML E/zx                   {1, 1, 198144}                                 
  float     /data/1000/fields/Convolutional PML E/zy                   {1, 1, 198144}                                 
  float     /data/1000/fields/E/x                                      {64, 64, 64}                                   
  float     /data/1000/fields/E/y                                      {64, 64, 64}                                   
  float     /data/1000/fields/E/z                                      {64, 64, 64}                                   
  float     /data/1000/particles/e/momentum/x                          {55401}                                        
  float     /data/1000/particles/e/momentum/y                          {55401}                                        
  float     /data/1000/particles/e/momentum/z                          {55401}                                        
  uint64_t  /data/1000/particles/e/particlePatches/extent/x            {1}                                            
  uint64_t  /data/1000/particles/e/particlePatches/extent/y            {1}                                            
  uint64_t  /data/1000/particles/e/particlePatches/extent/z            {1}                                            
  uint64_t  /data/1000/particles/e/particlePatches/numParticles        {1}                                            
  uint64_t  /data/1000/particles/e/particlePatches/numParticlesOffset  {1}                                            
  uint64_t  /data/1000/particles/e/particlePatches/offset/x            {1}                                            
  uint64_t  /data/1000/particles/e/particlePatches/offset/y            {1}                                            
  uint64_t  /data/1000/particles/e/particlePatches/offset/z            {1}                                            
  float     /data/1000/particles/e/position/x                          {55401}                                        
  float     /data/1000/particles/e/position/y                          {55401}                                        
  float     /data/1000/particles/e/position/z                          {55401}                                        
  int32_t   /data/1000/particles/e/positionOffset/x                    {55401}                                        
  int32_t   /data/1000/particles/e/positionOffset/y                    {55401}                                        
  int32_t   /data/1000/particles/e/positionOffset/z                    {55401}                                        
  float     /data/1000/particles/e/weighting                           {55401}                                        
  char      /data/1000/picongpu_internal/fields/RNGProvider3XorMin     {64, 64, 1536}                                 
  uint64_t  /data/1000/picongpu_internal/idProvider/nextId             {1, 1, 1}                                      
  uint64_t  /data/1000/picongpu_internal/idProvider/startId            {1, 1, 1}

RemiLehe added the enhancement label Dec 4, 2015

RemiLehe added this to the 1.0.1: Typo and Wording Changes milestone Dec 4, 2015

ax3l added question revision change backwards-compatible, stylistic change (e.g. typos) and removed enhancement labels Dec 5, 2015

RemiLehe mentioned this issue Dec 10, 2015

Close #115: Allow user to store non-openPMD information #117

Merged

ax3l self-assigned this Dec 10, 2015

ax3l added a commit that referenced this issue Dec 10, 2015

Merge pull request #117 from RemiLehe/non-openPMD-data

bec368b

Close #115: Allow user to store non-openPMD information

ax3l removed this from the 1.0.1: Typo and Wording Changes milestone Nov 24, 2017

ax3l added this to the 1.0: First Major Release milestone Nov 24, 2017

ax3l closed this as completed Nov 24, 2017

ax3l mentioned this issue Dec 1, 2017

Release: 1.0.1 #156

Merged

ax3l mentioned this issue Dec 11, 2017

Error checking of attribute, etc names. #164

Open

ax3l mentioned this issue Mar 3, 2019

Proposed: Non-standard program specific information files #205

Open

DavidSagan reopened this Mar 3, 2019

DavidSagan modified the milestones: openPMD 1.X, openPMD 2.X Mar 15, 2019

DavidSagan modified the milestones: openPMD 2.X, openPMD 1.X Mar 15, 2019

This was referenced Dec 2, 2019

ADIOS Particles are not fully openPMD-compatible ComputationalRadiationPhysics/picongpu#3119

Closed

Relaxed mode: warn and ignore invalid? openPMD/openPMD-api#620

Open

franzpoeschel mentioned this issue Nov 15, 2022

Roadmap: Wishlist of new features openPMD/openPMD-api#1332

Open

12 tasks

franzpoeschel mentioned this issue May 3, 2023

Custom Hierarchies openPMD/openPMD-api#1432

Open

12 tasks

franzpoeschel mentioned this issue Jun 28, 2023

Ext: MeshRefinement #252

Open

11 tasks

franzpoeschel mentioned this issue Sep 12, 2023

Lists of Iteration, Mesh and Particle Pathes #282

Open

ax3l added this to openPMD 2.0 Standard Aug 14, 2024

ax3l moved this to Proposed in openPMD 2.0 Standard Aug 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow user to store non-openPMD information #115

Allow user to store non-openPMD information #115

RemiLehe commented Dec 4, 2015

ax3l commented Dec 4, 2015 via email

ax3l commented Dec 5, 2015

RemiLehe commented Dec 7, 2015

ax3l commented Dec 10, 2015

ax3l commented Nov 24, 2017

DavidSagan commented Mar 3, 2019

franzpoeschel commented Sep 21, 2022 •

edited

Loading

ax3l commented Oct 12, 2022 •

edited

Loading

franzpoeschel commented Apr 24, 2023

ax3l commented Apr 25, 2023 •

edited

Loading

franzpoeschel commented Jun 5, 2023

Allow user to store non-openPMD information #115

Allow user to store non-openPMD information #115

Comments

RemiLehe commented Dec 4, 2015

ax3l commented Dec 4, 2015 via email

ax3l commented Dec 5, 2015

RemiLehe commented Dec 7, 2015

ax3l commented Dec 10, 2015

ax3l commented Nov 24, 2017

DavidSagan commented Mar 3, 2019

franzpoeschel commented Sep 21, 2022 • edited Loading

ax3l commented Oct 12, 2022 • edited Loading

franzpoeschel commented Apr 24, 2023

ax3l commented Apr 25, 2023 • edited Loading

franzpoeschel commented Jun 5, 2023

franzpoeschel commented Sep 21, 2022 •

edited

Loading

ax3l commented Oct 12, 2022 •

edited

Loading

ax3l commented Apr 25, 2023 •

edited

Loading