Hashing, caching and fast-forwarding #652

greschd · 2017-08-17T13:31:35Z

Fixes #119.

Edit / Note: The description below is somewhat out of date, as it describes the initial state of the PR. For a current description, either follow the changes proposed / discussed subsequently, or refer to the descriptions in the documentation made in this PR.

The hashing, caching and fast-forwarding project is a work in progress. Since it is nearing completion, I'm creating this PR to start collecting feedback.

Hashing

The hashing.make_hash creates a hash from a given Python datastructure. Nested structures, are handled by recursive hashing, meaning that for example a dict is turned into

{key: make_hash(value) for key, value in dict.items()}

before hashing it. AiiDA Folder data can also be hashed, by recursively taking the hash of filenames and contents. For floating point numbers, the last 4 bits of the mantissa are truncated before creating the hash. However, this does not apply to ArrayData, where the array is saved as a file on disk and therefore handled by the Folder hashing.

Using the make_hash function, the Node has a get_hash method. For basic nodes, this method creates a hash from get_attrs(), the database folder, and the __version__ of the module where the Node class is defined. If there are errors, get_hash returns None.

Node subclasses can define a class-level _hash_ignored_attributes list, with names of attributes that will not be taken into account when creating the hash. Edit: _updatabale_attributes are also ignored now.

The WorkCalculation subclass adds get_inputs() to the objects that are hashed in get_hash, and exposes the _hash_ignored_inputs to ignore certain inputs.

Caching

When a Node is stored, the hash is saved as an extra. When subsequent nodes are stored, caching can be enabled by setting use_cache=True in the store method. This means that a node of the same type and hash will be returned if it exists. Otherwise a new node is created.

Nodes can determine whether they can be used as cache by implementing _is_valid_cache. This is used to avoid using failed calculations as cache.

The verdi rehash command can be used to re-calculate the hash either for all nodes, or just for a specific class.

Fast-forwarding

Fast-forwarding is implemented by using caching for the calculation of a given Process. If fast-forwarding is enabled and there is already such a calculation, this finished calculation will be used. The subsequent steps of running the process are then skipped, and it is done immediately.

The process decides whether fast-forwarding is used based on the _fast_forward_enabled method. There are two option to set this:

First priority: Passing the _fast_forward=True / False input to the process.
Second priority: A config file cache_config.yml in the .aiida folder. A default can be set, and fast-forwarding for specifc calculation / process classes can be enabled or disabled (by class name). An example file would look like this:

my_profile_name:
  use_cache:  # could be omitted (it's the default)
    default: False
  fast_forward:
    default: False  # could be omitted (it's the default)
    enabled: 
      - SomeCalculation
        OtherCalculation
        SomeWorkChainClass

Since the fast-forwarding is implemented at the level of Process, it doesn't work for InlineCalculation, and JobCalculation that is not launched via work.run.run or work.run.submit.

To-Do's

EDIT: Updated To-Do's after a discussion with Giovanni.

…ckend) that replaces the _dbnode member if a similar node already exists

…None for hash, which obviously should not be checked in the DB

…ditional extra (hash)

* Failing test for two ArrayData with unequal content * (Accidentally) passing test for two ArrayData of different size, with same str representation For the ArrayData, we need to take the actual array into account when creating the hash, not just the shape which is return in get_attrs()

Add more node hashing tests

…ashing

…ode-hashing

Close files after reading them to create the hash.

…ashing

The caching.defaults.use_cache parameter should be used by plugin developers to mark whether a specific .store() call CAN actually use caching. The user decides whether to use it in the end by setting the default to True / False.

…cache function

…ations

…ashing

…e-hashing

greschd · 2018-02-01T12:47:53Z

@sphuber @giovannipizzi I've gone through all your comments now. Please check the changes and let me know if there's something else to be changed.

sphuber

I am happy with the changes. Thanks again for an amazing job Dominik

greschd · 2018-02-02T16:46:35Z

My pleasure. Thanks for all your help!

giovannipizzi · 2018-02-09T15:52:18Z

I'm not sure where @greschd wrote the comment on all attributes being updatable now... anyway, it's a big bug that was undiscovered and I opened #1109 for that.

giovannipizzi

Hi Dominik, thanks a lot! I'm approving and merging this, so we also have more people testing it and giving feedback for potential issues.
I think that there are two more things to do after this

Check and fix Attributes of a calculation can be changed! #1109
discuss if it is a good idea to store the hash in the extra, or should we have a different internal table, or column in DbNode. The reason being that now all tests have to assume that there is an additional extra, which to me is a bit a strange assumption. Another way would be to decide that all extras starting with aiida are not shown by default by get_extra, and these methods have an additional show_internals=True flag, and the set_extra, without flags, complains if one tries to store something starting with aiida (but there is a flag allow_internal=False by default, to allow it). I'm not sure it's a great idea though, maybe it just complicates a lot the logic, and a new column is just the simplest?

greschd · 2018-02-09T16:15:46Z

Cool, thanks for merging!

For the _aiida_hash and _aiida_cached_from extras, I think we discussed at some point adding a column for the hash, and a link type for cached_from. I don't know how much effort both of these changes are, though. Probably we will want to use "internal" extras again in the future when developing new features, so maybe adding the logic to hide the _aiida_* extras still makes sense?

…utes This was already done in PR aiidateam#652 that was merged into develop but needs to be done in this branch as well for consistency, which will be merged into the v0.11.1 patch release

lekah and others added 30 commits June 1, 2017 18:00

aiidateam#119 added first small test to check for node hashing

ce539d8

aiidateam#119 Added small functions get_hash and get_same_node to Node

7589afd

aiidateam#119 Added functionality to store method (only for Django ba…

13caff2

…ckend) that replaces the _dbnode member if a similar node already exists

aiidateam#119 Try-except clause if hashing fails, and also check for …

ca5f047

…None for hash, which obviously should not be checked in the DB

aiidateam#119 Fix in test that was failing because there is now an ad…

87e546f

…ditional extra (hash)

Merge pull request aiidateam#591 from greschd/node-hashing

f7e6fe0

Add more node hashing tests

Merge branch 'develop' of github.com:aiidateam/aiida_core into node-h…

28af8ed

…ashing

Merge branch 'node-hashing' of github.com:aiidateam/aiida_core into n…

f12c3cb

…ode-hashing

Give name to '_' in loop

219d53b

Update hashing algorithm

76acbb6

Fix get_hash for special case of ArrayData

2de9bc8

Fix ArrayData issue by hashing folder content for 'pathlib.Path'

a4ae949

Explicitly raise error when non-directory path is hashed

516c44e

Add pathlib2 requirement

dcb4a38

Print modulename to check failing Travis test

bbe4c33

Try moving ArrayData import inside the test

fb182cc

Move numpy import to test

dc93962

Add comma after checksumdir requirement

5fef7f2

Add find_same to store_all, add to SQLAlchemy

f04d3bb

Simplify get_same_node

1bc7c90

Set hash extra on SQLAlchemy nodes

c5df6bf

Change how extra is stored on SQLAlchemy node

e5c72a1

Use Folder interface for hashing the repository folder

a896eb1

Add logic to ignore attributes

d13f8dd

Add tests for FolderData with empty files and folders.

533e968

Close files after reading them to create the hash.

Add functionality to set the default of caching True / False

55d1ad0

Merge branch 'develop' of github.com:aiidateam/aiida_core into node-h…

29f14ba

…ashing

Add contextmanager to enable / disable caching

7a0d4ef

Change caching default in store methods to False.

148770d

The caching.defaults.use_cache parameter should be used by plugin developers to mark whether a specific .store() call CAN actually use caching. The user decides whether to use it in the end by setting the default to True / False.

greschd and others added 12 commits January 29, 2018 18:59

Fix TestDbExtrasDjango to take into account '_aiida_hash'.

1f5bf07

Fix SQLA test_replacement_1 test to work with '_aiida_hash'

89c3626

Fix error message where 'fast-forwarding' is mentioned.

7b4d164

Remove 'retrieve_temporary_list' from ignored hash attributes.

e515f14

Add doc for what to do when caching is triggered in error, add clear_…

99bad1e

…cache function

Add max_memory_kb and priority to the attributes ignored in JobCalcul…

fe1f5e8

…ations

Add description of '_hash_ignored_inputs'

b7c1e71

Add code to show how to get the full class name

1b4839e

Remove workchain caching test, add caching section to dev guide

01f9004

Merge branch 'develop' into node-hashing

eef22f0

Merge branch 'develop' of github.com:aiidateam/aiida_core into node-h…

5a2a840

…ashing

Merge branch 'node-hashing' of github.com:greschd/aiida_core into nod…

1c0e6bc

…e-hashing

sphuber previously approved these changes Feb 2, 2018

View reviewed changes

Add 'retrieve_temporary_list' to updatable attributes

8596f77

greschd dismissed sphuber’s stale review via 8596f77 February 8, 2018 02:03

Merge branch 'develop' into node-hashing

419b20a

giovannipizzi approved these changes Feb 9, 2018

View reviewed changes

giovannipizzi merged commit f593ca5 into aiidateam:develop Feb 9, 2018

sphuber mentioned this pull request Feb 12, 2018

Merging develop into workflows #1113

Merged

greschd deleted the node-hashing branch April 11, 2018 15:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Hashing, caching and fast-forwarding #652

Hashing, caching and fast-forwarding #652

greschd commented Aug 17, 2017 •

edited

Loading

greschd commented Feb 1, 2018

sphuber left a comment

greschd commented Feb 2, 2018

giovannipizzi commented Feb 9, 2018

giovannipizzi left a comment

greschd commented Feb 9, 2018

Hashing, caching and fast-forwarding #652

Hashing, caching and fast-forwarding #652

Conversation

greschd commented Aug 17, 2017 • edited Loading

Hashing

Caching

Fast-forwarding

To-Do's

greschd commented Feb 1, 2018

sphuber left a comment

Choose a reason for hiding this comment

greschd commented Feb 2, 2018

giovannipizzi commented Feb 9, 2018

giovannipizzi left a comment

Choose a reason for hiding this comment

greschd commented Feb 9, 2018

greschd commented Aug 17, 2017 •

edited

Loading