Evaluate triggerables for all repeat instances where needed #586

lognaturel · 2020-06-20T04:47:00Z

Closes #584

d7593de removed a map that was used to keep track of expanded repeat references for triggerables that need to be evaluated across repeat instances. This PR introduces an alternate approach.
There used to be a special case for repeat instance deletion where all triggerables for instances following the removed instance were evaluated. This bypassed the DAG and resulted in a lot of unnecessary computations. This PR removes the custom code and lets repeat deletion use the DAG to identify which expressions need to be recomputed.
26cdfa2 led to a regression in evaluation of expressions in repeats with references across repeat instances. Prior to that commit, multiplicity was only added during contextualization if the target reference level didn't already have a predicate. This PR re-establishes that condition and adds tests to support it.

Remaining questions:

Does initializeTriggerables need to consider triggerables that affect all instances? addingRepeatInstance_withInnerCalculateDependentOnOuterSum_updatesInnerSumForAllInstances() gets at something related.

What has been done to verify that this works as intended?

Ensured that all tests that passed on v2.17.0 still pass. I analyzed the different types of triggerable cascades and wrote tests for them.

Why is this the best possible solution? Were any other approaches considered?

The alternative we have considered is bringing back the previous map (master...ggalmazor:firedAnchors_exploration). Neither @ggalmazor nor I can explain with confidence what cases that approach covered or why it was a good design. We know it didn't cover cases that we have tests for. Because the map both gets updated as triggerables are triggered and also feeds into which triggerables are triggered, it's hard to reason about. Additionally, it ends up bypassing the expandReference call in evaluateTriggerable in certain cases where every repeat instance must be updated which is also very confusing.

This PR proposes a different approach where the triggerables that need to be updated across repeat instances are identified on creation or removal of repeat. This makes it very clear that reference expansion is only relevant to repeats. It also makes it possible to have a single method (getTriggerablesAffectingAllInstances) that does all that work. We can then rely on a check on the set it produces when evaluating individual triggerables.

I considered using the existing iteration over all the cascades triggered by the changed repeat in getAllToTrigger to identify triggerables that affect all repeat instance but I found that I couldn't efficiently look up whether the changedRef is a repeat instance so I would have had to pass that in. I am confident that the DAG traversal is not a major bottleneck so I went with the clearer approach.

How does this change affect users? Describe intentional changes to behavior and behavior that could have accidentally been affected by code changes. In other words, what are the regression risks?

The only intentional changes are bug fixes. There are three big types of risk: someone was relying on one of those bugs, there's a regression in behavior we don't have test coverage for, performance has degraded.

The big change in behavior compared to v2.17.0 is that adding a repeat instance will trigger recomputation of any calculation inside that repeat which refers to the generic repeat such as count(../../repeat) or position(..). I don't think anyone could have been relying on that bad behavior. I think there is a performance implication for position(..) but I haven't had a chance to investigate yet.

For performance, here is the ChildVaccination benchmark:
this code
Benchmark Mode Cnt Score Error Units
ChildVaccinationBenchmark.run_1_times thrpt 25 1.405 ± 0.107 ops/s
ChildVaccinationBenchmark.run_2_times thrpt 25 0.685 ± 0.036 ops/s
ChildVaccinationBenchmark.run_3_times thrpt 25 0.472 ± 0.027 ops/s

v2.17.0 (last released)
ChildVaccinationBenchmark.run_1_times  thrpt   25  1.121 ± 0.015  ops/s
ChildVaccinationBenchmark.run_2_times  thrpt   25  0.564 ± 0.012  ops/s
ChildVaccinationBenchmark.run_3_times  thrpt   25  0.372 ± 0.006  ops/s

master
Benchmark Mode Cnt Score Error Units
ChildVaccinationBenchmark.run_1_times thrpt 25 1.432 ± 0.179 ops/s
ChildVaccinationBenchmark.run_2_times thrpt 25 0.709 ± 0.101 ops/s
ChildVaccinationBenchmark.run_3_times thrpt 25 0.540 ± 0.104 ops/s

Guillermo firedAnchors revert
Benchmark Mode Cnt Score Error Units
ChildVaccinationBenchmark.run_1_times thrpt 25 1.217 ± 0.060 ops/s
ChildVaccinationBenchmark.run_2_times thrpt 25 0.625 ± 0.015 ops/s
ChildVaccinationBenchmark.run_3_times thrpt 25 0.413 ± 0.006 ops/s

This unambiguously is faster than v2.17.0 for the cases in ChildVaccination. It compares favorably to bringing back firedAnchors.

One case I know will be slower is when there's a position(..) expression and a new repeat instance is added. Previously this would only be evaluated for the newly-created repeat. Now it will be evaluated for every repeat instance. This has a non-negligible impact when we get to hundreds of repeats and it would be more significant if there's a long cascade based on position. The tradeoff is that we don't need a special approach for deletes (where position does change for nodes after the deleted one) and that count/sum from inside the repeat updates correctly (modulo issues contextualizing absolute refs illustrated by tests). I think that tradeoff is worth it because I haven't seen many forms with hundreds of repeat instances or many forms that use position. Naturally, there are some that do both and that's one thing to get user feedback on. If it is a big problem, we can remove the first case in getTriggerablesAffectingAllInstances and match the v2.17.0 behavior.

I think that the worst case would be that firedAnchors covers some case we haven't identified. Even then I'd say the possible user impact isn't huge because there are more forms without repeats than with repeats. We now have fairly significant test coverage so it seems unlikely that we've missed a common case.

Do we need any specific form for testing your changes? If so, please attach one.

No. Tests capture the interesting cases that I can think of. Ideally a reviewer would do another pass thinking of cases I might have missed.

Does this change require updates to documentation? If so, please file an issue here and include the link below.

Yes, I will follow up with updated DAG documentation.

This was previously used to compute repeat relevance. However, TreeElement fully captures relevance rules. See getodk#581

Was always false so we can remove the branch

For performance reasons, this needs to only be done if we know the changed reference is a repeat. It's not possible to quickly identify whether a TreeReference represents a repeat/repeat instance so we need some information from createRepeatInstance. We could pass a boolean representing whether the changed node is a repeat but this seems clearer to me.

This illustrates the first case in getTriggerablesAffectingAllInstances.

We've captured the count contextualization issue elsewhere and don't have many predicate examples.

addingNestedRepeatInstance_updatesExpressionTriggeredByGenericRef_forAllRepeatInstances has a similar count expression

Repeat instance deletion was treated as a special case so didn't apply here. The back reference case is a regression.

JR makes a distinction between predicates and multiplicity but really multiplicity should just be a position predicate. It doesn't make sense to have both a multiplicity and a predicate at a particular reference step. If that would happen, favor the predicate.

ggalmazor · 2020-06-21T13:58:06Z

Impressive work both on the implementation side and the effort for having every corner case characterized in tests! 👍 👍 👍 👍 👍

We will have a chat in a couple of minutes, but I want to write down some thoughts/comments:

Does initializeTriggerables need to consider triggerables that affect all instances? addingRepeatInstance_withInnerCalculateDependentOnOuterSum_updatesInnerSumForAllInstances() gets at something related.

I think it does, but only if we want to cover that corner case. It doesn't look like something you would find very often, so it could be OK to document it and ignore it for now.

One case I know will be slower is when there's a position(..) expression and a new repeat instance is added

I suggest we add a benchmark around that to keep it under control.

I think that the worst case would be that firedAnchors covers some case we haven't identified.

We've already established that the original implementation can't be explained. We even guess that it's an accidental implementation.

I think that replacing the implementation is something that we can't really avoid. Now the question would be: how hard is going to be to add a new scenario we don't want know about?

My opinion is that now we would be much better suited to change the DAG's behavior than in v2.17. This is not only because we've learned about the DAG's internals, but also because the code has a better design and there's a huge amount of tests that will let us work safely and with confidence.

The big change in behavior compared to v2.17.0 is that adding a repeat instance will trigger recomputation of any calculation inside that repeat which refers to the generic repeat such as count(../../repeat) or position(..). I don't think anyone could have been relying on that bad behavior.

Let's document this and provide form designers with alternative implementations that follow best practices.

ggalmazor

LGTM! Awesome work!

lognaturel added 20 commits June 19, 2020 16:08

Add tests

f410dd9

Isolate and document getting all triggerables to trigger

17d8171

Group and rename methods for repeat instances

3f27d29

Reorder methods and add regions

f57b2ae

Remove unused repeat conditions per targets

e5ea009

This was previously used to compute repeat relevance. However, TreeElement fully captures relevance rules. See getodk#581

Remove trust in previously-committed answer

03f868b

Was always false so we can remove the branch

Make naming consistent, remove indirection

b2a4ea3

Explain context node contextualization

bc6da8d

Rename and update test that now passes

fab613e

This illustrates the first case in getTriggerablesAffectingAllInstances.

Re-establish test with position predicate

739e3a8

We've captured the count contextualization issue elsewhere and don't have many predicate examples.

Remove test that now passes

ddc8138

addingNestedRepeatInstance_updatesExpressionTriggeredByGenericRef_forAllRepeatInstances has a similar count expression

Remove incorrect explanation and split instance ref tests

31d96d4

Repeat instance deletion was treated as a special case so didn't apply here. The back reference case is a regression.

Update expected DAG events from deletion

3fce590

Rename tests about adding or removing repeat instances

e26834e

Move tests about adding or removing repeat instances

2abe0d2

Test for deleting repeat instance without refs to repeat

a14b992

Add test for initialization with defaults

d7783a2

Remove unused method

2efde42

Describe use of expandReference call in Triggerable.apply

e95ef85

This comment has been minimized.

Sign in to view

lognaturel force-pushed the issue-584-2 branch from 6b5fa9f to 5057f2c Compare June 20, 2020 17:13

lognaturel added 3 commits June 20, 2020 10:27

Test deletion with repeat sibling calculation

21033fe

Explain why next-number case is different from prev-number

be5d8b1

Add hypothesis about why predicate count is wrong

c89c1a5

lognaturel requested a review from ggalmazor June 21, 2020 04:32

lognaturel marked this pull request as ready for review June 21, 2020 04:32

ggalmazor approved these changes Jun 21, 2020

View reviewed changes

ggalmazor merged commit c4e7320 into getodk:master Jun 21, 2020

lognaturel deleted the issue-584-2 branch June 21, 2020 15:00

lognaturel mentioned this pull request Jun 22, 2020

Update tests, benchmarks and documentation for cases in which triggerables are computed for all repeat instances on repeat instance add/delete #587

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Evaluate triggerables for all repeat instances where needed #586

Evaluate triggerables for all repeat instances where needed #586

lognaturel commented Jun 20, 2020 •

edited

Loading

This comment has been minimized.

ggalmazor commented Jun 21, 2020 •

edited

Loading

ggalmazor left a comment

Evaluate triggerables for all repeat instances where needed #586

Evaluate triggerables for all repeat instances where needed #586

Conversation

lognaturel commented Jun 20, 2020 • edited Loading

What has been done to verify that this works as intended?

Why is this the best possible solution? Were any other approaches considered?

How does this change affect users? Describe intentional changes to behavior and behavior that could have accidentally been affected by code changes. In other words, what are the regression risks?

Do we need any specific form for testing your changes? If so, please attach one.

Does this change require updates to documentation? If so, please file an issue here and include the link below.

This comment has been minimized.

ggalmazor commented Jun 21, 2020 • edited Loading

ggalmazor left a comment

Choose a reason for hiding this comment

lognaturel commented Jun 20, 2020 •

edited

Loading

ggalmazor commented Jun 21, 2020 •

edited

Loading