Fix C++ port behavior divergence with reference implementation #136

vityaman · 2024-07-28T11:13:17Z

Description

During rewriting test scenarios from TypeScript to C++ there was explored a behavior divergence in C++ port.

Also after adding GitHub Workflow for Java also failing test was detected.

Examples of that

[Java, C++], candidates.tokens[ExprLexer::VAR] returns [] instead of [ExprLexer::ID, ExprLexer::EQUAL] : 83d73ad
[C++], underfilled idexpressionStack on TEST(CPP14Parser, RealCppFile)

Related issues and MRs

The text was updated successfully, but these errors were encountered:

Signed-off-by: vityaman <vityaman.dev@yandex.ru>

…rdering

vityaman · 2024-08-04T11:42:51Z

I found a bug.

When running tests using ctest (cd test && ctest) they passed, but running tests from a binary file ./expr/antlr4-c3-test-expr they fail. Then I explored that CTest runs all tests by sequentially running a binary file with a filter for an only test case, while running tests using a binary file leads to all test scenarios execution.

So, I decided that the problem was in some implicit dependency of test scenarios. I switched Expr MostSimpleSetup and TypicalSetup test cases and got test failure.

This is because of caching setsPerState[startState->stateNumber]. Ignoring the cache helped and made tests green.

mike-lischke · 2024-08-04T16:36:57Z

But disabling the cache will certainly make everything slower. The must be a difference in behavior of the cache in both TS and C++.

vityaman · 2024-08-05T09:52:36Z

I did some more research. What I found.

The problem is that cache setsPerState is not idempotent. FollowSetsHolder contains the same FollowSetWithPath::path, FollowSetWithPath::intervals and combined, but different FollowSetWithPath::following.

How is that possible while procedures determineFollowSets -> collectFollowSets -> getFollowingTokens look quite isolated? Quite, but not entirely, because getFollowingTokens uses ignoredTokens in a such way that it will exclude an ignored token from following sequence.

In ExprTest::MostSimpleSetup and ExprTest::TypicalSetup test cases, we have different sets of CodeCompletionCore::ignoredTokens. As {ExprLexer::ID, ExprLexer::EQUAL} are not ignored at the first scenario, they are inside the following sequence. But in the second scenario we ignore them, and therefore we exclude them.

Based on this fact, I can say that current C++ port behavior is correct relatively to getFollowingTokens correctness, as it returns an empty following set, while TypeScript version returns non-empty only because of that setsPerState cache.

@mike-lischke, I propose to drop cache at CodeCompletionCore configuration update. It can be done by adding code to property setter or checking that configuration was not changed from the last collectCandidates (check hashCode and then equals).

If I could design a library differently, I would make everything as immutable as possible to avoid such bugs with caches and also improve performance by building data structures with higher level of specialization for a task. I don't see the advantage of having mutable ignoredTokens for example, as in the typical usage this field is not changed. For example, I prefer to instantiate this class as well as lexer, parser and other things once, and on a new input do reset on everything and rerun collectCandidates with different caretTokenIndex and context. But maybe I should grep over antlr-c3 usages over the GitHub and check how it is used by other people.

Also, I would like to say that this behavior of getFollowingTokensis strange. Because of ignoredTokens we can produce invalid following sequence, for example, {SET: [ID, NUMBER] } instead of {SET: [ID, ASSIGN, NUMBER]}. In suggestions system it is quite unexpected output. I think, it is more logical to return tokens until the first ignored one, for example {SET: [ID]}, or don't check for token ignorance at getFollowingTokens at all. Because I thought that ignoredTokens exists to exclude a token from that tokens: Map<Token, Following> keys.

mike-lischke · 2024-08-06T07:49:17Z

That will take some time for me to analyze and understand. Currently I have no bandwidth to dive deep into the c3 algorithm again.

Signed-off-by: vityaman <vityaman.dev@yandex.ru>

mike-lischke added the possible bug label Jul 28, 2024

vityaman added a commit to vityaman/antlr4-c3 that referenced this issue Jul 31, 2024

mike-lischke#136 Fix CppParser::SimpleExample test

0f5a5f0

vityaman added a commit to vityaman/antlr4-c3 that referenced this issue Jul 31, 2024

mike-lischke#136 Fix CppParser::SimpleExample test

f32f9a2

Signed-off-by: vityaman <vityaman.dev@yandex.ru>

vityaman added a commit to vityaman/antlr4-c3 that referenced this issue Aug 1, 2024

mike-lischke#136 Fix CppParser::SimpleExample test

80b9e3f

Signed-off-by: vityaman <vityaman.dev@yandex.ru>

mike-lischke pushed a commit that referenced this issue Aug 1, 2024

#136 Fix CppParser::SimpleExample test

1bf9474

Signed-off-by: vityaman <vityaman.dev@yandex.ru>

vityaman added a commit to vityaman/antlr4-c3 that referenced this issue Aug 4, 2024

mike-lischke#136 Break TypeScript Expr tests

98e345d

vityaman added a commit to vityaman/antlr4-c3 that referenced this issue Aug 4, 2024

mike-lischke#136 Break TypeScript Expr tests

1f640a2

vityaman added a commit to vityaman/antlr4-c3 that referenced this issue Aug 4, 2024

mike-lischke#136 Switch Expr MostSimpleSetup and TypicalSetup tests o…

30e9d08

…rdering

vityaman added a commit to vityaman/antlr4-c3 that referenced this issue Aug 7, 2024

mike-lischke#136 Run tests in random order

d0b5ef9

Signed-off-by: vityaman <vityaman.dev@yandex.ru>

vityaman added a commit to vityaman/antlr4-c3 that referenced this issue Aug 7, 2024

mike-lischke#136 Run tests in random order

b503f7c

Signed-off-by: vityaman <vityaman.dev@yandex.ru>

vityaman added a commit to vityaman/antlr4-c3 that referenced this issue Aug 7, 2024

mike-lischke#136 Run tests in random order

2cac0e6

Signed-off-by: vityaman <vityaman.dev@yandex.ru>

vityaman added a commit to vityaman/antlr4-c3 that referenced this issue Aug 7, 2024

mike-lischke#136 Do not run shuffled tests

acf2a8b

Signed-off-by: vityaman <vityaman.dev@yandex.ru>

mike-lischke pushed a commit that referenced this issue Aug 8, 2024

#136 Run tests in random order

b3e7615

Signed-off-by: vityaman <vityaman.dev@yandex.ru>

mike-lischke pushed a commit that referenced this issue Aug 8, 2024

#136 Do not run shuffled tests

5f7b5c5

Signed-off-by: vityaman <vityaman.dev@yandex.ru>

vityaman added a commit to vityaman/antlr4-c3 that referenced this issue Aug 12, 2024

mike-lischke#136 Remove FollowSet cache dependency on ignoredTokens

bf5bc8a

vityaman added a commit to vityaman/antlr4-c3 that referenced this issue Aug 12, 2024

mike-lischke#136 Fix test assertion for unordered follow tokens

09ee945

vityaman added a commit to vityaman/antlr4-c3 that referenced this issue Aug 12, 2024

mike-lischke#136 Remove FollowSet cache dependency on ignoredTokens

5eaad7b

Signed-off-by: vityaman <vityaman.dev@yandex.ru>

vityaman added a commit to vityaman/antlr4-c3 that referenced this issue Aug 12, 2024

mike-lischke#136 Fix test assertion for unordered follow tokens

60d72ef

Signed-off-by: vityaman <vityaman.dev@yandex.ru>

vityaman mentioned this issue Aug 12, 2024

#136 Remove FollowSet cache dependency on ignoredTokens #149

Merged

vityaman added a commit to vityaman/antlr4-c3 that referenced this issue Aug 26, 2024

mike-lischke#136 Remove ignored following tokens

76ed648

vityaman added a commit to vityaman/antlr4-c3 that referenced this issue Aug 26, 2024

mike-lischke#136 Extract overall results output to procedure

2f26aaa

vityaman added a commit to vityaman/antlr4-c3 that referenced this issue Aug 26, 2024

mike-lischke#136 Remove FollowSet cache dependency on ignoredTokens

4d3c263

Signed-off-by: vityaman <vityaman.dev@yandex.ru>

vityaman added a commit to vityaman/antlr4-c3 that referenced this issue Aug 26, 2024

mike-lischke#136 Fix test assertion for unordered follow tokens

81a1329

Signed-off-by: vityaman <vityaman.dev@yandex.ru>

vityaman added a commit to vityaman/antlr4-c3 that referenced this issue Aug 26, 2024

mike-lischke#136 Remove ignored following tokens

7bbae95

Signed-off-by: vityaman <vityaman.dev@yandex.ru>

vityaman added a commit to vityaman/antlr4-c3 that referenced this issue Aug 26, 2024

mike-lischke#136 Extract overall results output to procedure

c1ad650

Signed-off-by: vityaman <vityaman.dev@yandex.ru>

mike-lischke pushed a commit that referenced this issue Aug 26, 2024

#136 Remove FollowSet cache dependency on ignoredTokens

5c176b1

Signed-off-by: vityaman <vityaman.dev@yandex.ru>

mike-lischke pushed a commit that referenced this issue Aug 26, 2024

#136 Fix test assertion for unordered follow tokens

c24224c

Signed-off-by: vityaman <vityaman.dev@yandex.ru>

mike-lischke pushed a commit that referenced this issue Aug 26, 2024

#136 Remove ignored following tokens

a9a7b2e

Signed-off-by: vityaman <vityaman.dev@yandex.ru>

mike-lischke pushed a commit that referenced this issue Aug 26, 2024

#136 Extract overall results output to procedure

e4e3480

Signed-off-by: vityaman <vityaman.dev@yandex.ru>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix C++ port behavior divergence with reference implementation #136

Fix C++ port behavior divergence with reference implementation #136

vityaman commented Jul 28, 2024

vityaman commented Aug 4, 2024 •

edited

Loading

mike-lischke commented Aug 4, 2024

vityaman commented Aug 5, 2024 •

edited

Loading

mike-lischke commented Aug 6, 2024

Fix C++ port behavior divergence with reference implementation #136

Fix C++ port behavior divergence with reference implementation #136

Comments

vityaman commented Jul 28, 2024

Description

Related issues and MRs

vityaman commented Aug 4, 2024 • edited Loading

mike-lischke commented Aug 4, 2024

vityaman commented Aug 5, 2024 • edited Loading

mike-lischke commented Aug 6, 2024

vityaman commented Aug 4, 2024 •

edited

Loading

vityaman commented Aug 5, 2024 •

edited

Loading