-
-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
TinkerPop tests are failing in master branch (closed after 6 hours) #3747
Comments
The last commit for which TP tests supped to passed on February 22 was made on February 17: d30e789 (started TP tests to verify TP tests still pass for that commit here (PASSED 5 times). Branch is here) Here is a list of commits which we had from February 22 to March 1 (the issue is most likely in one of those commits). 94b3da5 (started TP tests here (PASSED 5 times). Branch is here) 20e14a7 69e01db 9684494 6725fd3 3429215 (started TP tests here (PASSED 5 times) . Branch is here) 245e79f fd3b92c a44647a ba672b9 49c5f80 (snapshot return commit. Skipping this commit) cdb6a93 (doc only change. Skipping this commit) 19a8637 (started TP tests here (1st run PASSED, 2nd run FAILED). Branch is here) 76f9a6b (started TP tests here (FAILED 2 times). Branch is here) 671672b (started TP tests here (FAILED 2 times). Branch is here) fcc8b33 (started TP tests here (FAILED 1st run, PASSED 2nd run). Branch is here) 4475507 (started TP tests here (FAILED 2 times). Branch is here) 2c70076 (doc only change. Skipping this commit) |
Update: tests for the previous commit 19a8637 Passed the first time, but failed the second time. |
Fixes JanusGraph#3747 Signed-off-by: Oleksandr Porunov <alexandr.porunov@gmail.com>
Fixes JanusGraph#3747 Signed-off-by: Oleksandr Porunov <alexandr.porunov@gmail.com>
I tried to revert the failing commit here #3749 but looks like it didn't help. It could be that above investigation of a wrong commit could be influenced by instability of GitHub Actions. That said, it could be that we just have now multiple commits in Thus, I think the problem is one of the below problems:
It's a little bit tricky to find our the root cause of this issue and why TinkerPop tests are not passing for @JanusGraph/committers in case anyone can investigate this issue as well or have some thoughts about how to fix TP tests for GitHub Actions that would be great. |
…sts] [tp-tests] Investigation of JanusGraph#3747 Related 3749# Signed-off-by: Oleksandr Porunov <alexandr.porunov@gmail.com>
…sts] [tp-tests] Investigation of JanusGraph#3747 Related 3749# Signed-off-by: Oleksandr Porunov <alexandr.porunov@gmail.com>
More tests: Finished (Failed):
Finished (Passed):
Wasn't executed?
|
…sts] [tp-tests] Investigation of JanusGraph#3747 Related 3749# Signed-off-by: Oleksandr Porunov <alexandr.porunov@gmail.com>
…sts] [tp-tests] Investigation of JanusGraph#3747 Related 3749# Signed-off-by: Oleksandr Porunov <alexandr.porunov@gmail.com>
…sts] [tp-tests] Investigation of JanusGraph#3747 Related 3749# Signed-off-by: Oleksandr Porunov <alexandr.porunov@gmail.com>
Investigation of JanusGraph#3747 Related JanusGraph#3749 Signed-off-by: Oleksandr Porunov <alexandr.porunov@gmail.com>
Executing more tests shows that the last 245e79f (started TP tests here . Branch is here) fd3b92c (started TP tests here . Branch is here) (PASSED 1 time) a44647a (started TP tests here . Branch is here) (FAILED 1 time) Started tests executions for these commits to see which one may fail. |
Update: With the ongoing investigation the scope of the trouble commits is reduced to this commit only: a44647a I noticed that both The strange thing is that those TinkerPop CQL tests are either finishing in the next window [from 4 hours to 4:50 hours] or don't finish at all. From many times over these days when I was running tests I have never seen a time when those tests finish in 5:10, 5:30, 5:55 (i.e. they always finish under 5 hours). This brings me to thoughts that potentially it could be unrelated to CQL requests throughput but to something else (just a theory). In case anyone knows why would some TP tests hang for over 6 hours for CQL storage backend when CQL executor pool is disabled it would be really great if you could share your thoughts here because currently I'm not getting it. The merged PR of that commit is #3607 cc @li-boxuan @cdegroc because we had some related discussions here #3170 |
…tests] [tp-tests] - Adds multiQuery support to CQL storage backend. - Ensure `storage.parallel-backend-ops` thread pool is created only for storage backends which don't support multiQuery (multi-key slice operations). - Change purpose of `storage.cql.executor-service` to be used for results deserialization jobs only (not for IO operations). Fixes JanusGraph#2406 Fixes JanusGraph#3747 Fixes JanusGraph#3759 Signed-off-by: Oleksandr Porunov <alexandr.porunov@gmail.com>
…tests] [tp-tests] - Adds multiQuery support to CQL storage backend. - Ensure `storage.parallel-backend-ops` thread pool is created only for storage backends which don't support multiQuery (multi-key slice operations). - Change purpose of `storage.cql.executor-service` to be used for results deserialization jobs only (not for IO operations). Fixes JanusGraph#2406 Fixes JanusGraph#3747 Fixes JanusGraph#3759 Signed-off-by: Oleksandr Porunov <alexandr.porunov@gmail.com>
Started TP tests for TP tests for #3760 but with executor service used for mutation queries (don't understand why would we use it for those queries because mutation queries don't have any deserialization work to be done as slice queries. Nevertheless, we do have 2 implementations for some reason (with and without executor service used). TP tests in #3760 before ensuring |
…tests] [tp-tests] - Adds multiQuery support to CQL storage backend. - Ensure `storage.parallel-backend-ops` thread pool is created only for storage backends which don't support multiQuery (multi-key slice operations). - Change purpose of `storage.cql.executor-service` to be used for results deserialization jobs only (not for IO operations). Fixes JanusGraph#2406 Fixes JanusGraph#3747 Fixes JanusGraph#3759 Related to JanusGraph#3170 Signed-off-by: Oleksandr Porunov <alexandr.porunov@gmail.com>
…tests] [tp-tests] - Adds multiQuery support to CQL storage backend. - Ensure `storage.parallel-backend-ops` thread pool is created only for storage backends which don't support multiQuery (multi-key slice operations). - Change purpose of `storage.cql.executor-service` to be used for results deserialization jobs only (not for IO operations). Fixes JanusGraph#2406 Fixes JanusGraph#3747 Fixes JanusGraph#3759 Related to JanusGraph#3170 Signed-off-by: Oleksandr Porunov <alexandr.porunov@gmail.com>
…tests] [tp-tests] - Adds multiQuery support to CQL storage backend. - Ensure `storage.parallel-backend-ops` thread pool is created only for storage backends which don't support multiQuery (multi-key slice operations). - Change purpose of `storage.cql.executor-service` to be used for results deserialization jobs only (not for IO operations). Fixes JanusGraph#2406 Fixes JanusGraph#3747 Fixes JanusGraph#3759 Related to JanusGraph#3170 Signed-off-by: Oleksandr Porunov <alexandr.porunov@gmail.com>
…tests] [tp-tests] - Adds multiQuery support to CQL storage backend. - Ensure `storage.parallel-backend-ops` thread pool is created only for storage backends which don't support multiQuery (multi-key slice operations). - Change purpose of `storage.cql.executor-service` to be used for results deserialization jobs only (not for IO operations). Fixes JanusGraph#2406 Fixes JanusGraph#3747 Fixes JanusGraph#3759 Related to JanusGraph#3170 Signed-off-by: Oleksandr Porunov <alexandr.porunov@gmail.com>
…tests] [tp-tests] - Adds multiQuery support to CQL storage backend. - Ensure `storage.parallel-backend-ops` thread pool is created only for storage backends which don't support multiQuery (multi-key slice operations). - Change purpose of `storage.cql.executor-service` to be used for results deserialization jobs only (not for IO operations). Fixes JanusGraph#2406 Fixes JanusGraph#3747 Fixes JanusGraph#3759 Related to JanusGraph#3170 Signed-off-by: Oleksandr Porunov <alexandr.porunov@gmail.com>
As the above research showed. The long running TinkerPop tests were not related to the |
…tests] [tp-tests] - Adds multiQuery support to CQL storage backend. - Ensure `storage.parallel-backend-ops` thread pool is created only for storage backends which don't support multiQuery (multi-key slice operations). - Change purpose of `storage.cql.executor-service` to be used for results deserialization jobs only (not for IO operations). Fixes JanusGraph#2406 Fixes JanusGraph#3747 Fixes JanusGraph#3759 Related to JanusGraph#3170 Signed-off-by: Oleksandr Porunov <alexandr.porunov@gmail.com>
…tests] [tp-tests] - Adds multiQuery support to CQL storage backend. - Ensure `storage.parallel-backend-ops` thread pool is created only for storage backends which don't support multiQuery (multi-key slice operations). - Change purpose of `storage.cql.executor-service` to be used for results deserialization jobs only (not for IO operations). Fixes JanusGraph#2406 Fixes JanusGraph#3747 Fixes JanusGraph#3759 Related to JanusGraph#3170 Signed-off-by: Oleksandr Porunov <alexandr.porunov@gmail.com>
…tests] [tp-tests] - Adds multiQuery support to CQL storage backend. - Ensure `storage.parallel-backend-ops` thread pool is created only for storage backends which don't support multiQuery (multi-key slice operations). - Change purpose of `storage.cql.executor-service` to be used for results deserialization jobs only (not for IO operations). Fixes JanusGraph#2406 Fixes JanusGraph#3747 Fixes JanusGraph#3759 Related to JanusGraph#3170 Signed-off-by: Oleksandr Porunov <alexandr.porunov@gmail.com>
…tests] [tp-tests] - Adds multiQuery support to CQL storage backend. - Ensure `storage.parallel-backend-ops` thread pool is created only for storage backends which don't support multiQuery (multi-key slice operations). - Change purpose of `storage.cql.executor-service` to be used for results deserialization jobs only (not for IO operations). Fixes JanusGraph#2406 Fixes JanusGraph#3747 Fixes JanusGraph#3759 Related to JanusGraph#3170 Signed-off-by: Oleksandr Porunov <alexandr.porunov@gmail.com>
…tests] [tp-tests] (#3760) * Implement multi-key slice async queries for CQL storage backend [cql-tests] [tp-tests] - Adds multiQuery support to CQL storage backend. - Ensure `storage.parallel-backend-ops` thread pool is created only for storage backends which don't support multiQuery (multi-key slice operations). - Change purpose of `storage.cql.executor-service` to be used for results deserialization jobs only (not for IO operations). - Add possibility to specify back pressure class. Fixes #2406 Fixes #3747 Fixes #3759 Related to #3170 Signed-off-by: Oleksandr Porunov <alexandr.porunov@gmail.com>
On February 22 we had the last time TinkerPop tests CI executed without any problem (scheduled weekly job).
Starting from March 1 the scheduled job failed and it's failing from that time each week. There were no any successful runs after that.
It either means: We made a bug somewhere from February 22 to March 1 or something changed in the CI execution (less powerful machines or something else.
Previously TP tests job were taking 4:45 hours in average. From March 1 it takes over 6 hours, but GitHub actions kill TP tests after 6 hours.
Scheduled execution jobs can be found here: https://github.com/JanusGraph/janusgraph/actions?query=branch%3Amaster+event%3Aschedule
The problem was noticed in the next PRs: #3418 , #3743
Link to the branch with 2 canceled after 6 hours recent jobs: https://github.com/porunov/janusgraph/actions/runs/4809004095/jobs/8559714661
For Cassandra 3:
For Cassandra 4:
Stack Trace (if you have one)
The text was updated successfully, but these errors were encountered: