[ML] AIOps: Functional/API integration tests for text field support for log rate analysis #168177

walterra · 2023-10-06T06:50:43Z

Summary

This updates the artificial dataset generator for log rate analysis to allow to create variants including text fields.
The artificial dataset is now used for 4 variants of functional and API integration tests: Testing spike and dip with both with and without a text field.

The new tests surfaced some issues that were fixed as part of this PR:

Getting the counts of log patterns in combination with individual significant terms ended up with to granular groups. This PR adds additional queries to get counts for log patterns in combination with item sets already derived from significant terms.
The support value is returned by the frequent item sets agg and is used as a threshold whether to include an item set for grouping. This was missing from significant log patterns and is fixed by this PR.
Adds a check to not get frequent item sets for log patterns if there are no significant terms.
The way we fetched log patterns using a time filter that spans the whole of the baseline start to the deviation end caused problems with analysing dips. This PR updates those queries to only fetch the actual baseline and deviation time range.
The integration tests caught an issue where we'd still fetch the histogram for log patterns even if we'd request grouping information only.

After:

Checklist

Unit or functional tests were updated or added to match the most common scenarios
This was checked for breaking API changes and was labeled appropriately

walterra · 2023-10-06T19:30:21Z

Flaky test runner: https://buildkite.com/elastic/kibana-flaky-test-suite-runner/builds/3397

🔴 1/50 runs failed

…ields

…xt fields

…ant terms

walterra · 2023-10-09T07:45:09Z

Flaky test runner: https://buildkite.com/elastic/kibana-flaky-test-suite-runner/builds/3411

✅ 100/100 runs passed

elasticmachine · 2023-10-09T08:56:37Z

Pinging @elastic/ml-ui (:ml)

jgowdyelastic · 2023-10-10T14:38:32Z

x-pack/plugins/aiops/server/routes/queries/fetch_terms_2_categories_counts.ts

+            fieldValue,
+          })),
+          category.fieldName,
+          { key: `${category.key}`, count: category.doc_count, examples: [] },


There is a bug I've recently discovered where the query created from the category key will not match any documents.
I will create an issue for it, but I don't immediately know how to fix it.
Basically, from what have seen, if the document has a string that looks like this foo:bar the category key might look like this foo bar. These will not match in the query.
There is a risk that changing the query will cause it to become more greedy and match documents which aren't in the category.
Using the regex from the category would probably work, but could be very expensive, and may not work at all if the cluster disallows expensive queries.

This comment is just a heads up that you may get a 0 count here, depending on the data.

x-pack/plugins/aiops/server/routes/queries/get_simple_hierarchical_tree.ts

…-tests

jgowdyelastic

LGTM

qn895 · 2023-10-10T16:16:45Z

Code changes LGTM 🎉 Minor nit comment to rename df variable in functions to a clearer name in the future.

kibana-ci · 2023-10-10T17:15:19Z

💛 Build succeeded, but was flaky

Buildkite Build
Commit: 432120f

Failed CI Steps

Investigations - Security Solution Cypress Tests #2

Test Failures

[job] [logs] Investigations - Security Solution Cypress Tests #2 / Timeline search and filters Update kqlMode for timeline should be able to update timeline kqlMode with filter should be able to update timeline kqlMode with filter

Metrics [docs]

✅ unchanged

History

💚 Build #166212 succeeded e3ca450
💚 Build #166002 succeeded aa0ec35acd61957bd786c9df8024fd8b912a33e4
💔 Build #165832 failed 29646a6deb20eac52cd3fa56c9542a5681cdc296
💔 Build #165823 failed b401f9d40544d96aed0f58f411bc1b59af06ba1e
💔 Build #165804 failed b017dce03541928c1ef6bc7889eed5415bee25de

To update your PR or re-run it, just comment with:
@elasticmachine merge upstream

cc @walterra

…or log rate analysis (elastic#168177) This updates the artificial dataset generator for log rate analysis to allow to create variants including text fields. The artificial dataset is now used for 4 variants of functional and API integration tests: Testing spike and dip with both with and without a text field. The new tests surfaced some issues that were fixed as part of this PR: - Getting the counts of log patterns in combination with individual significant terms ended up with to granular groups. This PR adds additional queries to get counts for log patterns in combination with item sets already derived from significant terms. - The `support` value is returned by the frequent item sets agg and is used as a threshold whether to include an item set for grouping. This was missing from significant log patterns and is fixed by this PR. - Adds a check to not get frequent item sets for log patterns if there are no significant terms. - The way we fetched log patterns using a time filter that spans the whole of the baseline start to the deviation end caused problems with analysing dips. This PR updates those queries to only fetch the actual baseline and deviation time range. - The integration tests caught an issue where we'd still fetch the histogram for log patterns even if we'd request grouping information only. (cherry picked from commit 9259f48)

kibanamachine · 2023-10-10T17:30:41Z

💚 All backports created successfully

Status	Branch	Result
✅	8.11

Note: Successful backport PRs will be merged automatically after passing CI.

Questions ?

Please refer to the Backport tool documentation

…pport for log rate analysis (#168177) (#168516) # Backport This will backport the following commits from `main` to `8.11`: - [[ML] AIOps: Functional/API integration tests for text field support for log rate analysis (#168177)](#168177)  ### Questions ? Please refer to the [Backport tool documentation](https://github.com/sqren/backport)  Co-authored-by: Walter Rafelsberger <walter.rafelsberger@elastic.co>

…or log rate analysis (elastic#168177) This updates the artificial dataset generator for log rate analysis to allow to create variants including text fields. The artificial dataset is now used for 4 variants of functional and API integration tests: Testing spike and dip with both with and without a text field. The new tests surfaced some issues that were fixed as part of this PR: - Getting the counts of log patterns in combination with individual significant terms ended up with to granular groups. This PR adds additional queries to get counts for log patterns in combination with item sets already derived from significant terms. - The `support` value is returned by the frequent item sets agg and is used as a threshold whether to include an item set for grouping. This was missing from significant log patterns and is fixed by this PR. - Adds a check to not get frequent item sets for log patterns if there are no significant terms. - The way we fetched log patterns using a time filter that spans the whole of the baseline start to the deviation end caused problems with analysing dips. This PR updates those queries to only fetch the actual baseline and deviation time range. - The integration tests caught an issue where we'd still fetch the histogram for log patterns even if we'd request grouping information only.

walterra self-assigned this Oct 6, 2023

walterra added the release_note:skip Skip the PR/issue when compiling release notes label Oct 6, 2023

walterra force-pushed the 167467-ml-aiops-log-rate-analysis-functional-tests branch from b017dce to b401f9d Compare October 6, 2023 08:50

peteharverson mentioned this pull request Oct 6, 2023

[ML] Increase Test Coverage 8.11.0 #164562

Closed

10 tasks

walterra force-pushed the 167467-ml-aiops-log-rate-analysis-functional-tests branch from b401f9d to 29646a6 Compare October 6, 2023 09:16

walterra added bug Fixes for quality problems that affect the customer experience :ml Feature:ML/AIOps ML AIOps features: Change Point Detection, Log Pattern Analysis, Log Rate Analysis v8.11.0 v8.12.0 labels Oct 6, 2023

walterra force-pushed the 167467-ml-aiops-log-rate-analysis-functional-tests branch from 29646a6 to aa0ec35 Compare October 6, 2023 15:25

walterra changed the title ~~[ML] AIOps: Functional tests for text field support for log rate analysis~~ [ML] AIOps: Functional/API integration tests for text field support for log rate analysis Oct 6, 2023

walterra added 14 commits October 9, 2023 08:42

extends artificial dataset generation to include variants with text f…

4bdf18f

…ields

fix time range to get categories for dips

33111b2

run additional queries to get counts of sig terms based groups and te…

74069a1

…xt fields

improve types and naming.

4b89e94

fix calculating support value for item sets with log patterns

7fc555d

move minimum_support to const

582e140

fix check to not run grouping for log patterns if there's no signific…

e172076

…ant terms

add comment on how to calculat support for log patterns

c4f870a

fix API integration tests

df7ecd3

adds API integration test for dip

e305dca

adds API integration test for spike with text field

5254d32

adds API integration test for dip with text field

992f9a7

improve assertion error message for chunkCounter

daa1b78

adds a check to only delete index if it exists

e3ca450

walterra force-pushed the 167467-ml-aiops-log-rate-analysis-functional-tests branch from aa0ec35 to e3ca450 Compare October 9, 2023 07:42

walterra marked this pull request as ready for review October 9, 2023 08:56

walterra requested a review from a team as a code owner October 9, 2023 08:56

walterra requested review from alvarezmelissa87 and jgowdyelastic October 9, 2023 08:57

jgowdyelastic reviewed Oct 10, 2023

View reviewed changes

qn895 reviewed Oct 10, 2023

View reviewed changes

x-pack/plugins/aiops/server/routes/queries/get_simple_hierarchical_tree.ts Outdated Show resolved Hide resolved

walterra added 2 commits October 10, 2023 17:30

Merge branch 'main' into 167467-ml-aiops-log-rate-analysis-functional…

b44f38a

…-tests

rename df to itemSets

432120f

jgowdyelastic approved these changes Oct 10, 2023

View reviewed changes

qn895 approved these changes Oct 10, 2023

View reviewed changes

walterra merged commit 9259f48 into elastic:main Oct 10, 2023

walterra deleted the 167467-ml-aiops-log-rate-analysis-functional-tests branch October 10, 2023 17:24

kibanamachine mentioned this pull request Oct 10, 2023

[8.11] [ML] AIOps: Functional/API integration tests for text field support for log rate analysis (#168177) #168516

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ML] AIOps: Functional/API integration tests for text field support for log rate analysis #168177

[ML] AIOps: Functional/API integration tests for text field support for log rate analysis #168177

walterra commented Oct 6, 2023 •

edited by kibanamachine

Loading

walterra commented Oct 6, 2023 •

edited

Loading

walterra commented Oct 9, 2023 •

edited

Loading

elasticmachine commented Oct 9, 2023

jgowdyelastic Oct 10, 2023 •

edited

Loading

jgowdyelastic left a comment

qn895 commented Oct 10, 2023

kibana-ci commented Oct 10, 2023

kibanamachine commented Oct 10, 2023

[ML] AIOps: Functional/API integration tests for text field support for log rate analysis #168177

[ML] AIOps: Functional/API integration tests for text field support for log rate analysis #168177

Conversation

walterra commented Oct 6, 2023 • edited by kibanamachine Loading

Summary

Checklist

walterra commented Oct 6, 2023 • edited Loading

walterra commented Oct 9, 2023 • edited Loading

elasticmachine commented Oct 9, 2023

jgowdyelastic Oct 10, 2023 • edited Loading

Choose a reason for hiding this comment

jgowdyelastic left a comment

Choose a reason for hiding this comment

qn895 commented Oct 10, 2023

kibana-ci commented Oct 10, 2023

💛 Build succeeded, but was flaky

Failed CI Steps

Test Failures

Metrics [docs]

History

kibanamachine commented Oct 10, 2023

💚 All backports created successfully

Questions ?

walterra commented Oct 6, 2023 •

edited by kibanamachine

Loading

walterra commented Oct 6, 2023 •

edited

Loading

walterra commented Oct 9, 2023 •

edited

Loading

jgowdyelastic Oct 10, 2023 •

edited

Loading