[DRAFT] Attempting Cypress test fixes #186562

rylnd · 2024-06-20T19:56:31Z

🚧 This is based on #181926, and is me attempting to fix cypress tests without mucking up the activity log there.

This is mostly based on the current test plan. It's not wired up yet, nor are there any actual implementations.

These now have type errors, since ML rules don't yet accept suppression fields. We have our next task!

`node scripts/openapi/generate`

We're now asserting that suppression fields are present on the generated alerts, which they're not, because we haven't implemented them yet. That's the next step!

* Adds call getIsSuppressionActive in our rule executor, and necessary dependencies * Adds suppression fields to ML rule schema * Adds feature flag for ML suppression

I noticed that it doesn't look like we're including a lot of timing info in the ML executor; adding this to validate that, and document what we _are_ recording.

This will light up the paths that we need to implement. Next!

This adds all the parameters necessary to invoke this method (if relevant) in the ML rule executor. Given the relative simplicity of the ML rule type, I'm guessing that many of these values are irrelevant/unused in this case, but I haven't yet investigated that. Next step is to exercise this implementation against the FTR tests, and see if the behavior is what we expect. Once that's done, we can try to pare down what we need/use. I also added some TODOs in the course of this work to check some potential bugs I noticed.

Tests were failing as rules were being created without suppression params. Fixed!

We've got suppression fields making it into ML alerts for the first time! Now, to test the various suppression conditions.

I realized that most of these tests were using es_archiver to insert anomalies into an index, but our tests were only ever using a single one of those anomalies. In order to ensure these tests are independent of the data in that archive, I've created and leveraged a helper to delete all the persisted anomalies, and then use existing tooling to manually insert the anomalies needed for our tests. All of the current tests are green; there are just a few more permutations that still need to be implemented.

This tests all of the interesting permutations of alert suppression for ML rules, both with per-execution and interval suppression durations. I added a few TODOs noting unexpected (to me) behavior; we'll see what others think.

The behavior demonstrated in this test is in fact expected, as the suppression duration window applies to the alert creation time, not the original anomaly time.

Most other rule types have both a "fill" task and a "fillAndContinue" task; this adds that pattern for ML rules on the Define step.

These are failing because I haven't yet enabled the suppression UI for ML rules. Once that's done, we can start validating these tests.

I don't know if we need to touch this file, but I'm making a note to come back to it.

This mainly involved modifying `useAlertSuppression`, as well as some logic in the rule form's data parsing. At this point, I believe the frontend should be working.

This just adds some mock values for our new params related to suppression. All of the test coverage is in our integration tests.

We missed an import of a referenced type.

I think the failure here was because the value `agent.name` has multiple matching fields, which caused this task to fail. By using the down arrow to select the first matching field for a given value, this task is now much more robust.

These tests actually uncovered a deficiency in the ML rule creation flows, where autocomplete is not correct. This means that it's currently impossible to add/edit alert suppression for an existing ML rule (via the UI). Details in elastic#183100.

There were no less than four assertions in this test that relied on there being no other rules present in the environment, but nothing was being done to ensure that was the case. I can't imagine why these were skipped!

I want to run these in the flaky runner to get a sense of how/where they're still failing, for now.

We were over-eagerly disabling these fields when the ML checks were not relevant.

ML Rule Suppression UI Improvements

Conflicts: x-pack/test/security_solution_cypress/config.ts

…-fix'

The undefined value was coming from our abstracted hook, and the hook now correctly handles the undefined value. This union is no longer needed.

The former just fails with "element not found" while the current version will actually show the text that it did find.

These changes are taken from my investigation in elastic#182183. Note that this only changes this configuration for one of the ML jobs, but there are two contained in this archive. Since the tests are currently only concerned with the first job, so am I. If in the course of investigation I find that we're not using the other job's anomalies, I'll be deleting them since it's just test overhead (and potentially a source of more bugs).

There was an additional parameter added to the rule parameters. Rather than keep this test in sync with all of the possible parameters, we just assert on the ones we care about.

I believe this is the cause of some sporadic failures when running these tests together, as they all now have the implicit "start job when rule is created" functionality. By stopping the datafeeds before each suite, none of the relevant jobs should be in the full "started" state required by our validation, so we effectively have a "clean" state. In figuring the above out I was also made aware of the "force stop datafeed and close job" endpoint, which would probably be a more robust solution, but for now this works.

The rule params are typed as a union of all possible allowed values of `machine_learning_job_id`, while our helpers just expect `string[]`. Since the value in question is verifiably a `string[]`, I'm telling TS as much.

I saw this explicitly not work locally, but maybe CI is playing by different rules.

rylnd · 2024-06-20T19:56:42Z

/ci

The issue that the previous method (treating the first item different from the rest) seems to be a consequence of there being "state" in the dropdown list after the first item is selected. By hitting ESC after each item is typed/selected, we get rid of this state, and so we can treat each item separately again.

rylnd · 2024-06-20T22:07:52Z

/ci

rylnd · 2024-06-21T03:00:18Z

/ci

rylnd · 2024-06-21T18:41:28Z

/ci

I'm not sure this will have the intended effect (of making the job state more deterministic), but it's worth a shot.

This endpoint will return a 404 if the job(s) being requested are not found. This should not fail the test.

rylnd · 2024-06-21T21:15:40Z

/ci

It was previously possible to get into a state where the fields were not loading, but were also an empty array. Since the test simply waits for the field to be enabled, this meant there was a chance it would attempt to fill unavailable options in the combobox. This removes that possibility by ensuring that fields are present before enabling the field.

We have the analogous setting in the ess config, and the test-specific flags that get respected on CI, but I had not run the serverless cypress locally before, and neglected to add this config for that case.

kibana-ci · 2024-06-21T22:00:45Z

💔 Build Failed

Failed CI Steps

Test Failures

[job] [logs] Serverless Detection Engine - Security Solution Cypress Tests #2 / Machine Learning Detection Rules - Creation with Alert Suppression when ML jobs have run when all jobs are running allows a rule with interval suppression to be created and displayed allows a rule with interval suppression to be created and displayed
[job] [logs] Serverless Detection Engine - Security Solution Cypress Tests #2 / Machine Learning Detection Rules - Creation with Alert Suppression when ML jobs have run when all jobs are running allows a rule with per-execution suppression to be created and displayed allows a rule with per-execution suppression to be created and displayed
[job] [logs] Serverless Detection Engine - Security Solution Cypress Tests #4 / Machine Learning Detection Rules - Editing with Alert Suppression allows editing of a rule to remove suppression configuration allows editing of a rule to remove suppression configuration
[job] [logs] Serverless Detection Engine - Security Solution Cypress Tests #4 / Machine Learning Detection Rules - Editing without Alert Suppression allows editing of a rule to add suppression configuration allows editing of a rule to add suppression configuration

Metrics [docs]

Module Count

Fewer modules leads to a faster build time

id	before	after	diff
`securitySolution`	5507	5509	+2

Async chunks

Total size of all lazy-loaded chunks that will be downloaded as the user navigates the app

id	before	after	diff
`securitySolution`	13.6MB	13.6MB	+3.4KB

Page load bundle

Size of the bundles that are downloaded on every page load. Target size is below 100kb

id	before	after	diff
`securitySolution`	84.2KB	84.3KB	+68.0B

Unknown metric groups

References to deprecated APIs

id	before	after	diff
`securitySolution`	576	577	+1

History

💔 Build #217228 failed 3414ff1cde2f6d2a4db08763ae1f50069da95a53
💔 Build #217055 failed af4e56b6eaf170df286266263d71acd77b762c00
💔 Build #217027 failed a2bb85f
💔 Build #217004 failed c9ece66

To update your PR or re-run it, just comment with:
@elasticmachine merge upstream

rylnd · 2024-07-09T22:14:09Z

Closed in favor of the related FTR investigation here

rylnd added 30 commits April 25, 2024 21:52

Add outline of integration test scenarios

ad461bb

This is mostly based on the current test plan. It's not wired up yet, nor are there any actual implementations.

Fleshing out more of our suppression execution tests

8c0f6c1

These now have type errors, since ML rules don't yet accept suppression fields. We have our next task!

Declare alert suppression fields as optional for ML rules

01bcf8e

Generated new types from new schema

c42b339

`node scripts/openapi/generate`

First legitimately failing test

b78c531

We're now asserting that suppression fields are present on the generated alerts, which they're not, because we haven't implemented them yet. That's the next step!

Extract executor params to interface

cad4183

Merge branch 'main' into ml_rule_alert_suppression

5377c6d

Adding more ML suppression functionality as typescript and tests dictate

6dbb88f

* Adds call getIsSuppressionActive in our rule executor, and necessary dependencies * Adds suppression fields to ML rule schema * Adds feature flag for ML suppression

Declare ML rule to be suppressible

e29c3d7

Add ML rule to general suppression schema tests

12ad5f5

Add placeholder for ML executor functionality

c8b7c6a

I noticed that it doesn't look like we're including a lot of timing info in the ML executor; adding this to validate that, and document what we _are_ recording.

Declare our new executor parameters needed for rule suppression

7f317cf

This will light up the paths that we need to implement. Next!

Enable feature flag in FTR tests

703084f

Handle ML suppression params in rule converters

b9de69e

Tests were failing as rules were being created without suppression params. Fixed!

First passing integration test

7e63b4d

We've got suppression fields making it into ML alerts for the first time! Now, to test the various suppression conditions.

Flesh out remaining API integration tests

99aaffe

This tests all of the interesting permutations of alert suppression for ML rules, both with per-execution and interval suppression durations. I added a few TODOs noting unexpected (to me) behavior; we'll see what others think.

Update test description in response to feedback

ee86fb1

The behavior demonstrated in this test is in fact expected, as the suppression duration window applies to the alert creation time, not the original anomaly time.

Add non-destructive form filling task for ML rules

b5e809d

Most other rule types have both a "fill" task and a "fillAndContinue" task; this adds that pattern for ML rules on the Define step.

Remove unused helper

10a9a42

Add cypress tests around creating/editing ML rules with suppression

f010c1c

These are failing because I haven't yet enabled the suppression UI for ML rules. Once that's done, we can start validating these tests.

Add TODO for later

5358bc4

I don't know if we need to touch this file, but I'm making a note to come back to it.

Add missing frontend logic to enable ML alert suppression

4804eef

This mainly involved modifying `useAlertSuppression`, as well as some logic in the rule form's data parsing. At this point, I believe the frontend should be working.

Fix ML executor unit tests

03833d1

This just adds some mock values for our new params related to suppression. All of the test coverage is in our integration tests.

Fix type error

184375d

We missed an import of a referenced type.

Add alert suppression fields to alerting integration snapshot

982649f

Merge branch 'main' into ml_rule_alert_suppression

c738d51

rylnd and others added 15 commits June 17, 2024 23:20

Ensure test has clean setup

742503b

There were no less than four assertions in this test that relied on there being no other rules present in the environment, but nothing was being done to ensure that was the case. I can't imagine why these were skipped!

Remove exclusivity from FTR tests, add TODO for re-skipping

f5cbaa5

I want to run these in the flaky runner to get a sense of how/where they're still failing, for now.

Fix suppression fields for non-ML cases

8e9d4c5

We were over-eagerly disabling these fields when the ML checks were not relevant.

Merge pull request #9 from rylnd/ml_rule_suppression_warnings

e6aae21

ML Rule Suppression UI Improvements

Merge branch 'main' into ml_rule_alert_suppression

0cf3a51

Conflicts: x-pack/test/security_solution_cypress/config.ts

[CI] Auto-commit changed files from 'node scripts/eslint --no-cache -…

792373e

…-fix'

Revert change to form field types

a3117a1

The undefined value was coming from our abstracted hook, and the hook now correctly handles the undefined value. This union is no longer needed.

Update cypress tests to reflect latest copy

9e77939

Better assertion

a42b966

The former just fails with "element not found" while the current version will actually show the text that it did find.

Fix test assertion by loosening constraints

d71d5b0

There was an additional parameter added to the rule parameters. Rather than keep this test in sync with all of the possible parameters, we just assert on the ones we care about.

Fix type error in test helper

5f1e4fa

The rule params are typed as a union of all possible allowed values of `machine_learning_job_id`, while our helpers just expect `string[]`. Since the value in question is verifiably a `string[]`, I'm telling TS as much.

Debugging output

fbea139

Try filling the combobox with downarrow all the time

c9ece66

I saw this explicitly not work locally, but maybe CI is playing by different rules.

Stop both datafeeds and jobs between ML tests

4adac8b

I'm not sure this will have the intended effect (of making the job state more deterministic), but it's worth a shot.

rylnd force-pushed the test_ml_cypress_fixes branch from 3414ff1 to 4adac8b Compare June 21, 2024 20:48

Don't fail if no jobs found during setup

004c2dc

This endpoint will return a 404 if the job(s) being requested are not found. This should not fail the test.

rylnd added 2 commits June 21, 2024 16:50

Add ML feature flag for serverless cypress

86f4d6f

We have the analogous setting in the ess config, and the test-specific flags that get respected on CI, but I had not run the serverless cypress locally before, and neglected to add this config for that case.

rylnd closed this Jul 9, 2024

rylnd deleted the test_ml_cypress_fixes branch July 9, 2024 22:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[DRAFT] Attempting Cypress test fixes #186562

[DRAFT] Attempting Cypress test fixes #186562

rylnd commented Jun 20, 2024

rylnd commented Jun 20, 2024

rylnd commented Jun 20, 2024

rylnd commented Jun 21, 2024

rylnd commented Jun 21, 2024

rylnd commented Jun 21, 2024

kibana-ci commented Jun 21, 2024 •

edited

Loading

References to deprecated APIs

rylnd commented Jul 9, 2024

[DRAFT] Attempting Cypress test fixes #186562

[DRAFT] Attempting Cypress test fixes #186562

Conversation

rylnd commented Jun 20, 2024

rylnd commented Jun 20, 2024

rylnd commented Jun 20, 2024

rylnd commented Jun 21, 2024

rylnd commented Jun 21, 2024

rylnd commented Jun 21, 2024

kibana-ci commented Jun 21, 2024 • edited Loading

💔 Build Failed

Failed CI Steps

Test Failures

Metrics [docs]

Module Count

Async chunks

Page load bundle

References to deprecated APIs

History

rylnd commented Jul 9, 2024

kibana-ci commented Jun 21, 2024 •

edited

Loading