[FEATURE][ML] Stregthen source dest validations for DF analytics #43399

dimitris-athanasiou · 2019-06-19T20:11:10Z

This adds the following validations to the put and start data frame analytics APIs:

dest index name is valid
source index exists
dest index is not included in source index
dest index is matching a single index at most

This adds the following validations: - dest index name is valid - source index exists - dest index is not included in source index - dest index is matching a single index at most

elasticmachine · 2019-06-19T20:11:12Z

Pinging @elastic/ml-core

benwtrent

✅

benwtrent · 2019-06-20T12:52:51Z

x-pack/plugin/ml/src/main/java/org/elasticsearch/xpack/ml/dataframe/SourceDestValidator.java

+        this.indexNameExpressionResolver = Objects.requireNonNull(indexNameExpressionResolver);
+    }
+
+    public void check(DataFrameAnalyticsConfig config) {


I suppose it is just a Java thing, but I have trouble with creating an object with state, but an internal method is only called once. Seems like a static check(ClusterState clusterState, IndexNameExpressionResolver indexNameExpressionResolver, DataFrameAnalyticsConfig config) is fewer lines of code and less state to break.

Of course, I am hypocritical with this :). I often make these classes myself.

I know what you mean. I insist going for objects though because I think they lend themselves better for expansion. On the other hand, when one sees a static method it discourages refactoring to make an object. Of course, only time will tell. My view is that when paradigms are not clearly better or worse than alternatives, we have to try them out and wait for empirical evidence to reward us or slap us in the face :-)

droberts195

LGTM

I just left one idea for another test

droberts195 · 2019-06-20T13:15:40Z

x-pack/plugin/src/test/resources/rest-api-spec/test/ml/data_frame_analytics_crud.yml

+              "index": "index-source"
+            },
+            "dest": {
+              "index": "<script>Foo"


It might be worth adding one more test, where the destination index name is a valid wildcard, e.g. mydest*, to prove that wildcarded destinations are not allowed.

[FEATURE][ML] Stregthen source dest validations for DF analytics

f421ec7

This adds the following validations: - dest index name is valid - source index exists - dest index is not included in source index - dest index is matching a single index at most

dimitris-athanasiou added the :ml Machine learning label Jun 19, 2019

Fix failing tests

f9dfd65

benwtrent approved these changes Jun 20, 2019

View reviewed changes

droberts195 approved these changes Jun 20, 2019

View reviewed changes

dimitris-athanasiou added 2 commits June 21, 2019 17:22

Add test with dest index being a wildcard pattern

268a083

Fix test failure in security suite

7fe9410

dimitris-athanasiou merged commit d86bea5 into elastic:feature-ml-data-frame-analytics Jun 24, 2019

dimitris-athanasiou deleted the stregthen-source-dest-validations-for-df-analytics branch June 24, 2019 10:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEATURE][ML] Stregthen source dest validations for DF analytics #43399

[FEATURE][ML] Stregthen source dest validations for DF analytics #43399

dimitris-athanasiou commented Jun 19, 2019

elasticmachine commented Jun 19, 2019

benwtrent left a comment

benwtrent Jun 20, 2019

dimitris-athanasiou Jun 20, 2019

droberts195 left a comment

droberts195 Jun 20, 2019

[FEATURE][ML] Stregthen source dest validations for DF analytics #43399

[FEATURE][ML] Stregthen source dest validations for DF analytics #43399

Conversation

dimitris-athanasiou commented Jun 19, 2019

elasticmachine commented Jun 19, 2019

benwtrent left a comment

Choose a reason for hiding this comment

benwtrent Jun 20, 2019

Choose a reason for hiding this comment

dimitris-athanasiou Jun 20, 2019

Choose a reason for hiding this comment

droberts195 left a comment

Choose a reason for hiding this comment

droberts195 Jun 20, 2019

Choose a reason for hiding this comment