
Onboard ACStor targets #976 (Open)

shlokshah-dev wants to merge 11 commits into main from shlok/acstor-onboarding
Conversation

shlokshah-dev (Member)

PR Description

This PR onboards the scrape targets for Azure Container Storage (ACStor).

New Feature Checklist

  • List telemetry added about the feature.
  • Link to the one-pager about the feature.
  • List any tasks necessary for release (3P docs, AKS RP chart changes, etc.) after merging the PR.
  • Attach results of scale and perf testing.

Tests Checklist

  • Have end-to-end Ginkgo tests been run on your cluster and passed? To bootstrap your cluster to run the tests, follow these instructions.
    • Labels used when running the tests on your cluster:
      • operator
      • windows
      • arm64
      • arc-extension
      • fips
  • Have new tests been added? For features, have tests been added for this feature? For fixes, is there a test that could have caught this issue and could validate that the fix works?

@shlokshah-dev shlokshah-dev requested a review from a team as a code owner September 18, 2024 20:32
scrape_configs:
  - job_name: acstor-capacity-provisioner
    honor_labels: true
    scrape_interval: 10s
bragi92 (Member) commented Sep 18, 2024

I see that you're using 10s as the scrape interval instead of the default that we set using $$SCRAPE_INTERVAL$$. Any specific reason for this?

shlokshah-dev (Member, Author)

No specific reason as such; I used 10s because that's what we use in our current setup. Do you anticipate any concerns with deviating from the default?

bragi92 (Member)

I think using the default makes sense to keep everything consistent unless there is a good reason to deviate from it. The pros of 30-second scrapes are reduced resource load on the pod and reduced data volume (which can also help with query performance).

Based on the list of metrics in the minimal ingestion profile, I would recommend using the 30-second value, i.e. the $$SCRAPE_INTERVAL$$ parameter, which we replace during config map parsing.

shlokshah-dev (Member, Author)

@bragi92 Updated the scrape_interval to use the default!
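
For reference, a minimal sketch of what the updated job would look like, assuming only the $$SCRAPE_INTERVAL$$ substitution described above (the other fields are carried over from the original snippet):

scrape_configs:
  - job_name: acstor-capacity-provisioner
    honor_labels: true
    # placeholder replaced with the 30s default during config map parsing
    scrape_interval: $$SCRAPE_INTERVAL$$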

@shlokshah-dev shlokshah-dev force-pushed the shlok/acstor-onboarding branch 2 times, most recently from d054b41 to 04664f9 Compare September 19, 2024 22:50
        action: keep
        regex: metrics

      # If prometheus.io/path is specified, scrape this path instead of /metrics
Contributor

Please remove all the commented-out parts of the config. I put them in for future reference when I converted them...

shlokshah-dev (Member, Author)

Done
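
For context, the commented-out line above refers to the common pod-annotation relabeling convention; a hedged sketch of what such a rule typically looks like upstream (not necessarily the exact rule that was removed here):

relabel_configs:
  # If prometheus.io/path is set on the pod, scrape that path instead of /metrics
  - source_labels: [__meta_kubernetes_pod_annotation_prometheus_io_path]
    action: replace
    target_label: __metrics_path__
    regex: (.+)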

@@ -29,6 +29,8 @@ data:
   controlplane-kube-scheduler = false
   controlplane-kube-controller-manager = false
   controlplane-etcd = true
+  acstor-capacity-provisioner = false
Contributor

Don't we want to default both of these new targets to "true"? That way, the customer doesn't have to go through an additional step to enable them through the config map.

shlokshah-dev (Member, Author)

But what if a cluster doesn't have ACStor enabled? Would keeping the default as true produce any errors when the targets aren't found?

Contributor

Since these are pod discoveries, if those pods aren't there, nothing will be discovered and hence nothing will be scraped.
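
To illustrate the point being made: with pod-based discovery, targets only exist while matching pods exist. A minimal sketch of the pattern (the label selector below is illustrative, not the actual relabel rule from this PR):

scrape_configs:
  - job_name: acstor-capacity-provisioner
    kubernetes_sd_configs:
      - role: pod              # targets come from live pods; no pods, no targets
    relabel_configs:
      # keep only the capacity-provisioner pods (label name/value illustrative)
      - source_labels: [__meta_kubernetes_pod_label_app]
        action: keep
        regex: capacity-provisioner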

shlokshah-dev (Member, Author)

Makes sense. So if ACStor is installed later, will the customer need to re-apply the config map, or does the discovery happen automatically?

shlokshah-dev (Member, Author)

Updated
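
(On the follow-up question above, a known Prometheus behavior: kubernetes_sd_configs watches the API server continuously, so pods created later are discovered automatically without re-applying the config map.) After this update, the defaults excerpt above presumably reads as follows; only the key visible in the diff is shown, since the second ACStor target discussed in the thread does not appear in the excerpts:

controlplane-kube-scheduler = false
controlplane-kube-controller-manager = false
controlplane-etcd = true
acstor-capacity-provisioner = true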

@@ -25,6 +25,8 @@ func (fcl *FilesystemConfigLoader) SetDefaultScrapeSettings() (map[string]string
 	config["networkobservabilityHubble"] = "true"
 	config["networkobservabilityCilium"] = "true"
 	config["noDefaultsEnabled"] = "false"
+	config["acstor-capacity-provisioner"] = "false"
Contributor

default to true for both?

shlokshah-dev (Member, Author)

Updated

@@ -93,7 +95,8 @@ func processConfigMap() map[string]string {
 	"WINDOWSKUBEPROXY_SCRAPE_INTERVAL", "PROMETHEUS_COLLECTOR_HEALTH_SCRAPE_INTERVAL",
 	"POD_ANNOTATION_SCRAPE_INTERVAL", "KAPPIEBASIC_SCRAPE_INTERVAL",
 	"NETWORKOBSERVABILITYRETINA_SCRAPE_INTERVAL", "NETWORKOBSERVABILITYHUBBLE_SCRAPE_INTERVAL",
-	"NETWORKOBSERVABILITYCILIUM_SCRAPE_INTERVAL",
+	"NETWORKOBSERVABILITYCILIUM_SCRAPE_INTERVAL", "ACSTORCAPACITYPROVISIONER_SCRAPE_INTERVAL",
vishiy (Contributor) commented Sep 25, 2024

Can you pick up these 2 new environment vars and add them to our telemetry in telemetry.go (under fluent-bit/src)? This will help us determine whether these targets are enabled for any given cluster where managed Prometheus is enabled. You could also add your keep lists to telemetry, so we can see which other metrics customers are adding to the defaults and extend our defaults in the future as needed. This telemetry is also in the same file as in the above comment...

shlokshah-dev (Member, Author)

Added the env variables and keep lists to telemetry.go

bragi92 (Member) commented Oct 7, 2024

/azp run


Azure Pipelines successfully started running 1 pipeline(s).
