Fix bug: too many cloudwatch metrics #1885

leon-barrett · 2016-10-11T18:18:17Z

Cloudwatch metrics were being added incorrectly. The most obvious
symptom of this was that too many metrics were being added. A simple
check against the name of the metric proved to be a sufficient fix. In
order to test the fix, a metric selection function was factored out.

leon-barrett · 2016-10-18T16:41:20Z

Hi, maintainers. Do you have any feedback about this pull request? We have a workaround for ourselves, but it would be nice to get this fix merged.

sparrc · 2016-11-03T18:25:43Z

from what I can tell there appears to be overlap between #1885 and #1903. I would appreciate if one of you could open an issue with a full writeup on what exactly is wrong the cloudwatch plugin.

As this plugin is particularly difficult to reproduce problems on, please be sure to include examples of requests and responses, and links to documentation where necessary.

Please also provide concrete examples that illustrate what is wrong with the current behavior of the plugin.

If I am wrong and there isn't overlap, then feel free to open two separate issues. In either case I think they need to be seen and discussed before I can accept a PR changing the behavior of this plugin.

johnrengelman · 2016-11-05T02:08:12Z

@sparrc I looked at this change and it looks correct. I verified using the provided spec. What is happening currently is that when using wildcard dimensions, the metric is selected multiple times because it has an inner loop that ranges over all available metrics from the Cloudwatch API but it doesn't compare that it's looking at the same metric that's configured.

So if I have this:

selectedMetrics = ["Connections"]

and Cloudwatch is returning:

availableMetrics = ["Connections", "Disk", "CPU"]

then the code is currently doing:

for _, sm := range selectedMetrics {
  for _, am := range availableMetrics {
    //if the available metric has the same possible dimensions as the metric being looked at, 
    //then it's appended.
    if (isSelected(am, dimension)) { 
      append(sm)
    }
  }
}

So the change here is correct in that it first checks that we are comparing our selected metric against only those metrics in cloudwatch of the same name.

johnrengelman · 2016-11-05T02:12:27Z

plugins/inputs/cloudwatch/cloudwatch.go

 				}
 				for _, name := range m.MetricNames {
 					for _, metric := range allMetrics {
-						if isSelected(metric, m.Dimensions) {
+						if name == *metric.MetricName && isSelected(metric, m.Dimensions) {


The name logic check should just be folded into the isSelected method.
So, isSelected(name, metric, m.Dimensions)

sparrc · 2016-11-05T09:07:04Z

OK, fair enough, thanks for the review @johnrengelman

leon-barrett · 2016-11-07T18:58:57Z

Thanks for taking a close look at my fix. I'll try to be more complete in explanations in future pull requests.

sparrc

Looks good, just update the changelog and I'll get it merged

sparrc · 2016-12-05T16:58:17Z

CHANGELOG.md

@@ -55,6 +55,7 @@ continue sending logs to /var/log/telegraf/telegraf.log.

 ### Bugfixes

+- [#1885](https://github.com/influxdata/telegraf/pull/1885): Fix over-querying of cloudwatch metrics


move this into "features" under the 1.2 section

Cloudwatch metrics were being added incorrectly. The most obvious symptom of this was that too many metrics were being added. A simple check against the name of the metric proved to be a sufficient fix. In order to test the fix, a metric selection function was factored out.

* Fix bug: too many cloudwatch metrics Cloudwatch metrics were being added incorrectly. The most obvious symptom of this was that too many metrics were being added. A simple check against the name of the metric proved to be a sufficient fix. In order to test the fix, a metric selection function was factored out. * Go fmt cloudwatch * Cloudwatch isSelected checks metric name * Move cloudwatch line in changelog to 1.2 features

leon-barrett force-pushed the too-many-metrics-fix branch from b39b1b6 to d838221 Compare October 11, 2016 18:19

sparrc closed this Nov 3, 2016

sparrc mentioned this pull request Nov 3, 2016

Fix unit testing w/ go 1.7.1 and support enhanced wildcard dimensions #1903

Closed

3 tasks

johnrengelman reviewed Nov 5, 2016

View reviewed changes

sparrc reopened this Nov 5, 2016

leon-barrett force-pushed the too-many-metrics-fix branch from e4e1fb4 to b5babc9 Compare November 7, 2016 19:01

johnrengelman mentioned this pull request Nov 8, 2016

Cloudwatch multi result #1795

Closed

2 tasks

sparrc added this to the 1.2.0 milestone Nov 25, 2016

sparrc approved these changes Dec 5, 2016

View reviewed changes

leon-barrett added 4 commits December 5, 2016 11:15

Go fmt cloudwatch

ddeeb15

Cloudwatch isSelected checks metric name

1800f9d

Move cloudwatch line in changelog to 1.2 features

83b8e80

leon-barrett force-pushed the too-many-metrics-fix branch from 315768c to 83b8e80 Compare December 5, 2016 19:19

Merge branch 'master' into too-many-metrics-fix

6900e10

sparrc merged commit 6e24161 into influxdata:master Dec 13, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix bug: too many cloudwatch metrics #1885

Fix bug: too many cloudwatch metrics #1885

leon-barrett commented Oct 11, 2016

leon-barrett commented Oct 18, 2016

sparrc commented Nov 3, 2016

johnrengelman commented Nov 5, 2016

johnrengelman Nov 5, 2016

leon-barrett Nov 7, 2016

sparrc commented Nov 5, 2016

leon-barrett commented Nov 7, 2016

sparrc left a comment

sparrc Dec 5, 2016

leon-barrett Dec 5, 2016

		@@ -55,6 +55,7 @@ continue sending logs to /var/log/telegraf/telegraf.log.

		### Bugfixes

		- [#1885](https://github.com/influxdata/telegraf/pull/1885): Fix over-querying of cloudwatch metrics

Fix bug: too many cloudwatch metrics #1885

Fix bug: too many cloudwatch metrics #1885

Conversation

leon-barrett commented Oct 11, 2016

leon-barrett commented Oct 18, 2016

sparrc commented Nov 3, 2016

johnrengelman commented Nov 5, 2016

johnrengelman Nov 5, 2016

Choose a reason for hiding this comment

leon-barrett Nov 7, 2016

Choose a reason for hiding this comment

sparrc commented Nov 5, 2016

leon-barrett commented Nov 7, 2016

sparrc left a comment

Choose a reason for hiding this comment

sparrc Dec 5, 2016

Choose a reason for hiding this comment

leon-barrett Dec 5, 2016

Choose a reason for hiding this comment