Grafana autocomplete of labels issue #186

Closed
geekhead opened this issue Jun 28, 2019 · 15 comments · Fixed by #197

Comments

@geekhead

Hey, I've just been trying out promxy sitting in front of two VictoriaMetrics instances and noticed that in Grafana the autocomplete of labels does not work as expected. I have to type another curly brace to get it to pop up, but having double curly braces is invalid. It's weird, but I've attached some screenshots of it.

This screenshot shows the correct behavior going straight to my VictoriaMetrics backend:
Screen Shot 2019-06-28 at 11 11 54 AM

This screenshot is going directly to Promxy, and it doesn't autocomplete until I add another curly brace:
Screen Shot 2019-06-28 at 11 13 26 AM

I'm not sure which side is the culprit, but let me know if you need any further info. Thanks

@jacksontj
Owner

That is interesting. When I run Grafana I actually don't get the autocomplete until I type the equals sign, a quote, and a character (both for Prometheus directly and for promxy):

Screenshot from 2019-06-28 11-23-30
Screenshot from 2019-06-28 11-24-00

Promxy doesn't implement anything special for Grafana, so the easiest way to check if there is a difference is to look at the API call that Grafana is making. If you open the developer tools in your browser, you'll see the two calls:

Screenshot from 2019-06-28 11-27-28

So to compare, I'd look to see if those results vary -- they should be identical (assuming it's promxy in front of the single instance). If they vary, could you share the results?
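If it helps, here's a minimal sketch (not promxy code; the backend URLs are placeholders for your setup) of issuing the same kind of /api/v1/series call against both endpoints so the responses can be compared side by side:

```go
package main

import (
	"fmt"
	"io"
	"net/http"
	"net/url"
)

// fetchSeries makes the kind of /api/v1/series call Grafana issues for
// label autocomplete and returns the status code plus the raw body.
func fetchSeries(base string) (string, error) {
	q := url.Values{}
	q.Set("match[]", `{__name__="up"}`)

	resp, err := http.Get(base + "/api/v1/series?" + q.Encode())
	if err != nil {
		return "", err
	}
	defer resp.Body.Close()

	body, err := io.ReadAll(resp.Body)
	if err != nil {
		return "", err
	}
	return fmt.Sprintf("%d %s", resp.StatusCode, body), nil
}

func main() {
	// Placeholder addresses: the direct backend and promxy in front of it.
	for _, base := range []string{"http://victoriametrics:8428", "http://promxy:8082"} {
		out, err := fetchSeries(base)
		if err != nil {
			fmt.Println(base, "error:", err)
			continue
		}
		fmt.Println(base, "->", out)
	}
}
```

If promxy is fronting that single instance, the two outputs should match.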

@geekhead
Author

geekhead commented Jul 1, 2019

@jacksontj Thanks for the response. I'll check it out and report back.

@geekhead
Author

geekhead commented Jul 2, 2019

@jacksontj The issue appears to be Promxy returning a status code 422 when Grafana tries to send the metric name via an XHR request to get a list of labels.

Here's the result going directly to a VictoriaMetrics backend data source:
Screen Shot 2019-07-02 at 11 41 08 AM

Screen Shot 2019-07-02 at 11 41 19 AM

And going directly to Promxy:
Screen Shot 2019-07-02 at 11 42 18 AM

Screen Shot 2019-07-02 at 11 42 24 AM

I also see the same failure reported in the Promxy logs (the encoded query decodes to match[]={__name__="up"}):
[ip address] - - [02/Jul/2019 15:48:41] "GET /api/v1/series HTTP/1.1 422 168" 0.001203 match%5B%5D=%7B__name__%3D%22up%22%7D

jacksontj added the bug label Jul 2, 2019
@jacksontj
Owner

Thanks for the details! With that I was able to reproduce the issue, and unfortunately it's an upstream client issue (prometheus/client_golang#614). TL;DR: if you don't pass a time range to that API call (as Grafana doesn't), Prometheus substitutes a huge default range that can't be validly converted to the timestamp format Prometheus requires, so the call fails. I have a few ideas on fixes, but we'll see what ideas upstream has :)
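To give a sense of scale, here's a minimal sketch of that default range, with values assumed to mirror the minTime/maxTime sentinels in Prometheus' API code (not necessarily promxy's exact code path):

```go
package main

import (
	"fmt"
	"math"
	"time"
)

func main() {
	// Defaults substituted when no start/end is passed to the series API
	// (assumed to mirror Prometheus' internal minTime/maxTime sentinels).
	minTime := time.Unix(math.MinInt64/1000+62135596801, 0).UTC()
	maxTime := time.Unix(math.MaxInt64/1000-62135596801, 999999999).UTC()

	fmt.Println(minTime) // hundreds of millions of years in the past
	fmt.Println(maxTime) // hundreds of millions of years in the future
}
```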

@jacksontj
Owner

I have a fix open upstream for this (prometheus/prometheus#5734), but unfortunately that'll require a change to the server (Prometheus or VictoriaMetrics); the root cause is a shortcoming in stdlib's time.Parse.
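To make that time.Parse shortcoming concrete, a small example (again using the assumed Prometheus-style maxTime sentinel): formatting such a far-out time to RFC3339 produces a year longer than four digits, which time.Parse then refuses to read back.

```go
package main

import (
	"fmt"
	"math"
	"time"
)

func main() {
	// Assumed Prometheus-style "maximum time" sentinel.
	maxTime := time.Unix(math.MaxInt64/1000-62135596801, 999999999).UTC()

	s := maxTime.Format(time.RFC3339Nano)
	fmt.Println("formatted:", s) // the year field runs to nine digits

	// The "2006" layout element only consumes four digits, so the
	// round trip through time.Parse fails.
	_, err := time.Parse(time.RFC3339Nano, s)
	fmt.Println("parse error:", err)
}
```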

@jacksontj
Owner

I have also created an issue on VictoriaMetrics (VictoriaMetrics/VictoriaMetrics#88) to handle this case. Unfortunately this is a server-side issue (i.e. in Prometheus or VM), so there's not a ton I can do from the promxy side.

@jacksontj
Owner

So while applying the upstream fix to promxy (solving the case where promxy is the downstream), I decided to add a promxy-side workaround as well. Both of these are in #194, which is included in https://github.com/jacksontj/promxy/releases/tag/v0.0.43.

I'll go ahead and close out this issue, as there is a pending fix upstream and a workaround within promxy in the latest release. If you are still seeing issues, feel free to reopen, but from my testing they should be fixed :)
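For anyone curious what a proxy-side workaround could look like in spirit, here is a hedged sketch of the general clamping idea (not the actual change in #194): bound out-of-range times to values the downstream can parse.

```go
package main

import (
	"fmt"
	"time"
)

// clampTime bounds t to a window the downstream API can represent.
// The bounds here are illustrative placeholders, not promxy's real limits.
func clampTime(t, min, max time.Time) time.Time {
	if t.Before(min) {
		return min
	}
	if t.After(max) {
		return max
	}
	return t
}

func main() {
	min := time.Date(1970, 1, 1, 0, 0, 0, 0, time.UTC)
	max := time.Now().UTC().Add(24 * time.Hour)

	veryOld := time.Time{} // the zero time, year 1
	fmt.Println(clampTime(veryOld, min, max)) // clamped up to min
}
```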

@geekhead
Author

geekhead commented Jul 7, 2019

Unfortunately, the issue still persists. I no longer get a 422 status code, but it still doesn't pre-populate the labels, and I see only this in the promxy debug logs:

DEBU[2019-07-07T15:17:26Z] Select matchers="[__name__=\"netdata_system_ram_MiB_average\"]" selectParams="<nil>" took="980.739µs"
172.31.69.141 - - [07/Jul/2019 15:17:26] "GET /api/v1/series HTTP/1.1 200 59" 0.001275 match%5B%5D=%7B__name__%3D%22netdata_system_ram_MiB_average%22%7D

@jacksontj
Owner

jacksontj commented Jul 7, 2019 via email

@jacksontj
Owner

Linking over here -- #193 (comment)

There was an issue with the workaround for timezones ahead of GMT; the fix is in #195.

jacksontj added a commit that referenced this issue Jul 7, 2019
jacksontj added a commit that referenced this issue Jul 7, 2019
@jacksontj
Owner

Once the better client fix is merged (prometheus/client_golang#617), I'll actually remove the workaround entirely, as it won't be needed.

@jacksontj
Owner

@geekhead FYI, I was able to reproduce similar behavior prior to the most recent fix; the problem I saw was an int64 overflow in UnixNano(), which ended up making the "startTime" hugely positive and the "endTime" hugely negative. So I expect that the issue is gone in this new release (I am no longer able to repro); as always, if you still see the issue, definitely reopen :)
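For reference, a minimal sketch of that kind of UnixNano() overflow (the year-3000 value is purely illustrative, not the exact timestamps involved here):

```go
package main

import (
	"fmt"
	"time"
)

func main() {
	// UnixNano() can only represent times roughly between the years
	// 1678 and 2262; anything further out overflows int64 and wraps.
	farFuture := time.Date(3000, 1, 1, 0, 0, 0, 0, time.UTC)

	fmt.Println(farFuture.Unix())     // seconds still fit in int64
	fmt.Println(farFuture.UnixNano()) // wraps around to a negative value
}
```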

@hatemosphere
Contributor

@jacksontj with 0.0.44 we are now getting another kind of error on HPA:

E0710 10:32:30.184118       1 provider.go:207] unable to update list of all metrics: unable to fetch metrics for query "traefik_entrypoint_requests_total{namespace!=\"\",pod!=\"\",job=\"traefik-ingress-external-addon\"}": execution: 422: "end"=9223309901257973760ms is out of allowed range [-9223372036854 ... 9223372036854]
E0710 10:32:30.190915       1 periodic_metric_lister.go:60] unable to update list of all metrics: unable to fetch metrics for query "rabbitmq_queue_messages_ready": execution: 422: "end"=9223309901257973760ms is out of allowed range [-9223372036854 ... 9223372036854]

Should I create another issue for that?

@jacksontj
Owner

@hatemosphere let's create another issue for that; it seems to be something with the downstream. When you create the issue, can you provide details on the setup you have? (I don't have a provider.go in promxy, so presumably that's VM or prom?)

@jacksontj
Owner

@hatemosphere it looks like your issue is from VictoriaMetrics, and it was fixed in a later version (VictoriaMetrics/VictoriaMetrics@54bd21e seems to be the fix).
