Add support when JetStream cluster not on kubernetes #4102

mfadhlika · 2023-01-13T07:08:38Z

I'm trying to use NATS Jetstream scaler, previously I workaround the lack of NATS Jetstream scaler by using Prometheus in middle of NATS and Keda. I run into issue with the NATS Jetstream scaler because our stack use 3 VMs for NATS Jetstream cluster and round robin load balancer.

Currently the scaler assume NATS node always have <node>.<monitoringURL> as URL for the node, which is not the case when NATS cluster is not on kubernetes. <monitoringURL>/varz endpoint respond with

curl -s -X GET "http://nats.company.internal:8222/varz" | jq .cluster
{
  "name": "nats",
  "addr": "172.0.0.2",
  "cluster_port": 4221,
  "auth_timeout": 2,
  "urls": [
    "172.0.0.3:4221", // <--- VM's IP
    "172.0.0.4:4221"
  ],
  "tls_timeout": 2
}

Because the scaler assume it's on kubernetes, it splits url by . (dot) and take first item as node and we get 172.nats.company.internal:8222 for the natsJetStreamMonitoringNodeURL as shown in code.

for _, clusterURL := range jetStreamServerResp.Cluster.HostUrls {
	node := strings.Split(clusterURL, ".")[0]
	natsJetStreamMonitoringNodeURL, err := s.getNATSJetStreamMonitoringNodeURL(node)
	if err != nil {
		return err
	}
	...
}

This PR only works if monitoring port of all node is the same as monitoringEndpoint

Checklist

Changelog has been updated and is aligned with our changelog requirements
Commits are signed with Developer Certificate of Origin (DCO - learn more)

Fixes #4101

semgrep-app · 2023-01-13T07:09:30Z

Semgrep found 6 sprintf-host-port findings:

pkg/scalers/nats_jetstream_scaler.go: L398, L383, L383, L398, L383, L398

use net.JoinHostPort instead of fmt.Sprintf($XX, nodeHostname)

_{Created by sprintf-host-port.}

zroubalik

unit tests are faling:

=== RUN   TestNATSJetStreamIsActive
    nats_jetstream_scaler_test.go:262: Expected error for 'Fail - Bad leader name (clustered)' but got success 
    nats_jetstream_scaler_test.go:266: Expected 'Fail - Bad leader name (clustered)' 'isActive=false', got 'true'
--- FAIL: TestNATSJetStreamIsActive (0.01s)
=== RUN   TestNewNATSJetStreamScaler
--- PASS: TestNewNATSJetStreamScaler (0.00s)
=== RUN   TestNATSJetStreamGetMetrics
    nats_jetstream_scaler_test.go:318: Expected error for 'Fail - Bad leader name (clustered)' but got success 
--- FAIL: TestNATSJetStreamGetMetrics (0.17s)

you can test this locally by running the individual test or make test

Also Static checks are failing because of the use of deprecated package.

CHANGELOG.md

mfadhlika · 2023-01-13T14:24:02Z

@zroubalik the failing unit tests probably no longer relevant because the changes don't use the consumer leader name as monitoring url anymore. Should I remove the test? or is there any other use case where bad leader name is unfavorable?

zroubalik

@mfadhlika yeah, sorry for the delay I missed the notification. Please update the unit tests to reflect the new behavior and also extend the coverage if possible.

zroubalik · 2023-02-21T18:36:07Z

@mfadhlika any update on this please?

mfadhlika · 2023-02-22T01:43:23Z

@zroubalik sorry, been busy past couple weeks. I'll try to update this weekend

CHANGELOG.md

zroubalik · 2023-03-02T09:56:16Z

/run-e2e nats*
Update: You can check the progress here

Signed-off-by: Muhammad Fadhlika <git@fadhlika.com>

zroubalik

LGTM

@mfadhlika thanks a lot for the contibution!

Signed-off-by: Muhammad Fadhlika <git@fadhlika.com>

rayjanoka · 2023-05-01T23:59:24Z

This change seems to have broken the scaler on kubernetes.

I'm thinking we need to put the test that was removed back and get these changes working with that.

I'll do some troubleshooting.

mfadhlika · 2023-05-02T00:49:11Z

@rayjanoka I did read your PR before working on this. All I did is replacing node monitoring url from <node name>.<nats monitoring url> to IPs/URLs from connectUrls. I was using nats cluster with 3 node without stream replica when first testing this

rayjanoka · 2023-05-02T01:34:10Z

hmm I don't see connect_urls in my /varz. Do I need to upgrade NATS or something?

➜ curl -s 'localhost:8223/varz' | grep connect_urls

rayjanoka · 2023-05-02T01:51:06Z

I see, I use --no_advertise in nats which does not include these connect_urls on purpose.

https://github.com/nats-io/nats-server/blob/main/server/route.go#LL1705C2-L1709C3

rayjanoka · 2023-05-02T01:54:02Z

So maybe we can add back the old way and make it backward compatible...?

I think I get this error because it can't find connect_urls.

result: runtime error: invalid memory address or nil pointer dereference

and maybe we can make the testing include my use case...

mfadhlika · 2023-05-02T01:59:08Z

I agree we should make it backward compatible, should we look for connect_urls then fallback to old way if empty or use some flag in config?

rayjanoka · 2023-05-02T02:48:57Z

yes, I think fallback is good if connect_urls doesn't exist.

mfadhlika requested a review from a team as a code owner January 13, 2023 07:08

mfadhlika force-pushed the main branch from 8e878c4 to 80efcb9 Compare January 13, 2023 07:21

zroubalik reviewed Jan 13, 2023

View reviewed changes

CHANGELOG.md Outdated Show resolved Hide resolved

mfadhlika force-pushed the main branch 3 times, most recently from 0718c0c to dbc8463 Compare January 13, 2023 14:19

mfadhlika mentioned this pull request Jan 17, 2023

NATS JetStream scaler assume the cluster runs on kubernetes #4101

Closed

mfadhlika force-pushed the main branch from dbc8463 to 47d5bc1 Compare January 24, 2023 10:39

zroubalik reviewed Feb 8, 2023

View reviewed changes

mfadhlika force-pushed the main branch 4 times, most recently from 25c7f9b to c02f62b Compare February 28, 2023 04:23

zroubalik reviewed Mar 2, 2023

View reviewed changes

CHANGELOG.md Outdated Show resolved Hide resolved

Add support when JetStream cluster not on kubernetes

d678015

Signed-off-by: Muhammad Fadhlika <git@fadhlika.com>

mfadhlika force-pushed the main branch from c02f62b to d678015 Compare March 2, 2023 12:39

zroubalik approved these changes Mar 2, 2023

View reviewed changes

zroubalik merged commit 55c9c74 into kedacore:main Mar 2, 2023

xoanmm pushed a commit to xoanmm/keda that referenced this pull request Mar 22, 2023

Add support when JetStream cluster not on kubernetes (kedacore#4102)

0d60c85

Signed-off-by: Muhammad Fadhlika <git@fadhlika.com>

mfadhlika mentioned this pull request May 4, 2023

NATS JetStream scaler broke if nodes are not advertised #4524

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support when JetStream cluster not on kubernetes #4102

Add support when JetStream cluster not on kubernetes #4102

mfadhlika commented Jan 13, 2023 •

edited

Loading

semgrep-app bot commented Jan 13, 2023

zroubalik left a comment •

edited

Loading

mfadhlika commented Jan 13, 2023 •

edited

Loading

zroubalik left a comment

zroubalik commented Feb 21, 2023

mfadhlika commented Feb 22, 2023

zroubalik commented Mar 2, 2023 •

edited by github-actions bot

Loading

zroubalik left a comment

rayjanoka commented May 1, 2023 •

edited

Loading

mfadhlika commented May 2, 2023 •

edited

Loading

rayjanoka commented May 2, 2023

rayjanoka commented May 2, 2023

rayjanoka commented May 2, 2023 •

edited

Loading

mfadhlika commented May 2, 2023 •

edited

Loading

rayjanoka commented May 2, 2023

Add support when JetStream cluster not on kubernetes #4102

Add support when JetStream cluster not on kubernetes #4102

Conversation

mfadhlika commented Jan 13, 2023 • edited Loading

Checklist

semgrep-app bot commented Jan 13, 2023

zroubalik left a comment • edited Loading

Choose a reason for hiding this comment

mfadhlika commented Jan 13, 2023 • edited Loading

zroubalik left a comment

Choose a reason for hiding this comment

zroubalik commented Feb 21, 2023

mfadhlika commented Feb 22, 2023

zroubalik commented Mar 2, 2023 • edited by github-actions bot Loading

zroubalik left a comment

Choose a reason for hiding this comment

rayjanoka commented May 1, 2023 • edited Loading

mfadhlika commented May 2, 2023 • edited Loading

rayjanoka commented May 2, 2023

rayjanoka commented May 2, 2023

rayjanoka commented May 2, 2023 • edited Loading

mfadhlika commented May 2, 2023 • edited Loading

rayjanoka commented May 2, 2023

mfadhlika commented Jan 13, 2023 •

edited

Loading

zroubalik left a comment •

edited

Loading

mfadhlika commented Jan 13, 2023 •

edited

Loading

zroubalik commented Mar 2, 2023 •

edited by github-actions bot

Loading

rayjanoka commented May 1, 2023 •

edited

Loading

mfadhlika commented May 2, 2023 •

edited

Loading

rayjanoka commented May 2, 2023 •

edited

Loading

mfadhlika commented May 2, 2023 •

edited

Loading