1.0 stabilization #124
As discussed in the last SIG instrumentation meeting, we plan to do a first stable release of kube-state-metrics. As we have mostly been adding functionality for a while, rather than changing existing functionality, there is nothing fundamental to change here.

Comments
I think we'd probably recommend documenting what the compatibility expectations of that feature are going forward (in a doc in that repo), making sure there is a process for API changes reasonably consistent with those goals, and making sure the feature repo contains an issue for the graduated feature.
@loburm will help with scalability tests
@brancz for the scalability testing we need to have some scenario to test against. I think we should concentrate mostly on testing metrics related to nodes and pods (I assume the other parts consume a significantly smaller amount of resources). How many nodes and pods should be present in the test scenario?
@loburm I'm completely new to the load tests, so I suggest starting with whatever seems reasonable to you. My thoughts are the same as yours: the number of pod metrics is expected to increase linearly with the number of other objects, so focusing on those and on nodes sounds perfect for our load scenarios. Testing with the recommended upper bound of pods/nodes in a single cluster would be best to see if we can actually handle it, but I'm not sure that's reasonable given that we have never performed load tests before.
We had a chat offline and we will try to test the following scenarios:
@loburm will verify:
One issue (that we can only do so much about) is the size of `/metrics` and the time it takes Prometheus to scrape it. Putting some bound on that could inform future decisions on adding metrics.
Thanks for the heads up @matthiasr! Yes, that's one of the bottlenecks I can see happening. We may have to start thinking of sharding strategies for kube-state-metrics.
Do you think it would be possible to have some kind of pagination support for `/metrics`?
How about we split the scrape endpoints according to the different collectors, i.e., pod state metrics would be available on their own endpoint. Or, how about we support both the combined `/metrics` endpoint and the per-collector endpoints? Cons: this will make Prometheus configuration more complicated.
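For illustration only, a rough sketch of this per-collector split using the Prometheus Go client; this is not how kube-state-metrics is actually wired, and the metric names and labels are simplified stand-ins. Each path would become its own scrape target, which is exactly the extra Prometheus configuration cost mentioned above:

```go
package main

import (
	"log"
	"net/http"

	"github.com/prometheus/client_golang/prometheus"
	"github.com/prometheus/client_golang/prometheus/promhttp"
)

func main() {
	// One registry per collector group, each exposed on its own scrape path.
	podRegistry := prometheus.NewRegistry()
	nodeRegistry := prometheus.NewRegistry()

	// Simplified stand-ins for the real pod and node collectors.
	podInfo := prometheus.NewGaugeVec(
		prometheus.GaugeOpts{Name: "kube_pod_info", Help: "Information about pods."},
		[]string{"namespace", "pod"},
	)
	nodeInfo := prometheus.NewGaugeVec(
		prometheus.GaugeOpts{Name: "kube_node_info", Help: "Information about nodes."},
		[]string{"node"},
	)
	podRegistry.MustRegister(podInfo)
	nodeRegistry.MustRegister(nodeInfo)

	// Each path has to be configured as a separate scrape target in Prometheus.
	http.Handle("/metrics/pods", promhttp.HandlerFor(podRegistry, promhttp.HandlerOpts{}))
	http.Handle("/metrics/nodes", promhttp.HandlerFor(nodeRegistry, promhttp.HandlerOpts{}))
	log.Fatal(http.ListenAndServe(":8080", nil))
}
```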
What @andyxning is suggesting is certainly possible, but is likely to just postpone the problem. @piosz I'm not aware of any precedent for that, but paging within the same instance of kube-state-metrics would also just postpone the problem, as I can imagine that memory consumption is also very large in cases where response timeouts are hit.
@andyxning I think that will add unnecessary complexity, and as I understand it the common convention is to expose all metrics on `/metrics`.

And let me first perform some tests; once we have some real numbers, we can start thinking about possible issues and how they can be solved.
Completely agree with @loburm, measure first.
Yeah, I didn't mean this as "needs immediate changes", but it would be good to measure and monitor for regressions. Our cluster is fairly sizable, and the response is big, but not unmanageable. For now I'd just like to have a rough idea of what to expect as we grow the cluster further :) Even "if your cluster has >10k pods, raise the scrape timeout to at least 20s" is something to work with.
Agreed with @loburm. Btw, it still means adding more Prometheus configuration for a single cluster. :)
@loburm any updates on how the scalability tests are coming along?
Yesterday I finished testing kube-state-metrics on 100 and 500 node clusters. Today I'm trying to run it on a 1000 node cluster, but I'm having small problems with the density test. Based on the first numbers I can say that memory, CPU, and latency depend on the number of nodes almost linearly. I'll prepare a small report soon and share it with you.
Sorry that it took so much time; running the scalability test on a 1000 node cluster was a bit tricky. I have written all the numbers down in this doc: https://docs.google.com/document/d/1hm5XrM9dYYY085yOnmMDXu074E4RxjM7R5FS4-WOflo/edit?usp=sharing
Thank you very much @loburm. Overall I see no concerns around scalability. In fact, we are quite surprised the memory usage stays that low. That should make us good to go for 1.0 soon.
I am curious about the three stages in the doc ("Empty cluster - cluster without pods (only a system one present)", "Loaded - 30 pods per node in average", "After request - cpu and memory usage during metrics fetching"). Can you please explain them in more detail? :) Does "only a system one present" mean only one system pod? And what is the difference between "Loaded" and "After request"?
Sweet! Should we distill this into a recommendation for resources? 2MB per node (minimum 200MB) + 0.001 cores per node (minimum 0.01)?
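Purely as a sketch of the rule of thumb above (the numbers come from this comment, not from any official recommendation), the heuristic could be expressed as:

```go
package main

import "fmt"

// recommend returns a rough memory (MB) and CPU (cores) request for
// kube-state-metrics given the cluster's node count, using the per-node
// heuristic from the comment above: 2 MB and 0.001 cores per node, with
// floors of 200 MB and 0.01 cores.
func recommend(nodes int) (memoryMB, cpuCores float64) {
	memoryMB = 2 * float64(nodes)
	if memoryMB < 200 {
		memoryMB = 200
	}
	cpuCores = 0.001 * float64(nodes)
	if cpuCores < 0.01 {
		cpuCores = 0.01
	}
	return memoryMB, cpuCores
}

func main() {
	for _, n := range []int{100, 500, 1000} {
		mem, cpu := recommend(n)
		fmt.Printf("%d nodes: ~%.0f MB memory, ~%.3f cores\n", n, mem, cpu)
	}
}
```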
@andyxning an empty cluster has only the pods that belong to the kube-system namespace and those created by the scalability test at the beginning: on average that's around 4-5 pods per node to start with, so at the end we really have 34-35 pods per node. "Loaded" was measured once the cluster had stabilized after all pods were created. "After request" means after fetching metrics from "/metrics" - that really increases memory usage and gives a short peak in CPU usage.
@loburm Got it. Thanks for the detailed explanation.
Thanks @loburm. It seems that from a scalability point of view kube-state-metrics is ready for 1.0.
Let's do RCs.
rc.1 is out: I published quay.io/coreos/kube-state-metrics:v1.0.0-rc.1 for testing, and @loburm will publish the image on gcr.io within the next half hour.
@loburm has now published the image on gcr: gcr.io/google_containers/kube-state-metrics:v1.0.0-rc.1
Some additional metrics from a reasonably large production cluster (on 1.0.0 plus the fix for the owner NPE).
@smarterclayton good to know
OK, I confirm that the last unchecked item, scaling with cluster size using the pod nanny, is done.
#200 has added support for providing a deployment manifest that scales with cluster size using the pod nanny, so I'm closing this now.