[Proposal] Core Metrics in Kubelet #252
Conversation
I think we should document URL patterns for what is invoked to gather this information. I assume this is a new endpoint to the existing stats API.
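For reference, the kubelet's existing stats are served under paths like `/stats/summary`; a core-metrics endpoint would presumably be a sibling path, e.g. a hypothetical `GET /stats/core` (the proposal does not yet name one).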
### Background
The [Monitoring Architecture](https://github.com/kubernetes/kubernetes/blob/master/docs/design/monitoring_architecture.md) proposal contains a blueprint for a set of metrics referred to as "Core Metrics". The purpose of this proposal is to specify what those metrics are, and how they will be collected on the node.

CAdvisor is an open source container monitoring solution which only monitors containers, and has no concept of k8s constructs like pods or volumes. Kubernetes vendors cAdvisor into its codebase, and uses cAdvisor as a library with functions that enable it to collect metrics on containers. The kubelet can then combine container-level metrics from cAdvisor with the kubelet's knowledge of k8s constructs like pods to produce the kubelet Summary statistics, which provides metrics for use by the kubelet, or by users through the Summary API. cAdvisor works by collecting metrics at an interval (10 seconds), and the kubelet then simply querries these cached metrics whenever it has a need for them.
s/CAdvisor/cAdvisor
s/querries/queries
Note: The 10 seconds is a configurable interval.
It is very cumbersome to make changes or bugfixes in cAdvisor, because those changes then need to be vendored back into kubernetes.

CAdvisor is structured to collect metrics on an interval, which is appropriate for a stand-alone metrics collector. However, many functions in the kubelet are latency-sensitive (eviction, for example), and would benefit from a more "On-Demand" metrics collection design.
+1
// The memory capacity, in bytes
CapacityBytes *uint64 `json:"capacitybytes,omitempty"`
// The available memory, in bytes
AvailableBytes *uint64 `json:"availablebytes,omitempty"`
I know the definitions are probably intended to be consistent with cAdvisor, but maybe include more detail on what this field means.
+1. Is this based on working set or total usage?
updated.
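The updated wording isn't shown in this thread; below is a sketch of the kind of definition being requested, assuming the summary API's convention that available memory is capacity minus the working set:

```go
// MemoryStats is a sketch for illustration; the field names mirror the
// quoted proposal, but the comments here are assumed, not quoted.
type MemoryStats struct {
	// The total memory capacity of the node, in bytes.
	CapacityBytes *uint64 `json:"capacitybytes,omitempty"`
	// Memory available for allocation, in bytes: CapacityBytes minus
	// the working set (memory in active use that cannot be reclaimed
	// without swapping or OOM-killing).
	AvailableBytes *uint64 `json:"availablebytes,omitempty"`
}
```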
The code that provides this data currently resides in cAdvisor. I propose moving this to the kubelet as well.

```go
type CoreInfo struct {
	// ...
}
```
Is this supposed to be MachineInfo?
done.
Yeah. I think so too.
## Introduction

### Background
The [Monitoring Architecture](https://github.com/kubernetes/kubernetes/blob/master/docs/design/monitoring_architecture.md) proposal contains a blueprint for a set of metrics referred to as "Core Metrics". The purpose of this proposal is to specify what those metrics are, and how they will be collected on the node.
I still dislike the term "Core Metrics". The monitoring architecture referred to this as system metrics, and it split system metrics into core and non-core pieces. I would prefer we call this the "System resource metrics API" and let the non-core stuff be treated as opaque resource metrics APIs.
This was meant to be a proposal for the core metrics portion. Are you saying that we should expose system metrics rather than just core metrics through any external URL endpoint?
I think the argument is that the name "core metrics" is wrong, or not philosophically aligned with our general Kubernetes wide naming conventions (wearing my "what are we trying to build as a project" hat momentarily).
I have badgered Derek a bit about this because "core" isn't a great word to describe things. I was looking for a more precise name for this general approach, and in particular I want the name of this API group to align well with what it is surfacing. "metrics" is not appropriate (this is not generic metrics). "core-metrics" is not precise (it does not define what is core).
"resource-metrics", "system-resource-metrics", "utilization-metrics" are all more accurate. Before we merge this, I'd like to converge on a name that is not "core" to describe this subset of all possible metrics.
"resource-metrics", "system-resource-metrics", "utilization-metrics"
So, the name of the API served to expose these metrics to Kubernetes components (the HPA controller, the scheduler in the future) is the "resource metrics API" (also known as the "master metrics API", but that's a bit of a misnomer). I, personally, would lean towards the "system resource metrics API", "kubelet metrics API", or something along those lines, to distinguish it from the "resource metrics API".
Agreed with the argument about the term core-metrics. How about kubelet-resource-metrics?
I think basic metrics has the same ambiguous meaning as core metrics here. I heard @vishh's concern about kubelet- prefix for the naming. How about calling it node-resource-metrics or system-resource-metrics?
I will move all code pertaining to collection and processing of core metrics from cAdvisor into kubernetes.
I will vendor the new core metrics code back into cAdvisor.
I will modify volume stats collection so that it relies on this code.
I will modify the structure of stats collection code to be "On-Demand"
We need a mechanism to rate limit or control how frequently on-demand stat collection can occur. Did you have thoughts here? Even at existing 15s intervals, file system usage stats are very expensive to compute.
My current thinking is that queries would contain a recency time.Duration parameter. Pieces of the kubelet that need the most recent information can set it low, and pieces that can do with approximately correct measures could set it higher. We could set a minimum to enforce that the system is not overwhelmed.
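A minimal sketch of that idea (names, signatures, and the floor value are illustrative assumptions, not from the proposal):

```go
package stats

import (
	"sync"
	"time"
)

// Metrics stands in for whatever stats object is being collected.
type Metrics struct{ /* ... */ }

// minRecency is a floor on how fresh callers may demand stats,
// so on-demand collection cannot overwhelm the system.
const minRecency = 2 * time.Second

type onDemandProvider struct {
	mu        sync.Mutex
	cached    Metrics
	collected time.Time
	collect   func() Metrics // the expensive collection routine
}

// Get returns metrics no older than maxAge, recomputing only when
// the cached copy is too stale for the caller's recency requirement.
func (p *onDemandProvider) Get(maxAge time.Duration) Metrics {
	if maxAge < minRecency {
		maxAge = minRecency
	}
	p.mu.Lock()
	defer p.mu.Unlock()
	if time.Since(p.collected) > maxAge {
		p.cached = p.collect()
		p.collected = time.Now()
	}
	return p.cached
}
```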
+1. A lot of thought has gone into cAdvisor's stats collection to make it performant and resource efficient.
s/querries/queries s/CAdvisor/cAdvisor s/CoreInfo/MachineInfo
Currently, cAdvisor collects a large number of metrics related to system and container performance. However, only some of these metrics are consumed by the kubelet summary API, and many are not used. The kubelet summary API is published at the kubelet's summary API endpoint. Some of the metrics provided by the summary API are consumed internally; most are not.
nit: Define the summary API here.
done
### Motivations

Giving the kubelet the role of both providing metrics for its own use, and providing metrics for users, has a couple of problems:
- First, it is clearly inefficient to collect metrics and not use them. The kubelet uses only a small portion of the metrics it collects, and thus presents considerable extra overhead to any users who do not use them, or prefer a third party monitoring solution.
what overhead? State it explicitly.
Removed, in favor of leaving separate-pipeline motivations to the monitoring architecture proposal.
- Second, as the number of metrics collected grows over time, the kubelet will gain more and more overhead for collecting, processing, and publishing these metrics. Since the set of metrics users may want is unbounded, the kubelet's resource overhead could easily grow to unreasonable levels.
Why do you think the metrics collected will grow over time? Are you explaining the rationale behind wanting a separate "core metrics" pipeline here?
Yes. I will remove this, as it is explained in the monitoring architecture.
It is very cumbersome to make changes or bugfixes in cAdvisor, because those changes then need to be vendored back into kubernetes.
This will be the case moving forward for the entire organization since more components are expected to move into separate projects.
removed.
// MachineID reported by the node. For unique machine identification
// in the cluster this field is preferred. Learn more from man(5)
// machine-id: http://man7.org/linux/man-pages/man5/machine-id.5.html
MachineID string `json:"machineID" protobuf:"bytes,1,opt,name=machineID"`
Why do we have a proto annotation here and not for other objects?
removed
// SystemUUID reported by the node. For unique machine identification
// MachineID is preferred. This field is specific to Red Hat hosts
// https://access.redhat.com/documentation/en-US/Red_Hat_Subscription_Management/1/html/RHSM/getting-system-uuid.html
SystemUUID string `json:"systemUUID" protobuf:"bytes,2,opt,name=systemUUID"`
IIRC, @dchen1107 and @philips had intended to drop MachineID in favor of SystemUUID, or vice versa. Now might be the time to do that.
## Implementation Plan

I will move all code pertaining to collection and processing of core metrics from cAdvisor into kubernetes.
nit: Instead of `I`, suggest `dashpole@`.
or just don't mention the first person:
- move all code pertaining to collection and processing of core metrics...
changed to @dashpole
Tentative future work, not included in this proposal:
Obtain all runtime-specific information needed to collect metrics from the CRI.
Create a third party metadata API, whose function is to provide third party monitoring solutions with kubernetes-specific data (pod-container relationships, for example).
Modify cAdvisor to be "stand alone", and run in a separate binary from the kubelet. It will consume the above metadata API, and provide the summary API.
Is this a suggestion or a plan of record?
Suggested, I think. I view this mostly as a worst-case proposal. It may be that there is a better and easier way to continue to provide the summary API, but at least providing stand-alone cAdvisor is a viable path that achieves the high level requirements.
Kubelet has to avoid overcommitting CPU, and for that it needs CPU capacity. For any first-class resource that can be requested explicitly via the API, the kubelet will have to track capacity and availability.
…On Thu, Jan 5, 2017 at 1:56 PM, David Ashpole wrote, commenting on contributors/design-proposals/core-metrics-pipeline.md in #252:

> +Integration with CRI will not be covered in this proposal. In future proposals, integrating with CRI may provide a better abstraction of information required by the core metrics pipeline to collect metrics.
> +
> +## Design
> +
> +This design covers only the internal Core Metrics Pipeline.
> +
> +High level requirements for the design are as follows:
> + - Do not break existing users. We should continue to provide the full summary API by default.
> + - The kubelet collects the minimum possible number of metrics for full kubernetes functionality.
> + - Code for collecting core metrics resides in the kubernetes codebase.
> + - Metrics can be fetched "On Demand", giving the kubelet more up-to-date stats.
> +
> +Metrics requirements, based on kubernetes component needs, are as follows:
> + - Kubelet
> +  - Node-level capacity and availability metrics for Disk and Memory
>
> The kubelet does not use any CPU metrics, actually.
Addressed some nits, removed some of the motivation as it is covered by the Architecture proposal. s/DiskUsage/FilesystemUsage s/DiskResources/FilesystemResources
Ahh, right. CPU capacity is included in MachineInfo in cAdvisor's API, which I have moved to the CoreStats portion. But the kubelet doesn't consume any CPU usage or availability metrics.
Got it! Ideally, we need capacity, availability and usage for nodes, pods and containers. If they are available from a single source it will be user friendly.
…On Thu, Jan 5, 2017 at 2:00 PM, David Ashpole wrote, commenting on contributors/design-proposals/core-metrics-pipeline.md in #252:

> + - Container-level usage metrics for Disk, CPU, and Memory
> + - Horizontal-Pod-Autoscaler (HPA)
> +  - Node-level capacity and availability metrics for CPU and Memory
> +  - Pod-level usage metrics for CPU and Memory
> +
> +More details on how I intend to achieve these high level goals can be found in the Implementation Plan.
> +
> +In order to continue to provide the full summary API, either the kubelet or a stand-alone version of cAdvisor will need to publish these metrics.
> +
> +This Core Metrics API will be versioned to account for version-skew between kubernetes components.
> +
> +This proposal purposefully omits many metrics that may eventually become core metrics. This is by design. Once metrics are needed for an internal use case, they can be added to the core metrics API.
> +
> +### Proposed Core Metrics API:
> +
> +An important difference between the current summary api and the proposed core metrics api is that per-pod stats in the core metrics api contain only usage data, and not capacity-related statistics. This is more accurate since a pod's resource capacity is really defined by its "requests" and "limits", and it is a better reflection of how the kubelet uses the data. The kubelet finds which resources are constrained using node-level capacity and availability data, and then chooses which pods to take action on based on the pod's usage of the constrained resource. If necessary, capacity for resources a pod consumes can still be correlated with node-level resources using this format of stats.
>
> Sorry, maybe I am being too philosophical. All I am trying to convey is that it doesn't make sense for pods to have a "capacity". Rather, they have usage (xxx bytes of disk, for example), and a resource they are using (filesystem device, for example). A "Node" has resources, and a "Pod" has usage. But I can still find how much of a resource is available by correlating the two.
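A toy illustration of that correlation, with invented types (not the proposed API), ignoring non-pod system usage for brevity:

```go
package main

import "fmt"

// Invented types for illustration only.
type NodeFsStats struct{ CapacityBytes uint64 }
type PodFsUsage struct{ UsedBytes uint64 }

// availableBytes recovers availability by correlating node-level
// capacity with per-pod usage, so pods never carry a "capacity".
func availableBytes(node NodeFsStats, pods []PodFsUsage) uint64 {
	var used uint64
	for _, p := range pods {
		used += p.UsedBytes
	}
	if used > node.CapacityBytes {
		return 0
	}
	return node.CapacityBytes - used
}

func main() {
	node := NodeFsStats{CapacityBytes: 100 << 30} // 100 GiB
	pods := []PodFsUsage{{UsedBytes: 20 << 30}, {UsedBytes: 5 << 30}}
	fmt.Println(availableBytes(node, pods)) // 75 GiB, in bytes
}
```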
cc @lucab
@DirectXMan12 I was not aware of the "Resource Metrics API". Does this API encompass all metrics used by components in Kubernetes other than the kubelet? Is this API endpoint exposed by all nodes, or just the master node?
A couple of changes/nits.
This design covers only the internal Core Metrics Pipeline.

High level requirements for the design are as follows:
- Do not break existing users. We should continue to provide the full summary API by default.
I think @vishh is saying that you should qualify this, e.g.:
We should continue to provide the full summary API as an optional addon. Once the monitoring pipeline is converted to use the new API, the summary API will be moved out into a component which can optionally be served using a standalone copy of cAdvisor
or something to that effect.
That's supposed to be the end goal. It's called the "master metrics API" in the monitoring vision, and the "resource metrics API" elsewhere (e.g. https://github.com/kubernetes/community/blob/master/contributors/design-proposals/resource-metrics-api.md).
It is/will be exposed as a "normal" Kubernetes API server API -- it's not exposed by nodes. Currently, it's exposed several ways by Heapster; in the future, it will be exposed as a discoverable API by the …
@DirectXMan12 Thanks! See the updated requirements section.
Looking pretty good, but we should address the general naming question.
Rate is not well defined in the current proposal which uses on-demand stats gathering. Why not push that to the service which needs it, polling on the desired window (thereby enabling multiple averaging windows)?

…On Mon, Jan 23, 2017 at 5:34 PM, Vish Kannan wrote:
> Actually, UsageRateNanoCores or instantaneous usage is what we need. Use cases include HPA, kubectl top, dashboard, etc.
> LoadAvg I agree is not necessary now.
>
> On Mon, Jan 23, 2017 at 5:28 PM, Dawn Chen wrote:
> > It is much easier to add them later when required, but much harder to remove the existing ones. In this case, I think both UsageRateNanoCores and LoadAvg are not required for core metrics API for now.
From @vishh:
We already export cumulative CPU usage and the timestamp at which it is collected. Shouldn't the usage rate be derivable from those stats by the server when needed?
From @timstclair:
Agreed.
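As a sketch of that derivation (types and field names are assumed to mirror the proposed API, not taken from it):

```go
package main

import (
	"fmt"
	"time"
)

// cpuSample pairs cumulative usage with its collection timestamp,
// echoing the proposed Timestamp/CumulativeUsageNanoSeconds fields.
type cpuSample struct {
	CumulativeUsageNanoSeconds uint64
	Timestamp                  time.Time
}

// usageNanoCores derives the average usage rate between two samples;
// one core used continuously for the whole window yields 1e9.
func usageNanoCores(prev, cur cpuSample) uint64 {
	window := cur.Timestamp.Sub(prev.Timestamp)
	if window <= 0 {
		return 0
	}
	deltaNs := cur.CumulativeUsageNanoSeconds - prev.CumulativeUsageNanoSeconds
	// CPU-nanoseconds consumed per second of wall time = nanocores.
	return uint64(float64(deltaNs) / window.Seconds())
}

func main() {
	t0 := time.Now()
	prev := cpuSample{CumulativeUsageNanoSeconds: 5e9, Timestamp: t0}
	cur := cpuSample{CumulativeUsageNanoSeconds: 6e9, Timestamp: t0.Add(10 * time.Second)}
	fmt.Println(usageNanoCores(prev, cur)) // 100000000, i.e. 0.1 cores
}
```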
We have discussed and tried that in the past, and in the end realized that calculating rate at the node level is much simpler across the stack for resource management purposes.
From @vishh:
This is not true in today's Heapster implementation, which serves HPA, kubectl top, dashboard, etc. It appears that there are 2 types of cpu usage metrics:
In metrics/processors/rate_calculator.go (https://github.com/kubernetes/heapster/blob/master/metrics/processors/rate_calculator.go#L60-L63) there is special handling for core.MetricCpuUsage which has a strong assumption that it operates on cumulative values. I searched heapster's codebase; I think usageNanoCores is ignored by heapster completely.
LGTM
addressed comments.
Yes, as far as I know the usageNanoCores metric isn't used anywhere.
@dashpole can you squash all commits, then we are ready to go.
That's odd. I don't have time to go over heapster's code base now to see if the current behavior is desired or is just a remnant from the past when we did not have instantaneous metrics.

The primary reason for introducing "instantaneous usage" instead of "cumulative" is to make it easy for consumers to interpret cpu usage data. Given that we expose capacity in terms of integral cores, it is difficult for users to consume cumulative nanoseconds as usage. Exposing "instantaneous usage in cpu cores" will be a user friendly API IMHO.

As for heapster, if the node exposes instantaneous data, it should switch to that instead.
@dashpole can we please not merge PRs with 31 completely useless commits?
Sorry for being late to the party.
The set of metrics being collected looks reasonable and should meet metrics-server requirements.
A few comments inline.
// The time at which these Metrics were updated.
Timestamp metav1.Time
// Cumulative CPU usage (sum of all cores) since object creation.
CumulativeUsageNanoSeconds *uint64
Cumulative cpu usage needs to be accompanied by the time window over which it was collected.
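For illustration (the StartTime field and surrounding names are assumptions prompted by this review comment, not part of the proposal), the pairing might look like:

```go
package metrics

import (
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
)

// CPUMetrics pairs cumulative usage with the window it covers.
type CPUMetrics struct {
	// Start of the measurement window (e.g. object creation);
	// illustrative only, echoing the review request.
	StartTime metav1.Time
	// The time at which these Metrics were updated.
	Timestamp metav1.Time
	// Cumulative CPU usage (sum of all cores) since StartTime.
	CumulativeUsageNanoSeconds *uint64
}
```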
## Implementation Plan
@dashpole will modify the structure of metrics collection code to be "On-Demand".

Suggested, tentative future work, which may be covered by future proposals:
This should be under a "Future Improvements" section rather than the implementation plan.
Done.
To keep performance bounded while still offering metrics "On-Demand", all calls to get metrics are cached, and a minimum recency is established to prevent repeated metrics computation. Before computing new metrics, the previous metrics are checked to see if they meet the recency requirements of the caller. If the age of the metrics meets the recency requirements, then the cached metrics are returned. If not, then new metrics are computed and cached.

## Implementation Plan
@dashpole will modify the structure of metrics collection code to be "On-Demand".
High level comment:
IMO the purpose of such a technical proposal is to define what should be done and how, rather than who will be working on it. From a technical POV, that information is irrelevant here.
removed references to people
Implementation:
To keep performance bounded while still offering metrics "On-Demand", all calls to get metrics are cached, and a minimum recency is established to prevent repeated metrics computation. Before computing new metrics, the previous metrics are checked to see if they meet the recency requirements of the caller. If the age of the metrics meets the recency requirements, then the cached metrics are returned. If not, then new metrics are computed and cached.

## Implementation Plan
The implementation plan is rather poor. You should either expand it or remove it.
Removed in favor of a Future Work section.
- Kubernetes can be configured to run a default "third party metrics provider" as a daemonset. Possibly standalone cAdvisor.

## Rollout Plan
Once this set of metrics is accepted, @dashpole will begin discussions on the format, and design of the endpoint that exposes them. The node resource metrics endpoint (TBD) will be added alongside the current Summary API in an upcoming release. This should allow concurrent development of other portions of the system metrics pipeline (metrics-server, for example). Once this addition is made, all other changes will be internal, and will not require any API changes.
As we discussed offline, nobody is working on metrics-server in Q1/1.6. Since metrics-server will be the main consumer of the API, I think we should wait for it rather than add this in the upcoming release.
s/an upcoming/a future
It was not meant to mean this release, but an upcoming release. This should make that more explicit.
Makes sense.
## Rollout Plan
Once this set of metrics is accepted, @dashpole will begin discussions on the format, and design of the endpoint that exposes them. The node resource metrics endpoint (TBD) will be added alongside the current Summary API in an upcoming release. This should allow concurrent development of other portions of the system metrics pipeline (metrics-server, for example). Once this addition is made, all other changes will be internal, and will not require any API changes.
@dashpole will also start discussions on integrating with the CRI, and discussions on how to provide an out-of-the-box solution for the "third party monitoring" pipeline on the node. One current idea is a standalone version of cAdvisor, but any third party metrics solution could serve this function as well.
Not sure if I understand it correctly. According to the mentioned Monitoring Architecture vision, there won't be any out-of-the-box solution for the third-party monitoring pipeline, but rather clear integration points.
I'll remove that from the proposal then. When you come in March, we can discuss the best way to transition from our current metrics situation to the Monitoring Architecture you outlined.
@piosz I used the git "squash and merge" tool. It should only be one commit in the master branch, but you still see all the commits here.
@dashpole thanks for the explanation. Indeed there is only one commit in the master branch. Magic.
Continuation of this proposal.
Included in the Monitoring Architecture is the existence of a set of "Core Metrics". The goal of this proposal is to fully define what those metrics are, and provide a blueprint for how they will be collected on the node.
Issue: #39102
My analysis of what metrics we use and don't use in the kubelet can be found here.
cc: @kubernetes/sig-node-misc @kubernetes/sig-scheduling-misc @kubernetes/sig-autoscaling-misc @kubernetes/sig-instrumentation-misc
cc: @piosz @luxas @DirectXMan12