Horizontal pod autoscaler through memory

Design proposals

Query K8S API

This HPA is designed to run in a pod (in "kube-system" namespace) in K8S cluster. The service account will be used to access K8S API.

Pull metrics

Memory metrics are pulled from Prometheus which should be deployed in the cluster and expose its service with K8S Service. Some parameters can be used to specify Prometheus Service:

  -prom-name string
        Name of Prometheus service (default "prometheus")
  -prom-namespace string
        Namespace of Prometheus service (default "kube-system")
  -prom-port int
        Port of Prometheus service (default 9090)
  -prom-scheme string
        Scheme of Prometheus service (default "http")

HPA resources

A 3rd party resource is created to define the memory-based HPA resource. It was defined similarly with K8S HorizontalPodAutoscaler:

type MemHpa struct {
	unversioned.TypeMeta `json:",inline"`
	// There is a bug when using 3rd party resources: https://github.com/kubernetes/client-go/issues/8
	// so ObjectMeta was combined not embedded
	MetaData v1.ObjectMeta `json:"metadata,omitempty"`
	Spec MemHPASpec `json:"spec,omitempty"`
	Status MemHPAScalerStatus `json:"status,omitempty"`
}

type MemHPASpec struct {
	ScaleTargetRef autoscaling.CrossVersionObjectReference `json:"scaleTargetRef"`
	MinReplicas *int32 `json:"minReplicas,omitempty"`
	MaxReplicas int32 `json:"maxReplicas"`
	TargetUtilizationPercentage *int32 `json:"targetUtilizationPercentage,omitempty"`
}

type MemHPAScalerStatus struct {
	ObservedGeneration *int64 `json:"observedGeneration,omitempty"`
	LastScaleTime *unversioned.Time `json:"lastScaleTime,omitempty"`
	CurrentReplicas int32 `json:"currentReplicas"`
	DesiredReplicas int32 `json:"desiredReplicas"`
	CurrentUtilizationPercentage int32 `json:"currentCPUUtilizationPercentage"`
}

type MemHpaList struct {
	unversioned.TypeMeta `json:",inline"`
	unversioned.ListMeta `json:"metadata,omitempty"`
	Items []MemHpa `json:"items"`
}

The client package in the project can be used to query the MemHpa resource

Autoscaling Algorithm

It is similar with K8S Horizontal Pod Autoscaling.

K8S list and watch API are used to watch modifications of MemHpa resources. Rescaling maybe triggered by one of following conditions:

A MemHpa resource was created or modified
or every 30 seconds

.spec.scaleTargetRef is used to fetch Pods and Scale subresource of the referenced pod controller. Pods are used to calculate sum of memory limits by which sum of metrics is divided to get utilization.

How to run

Build

Docker must be installed in your environment. Then just run:

make docker-build

Then the image will be built. If you want to specify another image name:

make IMAGE=your-image-name docker-build

Run the following command to build and push image:

make push

Run in K8S

You can use deployment-in-cluster.yaml to run this memory-based HPA controller in a K8S Deployment and create a MemHpa resource with memhpa-demo.yaml to reference your pod controller

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
Godeps		Godeps
apis		apis
app		app
client		client
controller		controller
k8s-compose/demo		k8s-compose/demo
vendor		vendor
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
Dockerfile.dev		Dockerfile.dev
Makefile		Makefile
README.md		README.md
hpa.go		hpa.go

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Horizontal pod autoscaler through memory

Design proposals

Query K8S API

Pull metrics

HPA resources

Autoscaling Algorithm

How to run

Build

Run in K8S

About

Releases

Packages

Languages

FlyingShit-XinHuang/memhpa

Folders and files

Latest commit

History

Repository files navigation

Horizontal pod autoscaler through memory

Design proposals

Query K8S API

Pull metrics

HPA resources

Autoscaling Algorithm

How to run

Build

Run in K8S

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages