-
Notifications
You must be signed in to change notification settings - Fork 549
Conversation
2 similar comments
src/cleaner/scripts/clean_docker.py
Outdated
|
||
|
||
# Clean logic v1: kill largest container | ||
white_list = ["k8s_kube", "k8s_pylon", "k8s_zookeeper", "k8s_rest-server", "k8s_yarn", "k8s_hadoop", "k8s_job-exporter", "k8s_watchdog", "k8s_grafana", "k8s_node-exporter", "k8s_webportal", "k8s_prometheus", "k8s_nvidia-drivers", "k8s_etcd-container", "k8s_apiserver-container", "k8s_docker-cleaner", "kubelet"] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@fanyangCS
Seems we should add dev-box
here before we complete dev-box management
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please list the restrictions that the white list should follow. It is best to raise these questions on design phase not in PR review.
src/cleaner/scripts/clean_docker.py
Outdated
|
||
|
||
# Clean logic v1: kill largest container | ||
white_list = ["k8s_kube", "k8s_pylon", "k8s_zookeeper", "k8s_rest-server", "k8s_yarn", "k8s_hadoop", "k8s_job-exporter", "k8s_watchdog", "k8s_grafana", "k8s_node-exporter", "k8s_webportal", "k8s_prometheus", "k8s_nvidia-drivers", "k8s_etcd-container", "k8s_apiserver-container", "k8s_docker-cleaner", "kubelet"] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't remember we have some container starting with k8s_yarn
.
Whatever, I think you could simply regard k8s_
and kubelet
as our service.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sorry, fogget yarn exporter
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
As a conclusion in the meeting, we decided to list all our core services here and not just use k8s prefix to filter.
2. Add interval as var 3. Kill docker, send signal
Add markdown document for cleaner configuration
2. Add interval as var 3. Kill docker, send signal
Add markdown document for cleaner configuration
function kill_handler() | ||
{ | ||
printf "%s %s\n" \ | ||
"[INFO]" "Docker container killed due to disk pressure. If your job needs large disk space, please use HDFS or NFS to store your data." |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Container killed probably due to disk or memory pressure.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
After experiment, I changed the signal to SIGUSR1(10) and the container can handle the signal as expected. So if we trapped signal 10, means the container is killed due to disk pressure.
Clean logic V0.1