Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Make info logging non-blocking #209

Closed
marseel opened this issue Feb 8, 2021 · 14 comments
Closed

Make info logging non-blocking #209

marseel opened this issue Feb 8, 2021 · 14 comments
Labels
kind/feature Categorizes issue or PR as related to a new feature. lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale.

Comments

@marseel
Copy link
Member

marseel commented Feb 8, 2021

/kind feature

Describe the solution you'd like
I would like to make it possible to make logging with info level non-blocking.
Draft idea:

  • Add buffer for logging with limit of X logs that can be stored.
  • If buffer is not full, add log to buffer
  • If buffer is full, increment counter of skipped logs
  • Every Y seconds check if some logs were skipped and log aggregated information like "Z info logs were skipped" with warning level.

This change would make info logs best effort and non-blocking.

Anything else you would like to add:
During scalability tests of kubernetes, we've observed that logging can significantly degrade performance of masters. Under high load, logging was taking up to 10s.

@k8s-ci-robot k8s-ci-robot added the kind/feature Categorizes issue or PR as related to a new feature. label Feb 8, 2021
@marseel
Copy link
Member Author

marseel commented Feb 8, 2021

/cc @serathius

@serathius
Copy link

Having info logs as non-blocking sounds reasonable assuming that we will not change the default behavior, but introduce a flag to maintain backward compatibility
/cc @thockin @DirectXMan12 @dims
WDYT?

@dims
Copy link
Member

dims commented Feb 8, 2021

@serathius i'll let the others chime in first. my question would be, is there a prior art in some other logging library?

@wojtek-t
Copy link
Member

wojtek-t commented Feb 9, 2021

my question would be, is there a prior art in some other logging library?

I don't know open-source logging libraries TBH, but I know that this is something we're doing internally.

@fejta-bot
Copy link

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-contributor-experience at kubernetes/community.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label May 10, 2021
@mborsz
Copy link
Member

mborsz commented May 10, 2021

/remove-lifecycle stale

@k8s-ci-robot k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label May 10, 2021
@erain
Copy link

erain commented Jun 10, 2021

/cc @erain

@k8s-triage-robot
Copy link

The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue or PR as fresh with /remove-lifecycle stale
  • Mark this issue or PR as rotten with /lifecycle rotten
  • Close this issue or PR with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Sep 8, 2021
@wojtek-t
Copy link
Member

wojtek-t commented Sep 8, 2021

/remove-lifecycle stale

@k8s-ci-robot k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Sep 8, 2021
@serathius
Copy link

Related change that introduces buffering info logs in Json logging format kubernetes/kubernetes#104873

@serathius
Copy link

@marseel Info log buffering was implemented in v1.23 kubernetes as part of JSON log format. Please check if this fulfills your requirements.

@k8s-triage-robot
Copy link

The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue or PR as fresh with /remove-lifecycle stale
  • Mark this issue or PR as rotten with /lifecycle rotten
  • Close this issue or PR with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Feb 22, 2022
@serathius
Copy link

Closing as feature was implemented.
/close

@k8s-ci-robot
Copy link

@serathius: Closing this issue.

In response to this:

Closing as feature was implemented.
/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/feature Categorizes issue or PR as related to a new feature. lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale.
Projects
None yet
Development

No branches or pull requests

9 participants