Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

RFE: Expose usage metrics via Che Server API #11242

Closed
3 tasks done
aditya-konarde opened this issue Sep 17, 2018 · 9 comments
Closed
3 tasks done

RFE: Expose usage metrics via Che Server API #11242

aditya-konarde opened this issue Sep 17, 2018 · 9 comments
Labels
kind/task Internal things, technical debt, and to-do tasks to be performed. severity/P1 Has a major impact to usage or development of the system.

Comments

@aditya-konarde
Copy link

aditya-konarde commented Sep 17, 2018

Description

As an engineer working on a maintaining a hosted Eclipse Che Server, I would like to have access to metrics on the the number of provisioned Che accounts on the Server. Currently, this can be obtained with queries to the backing database. The request here is to expose these metrics via a metrics endpoint, so that they can be consumed directly by a monitoring system.

The metrics I am looking at are:

The developers might be able to add in more metrics which are needed for development/troubleshooting. Exposing these will definitely help with a faster feedback loop for the developers, and help us look at trends in the usage for better capacity planning.

CC @ibuziuk

Reproduction Steps

Not applicable

OS and version:
Not applicable

Diagnostics:
Not applicable

@l0rd
Copy link
Contributor

l0rd commented Sep 18, 2018

@aditya-konarde I am afraid we don't have such a concept of a "super user" or "admin" in Che but @skabashnyuk may provide more details.

@ghost ghost added the kind/question Questions that haven't been identified as being feature requests or bugs. label Sep 19, 2018
@skabashnyuk
Copy link
Contributor

@aditya-konarde @l0rd we do have a concept of "super user" or "admin". We call it user with "manageSystem" permission.

Number of provisioned Che accounts

Small remark. If Che is connected to Keycloak or other OIDC provider it will mean a number of users who at least once used Che.

Number of Che Workspaces

ok

Number of Che workspace per user OR the average number of workspaces per user

We might need pagination here.

I'll try to make a prototype of json so we can make sure we are talking about the same thing.

@skabashnyuk skabashnyuk added kind/task Internal things, technical debt, and to-do tasks to be performed. kind/planning A checklist of issues for planning a particular sprint. team/platform and removed kind/question Questions that haven't been identified as being feature requests or bugs. kind/planning A checklist of issues for planning a particular sprint. labels Sep 19, 2018
@mshaposhnik
Copy link
Contributor

So it might be like:
GET /api/metrics?metric_type=total_accounts
GET /api/metrics?metric_type=total_workspaces
GET /api/metrics?metric_type=average_workspaces
with the JSON output in form:

{
 "total_workspaces": "136",
 "running_workspaces: "62",
 "timestamp" : "2018-10-23 13:34:55"
}

@aditya-konarde
Copy link
Author

@fche fyi^

@aditya-konarde
Copy link
Author

Best if we use a prometheus-style format: https://github.com/prometheus/docs/blob/master/content/docs/instrumenting/exposition_formats.md#text-format-example

@skabashnyuk
Copy link
Contributor

@aditya-konarde Interesting idea. I need to synchronise this approach with @l0rd @ibuziuk and @davidfestal . AFAIK they doing(going to do) something similar. See more redhat-developer/rh-che#950.

@ibuziuk
Copy link
Member

ibuziuk commented Nov 9, 2018

Number of provisioned Che accounts
Number of Che Workspaces
Number of Che workspace per user OR the average number of workspaces per user

Those metrics are really interesting and nice to have, but they will not really help from maintaining perspective. IMO, the most important metric that we could currently get is list of workspaces that are running more than X minutes (with info about user / namespace). I believe it should be the first usage metric exposed via Che Server API

@skabashnyuk
Copy link
Contributor

skabashnyuk commented Dec 2, 2018

@ibuziuk @aditya-konarde FYI

@ibuziuk
Copy link
Member

ibuziuk commented Dec 20, 2018

@skabashnyuk based on our SLA discussion I have created the following issues:

Please, consider those issues as a priority from the overall list of metrics.

@skabashnyuk skabashnyuk reopened this Mar 6, 2019
@skabashnyuk skabashnyuk added the severity/P1 Has a major impact to usage or development of the system. label Mar 6, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/task Internal things, technical debt, and to-do tasks to be performed. severity/P1 Has a major impact to usage or development of the system.
Projects
None yet
Development

No branches or pull requests

5 participants