-
Notifications
You must be signed in to change notification settings - Fork 996
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
feat: Instrument Feast using Prometheus and OpenTelemetry (#4366)
feat: instrument feature store This commit adds opentelemetry to monitor Feast Signed-off-by: Twinkll Sisodia <tsisodia@redhat.com>
- Loading branch information
1 parent
8eceff2
commit a571e08
Showing
26 changed files
with
928 additions
and
75 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,108 @@ | ||
## Adding Monitoring | ||
To add monitoring to the Feast Feature Server, follow these steps: | ||
|
||
### Workflow | ||
|
||
Feast instrumentation Using OpenTelemetry and Prometheus - | ||
![Workflow](samples/workflow.png) | ||
|
||
### Deploy Prometheus Operator | ||
Follow the Prometheus Operator documentation to install the operator - | ||
https://github.com/prometheus-operator/prometheus-operator/blob/main/Documentation/user-guides/getting-started.md | ||
|
||
### Deploy OpenTelemetry Operator | ||
Before installing OTEL Operator, install `cert-manager` and validate the `pods` should spin up -- | ||
``` | ||
kubectl apply -f https://github.com/open-telemetry/opentelemetry-operator/releases/latest/download/opentelemetry-operator.yaml | ||
``` | ||
|
||
Follow the documentation for further installation steps - | ||
https://github.com/open-telemetry/opentelemetry-operator | ||
|
||
### Configure OpenTelemetry Collector | ||
Add the OpenTelemetry Collector configuration under the metrics section in your values.yaml file. | ||
|
||
Example values.yaml: | ||
|
||
``` | ||
metrics: | ||
enabled: true | ||
otelCollector: | ||
endpoint: "otel-collector.default.svc.cluster.local:4317" #sample | ||
headers: | ||
api-key: "your-api-key" | ||
``` | ||
|
||
### Add instrumentation annotation and environment variables in the deployment.yaml | ||
|
||
``` | ||
template: | ||
metadata: | ||
{{- with .Values.podAnnotations }} | ||
annotations: | ||
{{- toYaml . | nindent 8 }} | ||
instrumentation.opentelemetry.io/inject-python: "true" | ||
``` | ||
|
||
``` | ||
- name: OTEL_EXPORTER_OTLP_ENDPOINT | ||
value: http://{{ .Values.service.name }}-collector.{{ .Release.namespace }}.svc.cluster.local:{{ .Values.metrics.endpoint.port}} | ||
- name: OTEL_EXPORTER_OTLP_INSECURE | ||
value: "true" | ||
``` | ||
|
||
### Add checks | ||
Add metric checks to all manifests and deployment file - | ||
|
||
``` | ||
{{ if .Values.metrics.enabled }} | ||
apiVersion: opentelemetry.io/v1alpha1 | ||
kind: Instrumentation | ||
metadata: | ||
name: feast-instrumentation | ||
spec: | ||
exporter: | ||
endpoint: http://{{ .Values.service.name }}-collector.{{ .Release.Namespace }}.svc.cluster.local:4318 # This is the default port for the OpenTelemetry Collector | ||
env: | ||
propagators: | ||
- tracecontext | ||
- baggage | ||
python: | ||
env: | ||
- name: OTEL_METRICS_EXPORTER | ||
value: console,otlp_proto_http | ||
- name: OTEL_LOGS_EXPORTER | ||
value: otlp_proto_http | ||
- name: OTEL_PYTHON_LOGGING_AUTO_INSTRUMENTATION_ENABLED | ||
value: "true" | ||
{{end}} | ||
``` | ||
|
||
### Add manifests to the chart | ||
Add Instrumentation, OpenTelemetryCollector, ServiceMonitors, Prometheus Instance and RBAC rules as provided in the [samples/](https://github.com/feast-dev/feast/tree/91540703c483f1cd03b534a1a45bc4ccdcf79f81/infra/charts/feast-feature-server/samples) directory. | ||
|
||
For latest updates please refer the official repository - https://github.com/open-telemetry/opentelemetry-operator | ||
|
||
### Deploy Feast | ||
Deploy Feast and set `metrics` value to `true`. | ||
|
||
Example - | ||
``` | ||
helm install feast-release infra/charts/feast-feature-server --set metric=true --set feature_store_yaml_base64="" | ||
``` | ||
|
||
## See logs | ||
Once the opentelemetry is deployed, you can search the logs to see the required metrics - | ||
|
||
``` | ||
oc logs otelcol-collector-0 | grep "Name: feast_feature_server_memory_usage\|Value: 0.*" | ||
oc logs otelcol-collector-0 | grep "Name: feast_feature_server_cpu_usage\|Value: 0.*" | ||
``` | ||
``` | ||
-> Name: feast_feature_server_memory_usage | ||
Value: 0.579426 | ||
``` | ||
``` | ||
-> Name: feast_feature_server_cpu_usage | ||
Value: 0.000000 | ||
``` |
19 changes: 19 additions & 0 deletions
19
infra/charts/feast-feature-server/samples/instrumentation.yaml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,19 @@ | ||
apiVersion: opentelemetry.io/v1alpha1 | ||
kind: Instrumentation | ||
metadata: | ||
name: feast-instrumentation | ||
spec: | ||
exporter: | ||
endpoint: <endpoint> # eg: http://{{ .Values.service.name }}-collector.{{ .Release.Namespace }}.svc.cluster.local:4318 | ||
env: | ||
propagators: | ||
- tracecontext | ||
- baggage | ||
python: | ||
env: | ||
- name: OTEL_METRICS_EXPORTER | ||
value: console,otlp_proto_http | ||
- name: OTEL_LOGS_EXPORTER | ||
value: otlp_proto_http | ||
- name: OTEL_PYTHON_LOGGING_AUTO_INSTRUMENTATION_ENABLED | ||
value: "true" |
53 changes: 53 additions & 0 deletions
53
infra/charts/feast-feature-server/samples/otel-collector.yaml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,53 @@ | ||
# API reference https://github.com/open-telemetry/opentelemetry-operator/blob/main/docs/api.md | ||
# Refs for v1beta1 config: https://github.com/open-telemetry/opentelemetry-operator/issues/3011#issuecomment-2154118998 | ||
apiVersion: opentelemetry.io/v1beta1 | ||
kind: OpenTelemetryCollector | ||
metadata: | ||
name: otelcol | ||
spec: | ||
mode: statefulset | ||
image: otel/opentelemetry-collector-contrib:0.102.1 | ||
targetAllocator: | ||
enabled: true | ||
serviceAccount: opentelemetry-targetallocator-sa | ||
prometheusCR: | ||
enabled: true | ||
podMonitorSelector: {} | ||
serviceMonitorSelector: {} | ||
## If uncommented, only service monitors with this label will get picked up | ||
# app: feast | ||
config: | ||
receivers: | ||
otlp: | ||
protocols: | ||
grpc: {} | ||
http: {} | ||
prometheus: | ||
config: | ||
scrape_configs: | ||
- job_name: 'otelcol-collector' | ||
scrape_interval: 10s | ||
static_configs: | ||
- targets: [ '0.0.0.0:8888' ] | ||
|
||
processors: | ||
batch: {} | ||
|
||
exporters: | ||
logging: | ||
verbosity: detailed | ||
|
||
service: | ||
pipelines: | ||
traces: | ||
receivers: [otlp] | ||
processors: [batch] | ||
exporters: [logging] | ||
metrics: | ||
receivers: [otlp, prometheus] | ||
processors: [] | ||
exporters: [logging] | ||
logs: | ||
receivers: [otlp] | ||
processors: [batch] | ||
exporters: [logging] |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,16 @@ | ||
apiVersion: monitoring.coreos.com/v1 | ||
kind: ServiceMonitor | ||
metadata: | ||
labels: | ||
app: feast | ||
name: otel-sm-1 | ||
spec: | ||
endpoints: | ||
- port: metrics | ||
namespaceSelector: | ||
matchNames: | ||
- <namespace> # helm value - {{ .Release.Namespace }} | ||
selector: | ||
matchLabels: | ||
app.kubernetes.io/component: opentelemetry-collector | ||
app.kubernetes.io/managed-by: opentelemetry-operator |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,15 @@ | ||
kind: Prometheus | ||
metadata: | ||
name: prometheus | ||
spec: | ||
evaluationInterval: 30s | ||
podMonitorSelector: | ||
matchLabels: | ||
app: feast | ||
portName: web | ||
replicas: 1 | ||
scrapeInterval: 30s | ||
serviceAccountName: prometheus-k8s | ||
serviceMonitorSelector: | ||
matchLabels: | ||
app: feast |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,68 @@ | ||
apiVersion: v1 | ||
kind: ServiceAccount | ||
metadata: | ||
name: opentelemetry-targetallocator-sa | ||
--- | ||
apiVersion: rbac.authorization.k8s.io/v1 | ||
kind: ClusterRole | ||
metadata: | ||
name: opentelemetry-targetallocator-role-1 | ||
annotations: | ||
meta.helm.sh/release-name: "feast-release" | ||
meta.helm.sh/release-namespace: "feast-val" | ||
labels: | ||
app.kubernetes.io/managed-by: "Helm" | ||
rules: | ||
- apiGroups: | ||
- monitoring.coreos.com | ||
resources: | ||
- servicemonitors | ||
- podmonitors | ||
verbs: | ||
- '*' | ||
- apiGroups: [""] | ||
resources: | ||
- namespaces | ||
verbs: ["get", "list", "watch"] | ||
- apiGroups: [""] | ||
resources: | ||
- nodes | ||
- nodes/metrics | ||
- services | ||
- endpoints | ||
- pods | ||
verbs: ["get", "list", "watch"] | ||
- apiGroups: [""] | ||
resources: | ||
- configmaps | ||
verbs: ["get"] | ||
- apiGroups: | ||
- discovery.k8s.io | ||
resources: | ||
- endpointslices | ||
verbs: ["get", "list", "watch"] | ||
- apiGroups: | ||
- networking.k8s.io | ||
resources: | ||
- ingresses | ||
verbs: ["get", "list", "watch"] | ||
- nonResourceURLs: ["/metrics"] | ||
verbs: ["get"] | ||
--- | ||
apiVersion: rbac.authorization.k8s.io/v1 | ||
kind: ClusterRoleBinding | ||
metadata: | ||
name: opentelemetry-targetallocator-rb-1 | ||
annotations: | ||
meta.helm.sh/release-name: "feast-release" | ||
meta.helm.sh/release-namespace: "feast-val" | ||
labels: | ||
app.kubernetes.io/managed-by: "Helm" | ||
subjects: | ||
- kind: ServiceAccount | ||
name: opentelemetry-targetallocator-sa | ||
namespace: <namespace> # helm value - {{ .Release.Namespace }} | ||
roleRef: | ||
kind: ClusterRole | ||
name: opentelemetry-targetallocator-role-1 | ||
apiGroup: rbac.authorization.k8s.io |
16 changes: 16 additions & 0 deletions
16
infra/charts/feast-feature-server/samples/service-monitor.yaml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,16 @@ | ||
apiVersion: monitoring.coreos.com/v1 | ||
kind: ServiceMonitor | ||
metadata: | ||
labels: | ||
app: feast | ||
name: otel-sm | ||
spec: | ||
endpoints: | ||
- port: metrics | ||
namespaceSelector: | ||
matchNames: | ||
- <namespace> # helm value - {{ .Release.Namespace }} | ||
selector: | ||
matchLabels: | ||
app.kubernetes.io/component: opentelemetry-collector | ||
app.kubernetes.io/managed-by: opentelemetry-operator |
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.