UNCLASSIFIED - NO CUI

Snippets Groups Projects

Currently supported Big Bang Version is 2.51

Project 'platform-one/big-bang/apps/core/monitoring' was moved to 'big-bang/product/packages/monitoring'. Please update any links and bookmarks that may still have the old path.

Prometheus scrapes completed pods

Currently Prometheus will attempt to scrape all endpoints regardless of the pod status. This surfaced with Anchore, which runs several jobs with istio sidecars. Prometheus will attempt to scrape metrics from the sidecars even after the jobs have completed. This can be seen in the Prometheus targets list when deploying istio + anchore + sso or an upgrade to trigger jobs.

A brief investigation yielded this PR which seems to indicate Prometheus has knowledge of the pods' status/phase - https://github.com/prometheus/prometheus/pull/4824

What we want to do is somehow exclude resources from the targets when they are in a non-Running phase. There may be a way to do this via config in the helm chart for monitoring, or possibly a setting in each servicemonitor? Don't be afraid to reach out/post an issue upstream with the Prometheus community to find out what options we might have.

There could be a number of outcomes from this.

AC (if feature is available to do this):

Prometheus/ServiceMonitors updated to only scrape Running pods

AC (if feature is not available):

Upstream issue posted as feature request to be able to only scrape Running pods, new BB issue opened to track that one

1 of 2 checklist items completed · Edited 2 years ago

Designs

Child items ...

Activity

Micah Nagel added monitoring teamcore/security labels 3 years ago

added monitoring teamcore/security labels
Micah Nagel changed title from Have Prometheus NOT scrape completed pods to Prometheus scrapes completed pods 3 years ago

changed title from Have Prometheus NOT scrape completed pods to Prometheus scrapes completed pods
Micah Nagel added kindbug label 3 years ago

added kindbug label
Jason Krause added priority6 label 3 years ago

added priority6 label
Jason Krause set weight to 3 3 years ago

set weight to 3
bigbang bot added stale label 3 years ago

added stale label
bigbang bot removed stale label 3 years ago

removed stale label
bigbang bot added stale label 2 years ago

added stale label
Micah Nagel @micah.nagel · 2 years ago

Author Contributor

Was notified of this issue upstream - https://github.com/prometheus-operator/prometheus-operator/issues/4816

It seems to indicate that there is a path forward with relabelings on the podMonitor. I'll be exploring that to see if its viable.
Micah Nagel assigned to @micah.nagel 2 years ago

assigned to @micah.nagel
Micah Nagel mentioned in merge request !148 (merged) 2 years ago

mentioned in merge request !148 (merged)
Micah Nagel removed stale label 2 years ago

removed stale label
Micah Nagel added statusdoing label 2 years ago

added statusdoing label
Micah Nagel changed milestone to %1.37.0 2 years ago

changed milestone to %1.37.0
Micah Nagel changed iteration to Big Bang Iterations Jun 14, 2022 - Jun 27, 2022 2 years ago

changed iteration to Big Bang Iterations Jun 14, 2022 - Jun 27, 2022
Micah Nagel added statusreview label and removed statusdoing label 2 years ago

added statusreview label and removed statusdoing label
Micah Nagel marked the checklist item Prometheus/ServiceMonitors updated to only scrape Running pods as completed 2 years ago

marked the checklist item Prometheus/ServiceMonitors updated to only scrape Running pods as completed
Ryan Garcia closed with merge request big-bang/bigbang!1793 (merged) 2 years ago

closed with merge request big-bang/bigbang!1793 (merged)
bigbang bot removed statusreview label 2 years ago

removed statusreview label

Please register or sign in to reply

Due date

None

Health status

None

Confidentiality

Confidentiality controls have moved to the issue actions menu () at the top of the page.

0 Participants

UNCLASSIFIED - NO CUI