This is a cache of https://docs.openshift.com/rosa/observability/overview/index.html. It is a snapshot of the page at 2024-11-25T03:20:56.496+0000.
About Observability - Observability overview | Observability | Red Hat OpenShift Service on AWS
×

Red Hat OpenShift Observability provides real-time visibility, monitoring, and analysis of various system metrics, logs, and events to help users quickly diagnose and troubleshoot issues before they impact systems or applications. To help ensure the reliability, performance, and security of your applications and infrastructure, Red Hat OpenShift Service on AWS offers the following Observability components:

  • Monitoring

  • Logging

Red Hat OpenShift Observability connects open-source observability tools and technologies to create a unified Observability solution. The components of Red Hat OpenShift Observability work together to help you collect, store, deliver, analyze, and visualize data.

With the exception of monitoring, Red Hat OpenShift Observability components have distinct release cycles separate from the core Red Hat OpenShift Service on AWS release cycles. See the Red Hat OpenShift Operator Life Cycles page for their release compatibility.

Monitoring

Monitor the in-cluster health and performance of your applications running on Red Hat OpenShift Service on AWS with metrics and customized alerts for CPU and memory usage, network connectivity, and other resource usage. Monitoring stack components are deployed and managed by the Cluster Monitoring Operator.

Monitoring stack components are deployed by default in every Red Hat OpenShift Service on AWS installation and are managed by the Cluster Monitoring Operator (CMO). These components include Prometheus, Alertmanager, Thanos Querier, and others. The CMO also deploys the Telemeter Client, which sends a subset of data from platform Prometheus instances to Red Hat to facilitate Remote Health Monitoring for clusters.

For more information, see Monitoring overview and About remote health monitoring.

Logging

Collect, visualize, forward, and store log data to troubleshoot issues, identify performance bottlenecks, and detect security threats. In logging 5.7 and later versions, users can configure the LokiStack deployment to produce customized alerts and recorded metrics.