Red Hat offers cluster administrators and developers the Network Observability Operator to observe the network traffic for OKD clusters. The Network Observability Operator uses the eBPF technology to create network flows, which are then enriched with OKD information. The flows are available as Prometheus metrics or as logs in Loki. You can view and analyze this stored information in the OKD console for further insight and troubleshooting.
The Network Observability Operator provides the FlowCollector
API custom resource. A FlowCollector
instance is a cluster-scoped resource that enables configuration of network flow collection. This instance deploys pods and services that form a monitoring pipeline.
The eBPF
agent is deployed as a daemonset
object and creates the network flows. The pipeline collects and enriches network flows with Kubernetes metadata before storing them in Loki or generating Prometheus metrics.
You can optionally integrate the Network Observability Operator with other components to enhance its functionality and scalability. Supported optional dependencies include the Loki Operator for flow storage, and AMQ Streams for large-scale data handling with Kafka.
You can use Loki as the backend to store all collected flows with a maximal level of details. It is recommended to use the Red Hat supported Loki Operator to install Loki. You can also choose to use network observability without Loki, but you need to consider some factors. For more information, see "Network observability without Loki".
Kafka provides scalability, resiliency and high availability in the OKD cluster for large scale deployments. If you choose to use Kafka, it is recommended to use Red Hat supported AMQ Streams Operator.
OKD console integration offers an overview, a topology view, and traffic flow tables. The Network observability metrics dashboards in Observe → Dashboards are available only to users with administrator access.
To enable multi-tenancy for developer access and for administrators with limited access to namespaces, you must specify permissions by defining roles. For more information, see "Enabling multi-tenancy in network observability". |
In the OKD console on the Overview tab, you can view the overall aggregated metrics of the network traffic flow on the cluster. You can choose to display the information by cluster, node, namespace, owner, pod, and service. Filters and display options can further refine the metrics. For more information, see "Observing the network traffic from the Overview view".
In Observe → Dashboards, the Netobserv dashboards provide a quick overview of the network flows in your OKD cluster. The Netobserv/Health dashboard provides metrics about the health of the Operator. For more information, see "Network observability metrics" and "Viewing health information".
The OKD console offers the Topology tab which displays a graphical representation of the network flows and the amount of traffic. The topology view represents traffic between the OKD components as a network graph. You can refine the graph by using the filters and display options. You can access the information for cluster, zone, udn, node, namespace, owner, pod, and service.
You can quickly debug and troubleshoot networking issues with network observability by using the Network Observability command-line interface (CLI), oc netobserv
. The Network Observability CLI is a flow and packet visualization tool that relies on eBPF agents to stream collected data to an ephemeral collector pod. It requires no persistent storage during the capture. After the run, the output is transferred to your local machine. This enables quick, live insight into packets and flow data without installing the Network Observability Operator.