As a cluster administrator, you can effectively manage traffic for pods behind a MetalLB load-balancer service with multiple host interfaces by implementing features from MetalLB, NMState, and OVN-Kubernetes. By combining these features in this context, you can provide symmetric routing, traffic segregation, and support clients on different networks with overlapping CIDR addresses.
To achieve this functionality, learn how to implement virtual routing and forwarding (VRF) instances with MetalLB, and configure egress services.
Configuring symmetric traffic by using a VRF instance with MetalLB and an egress service is a Technology Preview feature only. Technology Preview features are not supported with Red Hat production service level agreements (SLAs) and might not be functionally complete. Red Hat does not recommend using them in production. These features provide early access to upcoming product features, enabling customers to test functionality and provide feedback during the development process. For more information about the support scope of Red Hat Technology Preview features, see Technology Preview Features Support Scope. |
When you use MetalLB with multiple host interfaces, MetalLB exposes and announces a service through all available interfaces on the host. This can present challenges relating to network isolation, asymmetric return traffic and overlapping CIDR addresses.
One option to ensure that return traffic reaches the correct client is to use static routes. However, with this solution, MetalLB cannot isolate the services and then announce each service through a different interface. Additionally, static routing requires manual configuration and requires maintenance if remote sites are added.
A further challenge of symmetric routing when implementing a MetalLB service is scenarios where external systems expect the source and destination IP address for an application to be the same. The default behavior for OpenShift Container Platform is to assign the IP address of the host network interface as the source IP address for traffic originating from pods. This is problematic with multiple host interfaces.
You can overcome these challenges by implementing a configuration that combines features from MetalLB, NMState, and OVN-Kubernetes.
You can overcome the challenges of implementing symmetric routing by using NMState to configure a VRF instance on a host, associating the VRF instance with a MetalLB BGPPeer
resource, and configuring an egress service for egress traffic with OVN-Kubernetes.
The configuration process involves three stages:
Configure a NodeNetworkConfigurationPolicy
custom resource (CR) to associate a VRF instance with a network interface.
Use the VRF routing table to direct ingress and egress traffic.
BGPPeer
Configure a MetalLB BGPPeer
resource to use the VRF instance on a network interface.
By associating the BGPPeer
resource with the VRF instance, the designated network interface becomes the primary interface for the BGP session, and MetalLB advertises the services through this interface.
Configure an egress service to choose the network associated with the VRF instance for egress traffic.
Optional: Configure an egress service to use the IP address of the MetalLB load-balancer service as the source IP for egress traffic.
You can configure symmetric network routing for applications behind a MetalLB service that require the same ingress and egress network paths.
This example associates a VRF routing table with MetalLB and an egress service to enable symmetric routing for ingress and egress traffic for pods behind a LoadBalancer
service.
|
Install the OpenShift CLI (oc
).
Log in as a user with cluster-admin
privileges.
Install the Kubernetes NMState Operator.
Install the MetalLB Operator.
Create a NodeNetworkConfigurationPolicy
CR to define the VRF instance:
Create a file, such as node-network-vrf.yaml
, with content like the following example:
apiVersion: nmstate.io/v1
kind: NodeNetworkConfigurationPolicy
metadata:
name: vrfpolicy (1)
spec:
nodeSelector:
vrf: "true" (2)
maxUnavailable: 3
desiredState:
interfaces:
- name: ens4vrf (3)
type: vrf (4)
state: up
vrf:
port:
- ens4 (5)
route-table-id: 2 (6)
- name: ens4 (7)
type: ethernet
state: up
ipv4:
address:
- ip: 192.168.130.130
prefix-length: 24
dhcp: false
enabled: true
routes: (8)
config:
- destination: 0.0.0.0/0
metric: 150
next-hop-address: 192.168.130.1
next-hop-interface: ens4
table-id: 2
route-rules: (9)
config:
- ip-to: 172.30.0.0/16
priority: 998
route-table: 254 (10)
- ip-to: 10.132.0.0/14
priority: 998
route-table: 254
1 | The name of the policy. |
2 | This example applies the policy to all nodes with the label vrf:true . |
3 | The name of the interface. |
4 | The type of interface. This example creates a VRF instance. |
5 | The node interface that the VRF attaches to. |
6 | The name of the route table ID for the VRF. |
7 | The IPv4 address of the interface associated with the VRF. |
8 | Defines the configuration for network routes. The next-hop-address field defines the IP address of the next hop for the route. The next-hop-interface field defines the outgoing interface for the route. In this example, the VRF routing table is 2 , which references the ID that you define in the egressService CR. |
9 | Defines additional route rules. The ip-to fields must match the Cluster Network CIDR and Service Network CIDR. You can view the values for these CIDR address specifications by running the following command: oc describe network.config/cluster . |
10 | The main routing table that the Linux kernel uses when calculating routes has the ID 254 . |
Apply the policy by running the following command:
$ oc apply -f node-network-vrf.yaml
Create a BGPPeer
custom resource (CR):
Create a file, such as frr-via-vrf.yaml
, with content like the following example:
apiVersion: metallb.io/v1beta2
kind: BGPPeer
metadata:
name: frrviavrf
namespace: metallb-system
spec:
myASN: 100
peerASN: 200
peerAddress: 192.168.130.1
vrf: ens4vrf (1)
1 | Specifies the VRF instance to associate with the BGP peer. MetalLB can advertise services and make routing decisions based on the routing information in the VRF. |
Apply the configuration for the BGP peer by running the following command:
$ oc apply -f frr-via-vrf.yaml
Create an IPAddressPool
CR:
Create a file, such as first-pool.yaml
, with content like the following example:
apiVersion: metallb.io/v1beta1
kind: IPAddressPool
metadata:
name: first-pool
namespace: metallb-system
spec:
addresses:
- 192.169.10.0/32
Apply the configuration for the IP address pool by running the following command:
$ oc apply -f first-pool.yaml
Create a BGPAdvertisement
CR:
Create a file, such as first-adv.yaml
, with content like the following example:
apiVersion: metallb.io/v1beta1
kind: BGPAdvertisement
metadata:
name: first-adv
namespace: metallb-system
spec:
ipAddressPools:
- first-pool
peers:
- frrviavrf (1)
nodeSelectors:
- matchLabels:
egress-service.k8s.ovn.org/test-server1: "" (2)
1 | In this example, MetalLB advertises a range of IP addresses from the first-pool IP address pool to the frrviavrf BGP peer. |
2 | In this example, the egressService CR configures the source IP address for egress traffic to use the load-balancer service IP address. Therefore, you must specify the load-balancer node for return traffic to use the same return path for the traffic originating from the pod. |
Apply the configuration for the BGP advertisement by running the following command:
$ oc apply -f first-adv.yaml
Create an egressService
CR:
Create a file, such as egress-service.yaml
, with content like the following example:
apiVersion: k8s.ovn.org/v1
kind: egressService
metadata:
name: server1 (1)
namespace: test (2)
spec:
sourceIPBy: "LoadBalancerIP" (3)
nodeSelector:
matchLabels:
vrf: "true" (4)
network: "2" (5)
1 | Specify the name for the egress service. The name of the egressService resource must match the name of the load-balancer service that you want to modify. |
2 | Specify the namespace for the egress service. The namespace for the egressService must match the namespace of the load-balancer service that you want to modify. The egress service is namespace-scoped. |
3 | This example assigns the LoadBalancer service ingress IP address as the source IP address for egress traffic. |
4 | If you specify LoadBalancer for the sourceIPBy specification, a single node handles the LoadBalancer service traffic. In this example, only a node with the label vrf: "true" can handle the service traffic. If you do not specify a node, OVN-Kubernetes selects a worker node to handle the service traffic. When a node is selected, OVN-Kubernetes labels the node in the following format: egress-service.k8s.ovn.org/<svc_namespace>-<svc_name>: "" . |
5 | Specify the routing table for egress traffic. |
Apply the configuration for the egress service by running the following command:
$ oc apply -f egress-service.yaml
Verify that you can access the application endpoint of the pods running behind the MetalLB service by running the following command:
$ curl <external_ip_address>:<port_number> (1)
1 | Update the external IP address and port number to suit your application endpoint. |
Optional: If you assigned the LoadBalancer
service ingress IP address as the source IP address for egress traffic, verify this configuration by using tools such as tcpdump
to analyze packets received at the external client.