RDMA not detected on Intel(R) E810-DA2 #2335

@cmontemuino

I have multiple servers in a K8S cluster with the following profiles:

  1. Lenovo SR630 V2 - 2x ThinkSystem Mellanox ConnectX-6 Lx 10/25GbE SFP28 2-port OCP Ethernet Adapter
  2. Lenovo SR670 V2 - 2x Intel(R) E810-DA2 10/25GbE SFP28 2-Port OCP Ethernet Adapter

Nodes with the first profile get the label rdma.capable: true, but nodes with the second profile do not.

According to this, the NICs in the second profile are RDMA capable.

If I'm not mistaken, RDMA detection is done here:
https://github.com/kubernetes-sigs/node-feature-discovery/blob/master/source/custom/static_features.go

These Intel cards have device ID 159b:

$ lspci -nn | grep -i ethernet
4b:00.0 Ethernet controller [0200]: Intel Corporation Ethernet Controller E810-XXV for SFP [8086:159b] (rev 02)
4b:00.1 Ethernet controller [0200]: Intel Corporation Ethernet Controller E810-XXV for SFP [8086:159b] (rev 02)
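
To compare the IDs across both server profiles, the vendor and device IDs can be pulled out of the `lspci -nn` output programmatically. The following is a minimal sketch, assuming the output format shown above; `parse_lspci_ids` is a hypothetical helper name and not part of node-feature-discovery.

```python
import re

# Matches lines such as:
#   4b:00.0 Ethernet controller [0200]: Intel ... [8086:159b] (rev 02)
# The class code "[0200]" has no colon, so only the "[vendor:device]" pair matches.
_LSPCI_RE = re.compile(
    r"^(?P<slot>\S+)\s+.*\[(?P<vendor>[0-9a-fA-F]{4}):(?P<device>[0-9a-fA-F]{4})\]"
)


def parse_lspci_ids(text):
    """Return a list of (slot, vendor_id, device_id) tuples from `lspci -nn` output."""
    results = []
    for line in text.splitlines():
        match = _LSPCI_RE.match(line.strip())
        if match:
            results.append(
                (
                    match.group("slot"),
                    match.group("vendor").lower(),
                    match.group("device").lower(),
                )
            )
    return results


if __name__ == "__main__":
    sample = (
        "4b:00.0 Ethernet controller [0200]: Intel Corporation Ethernet "
        "Controller E810-XXV for SFP [8086:159b] (rev 02)\n"
        "4b:00.1 Ethernet controller [0200]: Intel Corporation Ethernet "
        "Controller E810-XXV for SFP [8086:159b] (rev 02)\n"
    )
    for slot, vendor, device in parse_lspci_ids(sample):
        print(slot, vendor, device)
```

Running the same helper on a node from the first profile would show which vendor and device IDs differ between the labeled and unlabeled nodes.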

The system recognizes both NICs as RDMA channel adapters (node_type ca):

$ rdma dev show
0: irdma0: node_type ca fw 1.69 node_guid b696:91ff:fef9:6f5c sys_image_guid b696:91ff:fef9:6f5c
1: irdma1: node_type ca fw 1.69 node_guid b696:91ff:fef9:6f5d sys_image_guid b696:91ff:fef9:6f5d

$ rdma link show
link irdma0/1 state ACTIVE physical_state LINK_UP netdev ens27f0
link irdma1/1 state ACTIVE physical_state LINK_UP netdev ens27f1
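
For checking many nodes at once, the `rdma dev show` output can be turned into structured data. This is a rough sketch that assumes the line layout shown above (an index, a device name, then key/value pairs); other iproute2 versions may print additional fields, and `parse_rdma_dev_show` is a hypothetical helper name.

```python
def parse_rdma_dev_show(text):
    """Parse `rdma dev show` text output into a list of dicts.

    Assumes each line looks like:
      0: irdma0: node_type ca fw 1.69 node_guid ... sys_image_guid ...
    """
    devices = []
    for line in text.splitlines():
        tokens = line.split()
        # Need at least "<index>:" and "<name>:" to treat the line as a device.
        if len(tokens) < 2 or not tokens[0].endswith(":"):
            continue
        index = tokens[0].rstrip(":")
        if not index.isdigit():
            continue
        device = {"index": int(index), "name": tokens[1].rstrip(":")}
        # Remaining tokens are read as alternating key/value pairs.
        rest = tokens[2:]
        device.update(zip(rest[0::2], rest[1::2]))
        devices.append(device)
    return devices


if __name__ == "__main__":
    sample = (
        "0: irdma0: node_type ca fw 1.69 node_guid b696:91ff:fef9:6f5c "
        "sys_image_guid b696:91ff:fef9:6f5c\n"
        "1: irdma1: node_type ca fw 1.69 node_guid b696:91ff:fef9:6f5d "
        "sys_image_guid b696:91ff:fef9:6f5d\n"
    )
    for dev in parse_rdma_dev_show(sample):
        print(dev["name"], dev["node_type"], dev["fw"])
```

A node where this returns at least one device with `node_type` set to `ca` has RDMA devices registered with the kernel, regardless of which label the node received.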

Is a check against device ID "159b" missing, or am I overlooking something?
