Skip to content

Commit fcdb44d

Browse files
jprestwokuba-moo
authored andcommitted
net: arp: introduce arp_evict_nocarrier sysctl parameter
This change introduces a new sysctl parameter, arp_evict_nocarrier. When set (default) the ARP cache will be cleared on a NOCARRIER event. This new option has been defaulted to '1' which maintains existing behavior. Clearing the ARP cache on NOCARRIER is relatively new, introduced by: commit 859bd2e Author: David Ahern <[email protected]> Date: Thu Oct 11 20:33:49 2018 -0700 net: Evict neighbor entries on carrier down The reason for this changes is to prevent the ARP cache from being cleared when a wireless device roams. Specifically for wireless roams the ARP cache should not be cleared because the underlying network has not changed. Clearing the ARP cache in this case can introduce significant delays sending out packets after a roam. A user reported such a situation here: https://lore.kernel.org/linux-wireless/CACsRnHWa47zpx3D1oDq9JYnZWniS8yBwW1h0WAVZ6vrbwL_S0w@mail.gmail.com/ After some investigation it was found that the kernel was holding onto packets until ARP finished which resulted in this 1 second delay. It was also found that the first ARP who-has was never responded to, which is actually what caues the delay. This change is more or less working around this behavior, but again, there is no reason to clear the cache on a roam anyways. As for the unanswered who-has, we know the packet made it OTA since it was seen while monitoring. Why it never received a response is unknown. In any case, since this is a problem on the AP side of things all that can be done is to work around it until it is solved. Some background on testing/reproducing the packet delay: Hardware: - 2 access points configured for Fast BSS Transition (Though I don't see why regular reassociation wouldn't have the same behavior) - Wireless station running IWD as supplicant - A device on network able to respond to pings (I used one of the APs) Procedure: - Connect to first AP - Ping once to establish an ARP entry - Start a tcpdump - Roam to second AP - Wait for operstate UP event, and note the timestamp - Start pinging Results: Below is the tcpdump after UP. It was recorded the interface went UP at 10:42:01.432875. 10:42:01.461871 ARP, Request who-has 192.168.254.1 tell 192.168.254.71, length 28 10:42:02.497976 ARP, Request who-has 192.168.254.1 tell 192.168.254.71, length 28 10:42:02.507162 ARP, Reply 192.168.254.1 is-at ac:86:74:55:b0:20, length 46 10:42:02.507185 IP 192.168.254.71 > 192.168.254.1: ICMP echo request, id 52792, seq 1, length 64 10:42:02.507205 IP 192.168.254.71 > 192.168.254.1: ICMP echo request, id 52792, seq 2, length 64 10:42:02.507212 IP 192.168.254.71 > 192.168.254.1: ICMP echo request, id 52792, seq 3, length 64 10:42:02.507219 IP 192.168.254.71 > 192.168.254.1: ICMP echo request, id 52792, seq 4, length 64 10:42:02.507225 IP 192.168.254.71 > 192.168.254.1: ICMP echo request, id 52792, seq 5, length 64 10:42:02.507232 IP 192.168.254.71 > 192.168.254.1: ICMP echo request, id 52792, seq 6, length 64 10:42:02.515373 IP 192.168.254.1 > 192.168.254.71: ICMP echo reply, id 52792, seq 1, length 64 10:42:02.521399 IP 192.168.254.1 > 192.168.254.71: ICMP echo reply, id 52792, seq 2, length 64 10:42:02.521612 IP 192.168.254.1 > 192.168.254.71: ICMP echo reply, id 52792, seq 3, length 64 10:42:02.521941 IP 192.168.254.1 > 192.168.254.71: ICMP echo reply, id 52792, seq 4, length 64 10:42:02.522419 IP 192.168.254.1 > 192.168.254.71: ICMP echo reply, id 52792, seq 5, length 64 10:42:02.523085 IP 192.168.254.1 > 192.168.254.71: ICMP echo reply, id 52792, seq 6, length 64 You can see the first ARP who-has went out very quickly after UP, but was never responded to. Nearly a second later the kernel retries and gets a response. Only then do the ping packets go out. If an ARP entry is manually added prior to UP (after the cache is cleared) it is seen that the first ping is never responded to, so its not only an issue with ARP but with data packets in general. As mentioned prior, the wireless interface was also monitored to verify the ping/ARP packet made it OTA which was observed to be true. Signed-off-by: James Prestwood <[email protected]> Reviewed-by: David Ahern <[email protected]> Signed-off-by: Jakub Kicinski <[email protected]>
1 parent 1d6d336 commit fcdb44d

File tree

6 files changed

+27
-1
lines changed

6 files changed

+27
-1
lines changed

Documentation/networking/ip-sysctl.rst

Lines changed: 9 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1611,6 +1611,15 @@ arp_accept - BOOLEAN
16111611
gratuitous arp frame, the arp table will be updated regardless
16121612
if this setting is on or off.
16131613

1614+
arp_evict_nocarrier - BOOLEAN
1615+
Clears the ARP cache on NOCARRIER events. This option is important for
1616+
wireless devices where the ARP cache should not be cleared when roaming
1617+
between access points on the same network. In most cases this should
1618+
remain as the default (1).
1619+
1620+
- 1 - (default): Clear the ARP cache on NOCARRIER events
1621+
- 0 - Do not clear ARP cache on NOCARRIER events
1622+
16141623
mcast_solicit - INTEGER
16151624
The maximum number of multicast probes in INCOMPLETE state,
16161625
when the associated hardware address is unknown. Defaults

include/linux/inetdevice.h

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -133,6 +133,8 @@ static inline void ipv4_devconf_setall(struct in_device *in_dev)
133133
#define IN_DEV_ARP_ANNOUNCE(in_dev) IN_DEV_MAXCONF((in_dev), ARP_ANNOUNCE)
134134
#define IN_DEV_ARP_IGNORE(in_dev) IN_DEV_MAXCONF((in_dev), ARP_IGNORE)
135135
#define IN_DEV_ARP_NOTIFY(in_dev) IN_DEV_MAXCONF((in_dev), ARP_NOTIFY)
136+
#define IN_DEV_ARP_EVICT_NOCARRIER(in_dev) IN_DEV_ANDCONF((in_dev), \
137+
ARP_EVICT_NOCARRIER)
136138

137139
struct in_ifaddr {
138140
struct hlist_node hash;

include/uapi/linux/ip.h

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -169,6 +169,7 @@ enum
169169
IPV4_DEVCONF_DROP_UNICAST_IN_L2_MULTICAST,
170170
IPV4_DEVCONF_DROP_GRATUITOUS_ARP,
171171
IPV4_DEVCONF_BC_FORWARDING,
172+
IPV4_DEVCONF_ARP_EVICT_NOCARRIER,
172173
__IPV4_DEVCONF_MAX
173174
};
174175

include/uapi/linux/sysctl.h

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -482,6 +482,7 @@ enum
482482
NET_IPV4_CONF_PROMOTE_SECONDARIES=20,
483483
NET_IPV4_CONF_ARP_ACCEPT=21,
484484
NET_IPV4_CONF_ARP_NOTIFY=22,
485+
NET_IPV4_CONF_ARP_EVICT_NOCARRIER=23,
485486
};
486487

487488
/* /proc/sys/net/ipv4/netfilter */

net/ipv4/arp.c

Lines changed: 10 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1247,6 +1247,8 @@ static int arp_netdev_event(struct notifier_block *this, unsigned long event,
12471247
{
12481248
struct net_device *dev = netdev_notifier_info_to_dev(ptr);
12491249
struct netdev_notifier_change_info *change_info;
1250+
struct in_device *in_dev;
1251+
bool evict_nocarrier;
12501252

12511253
switch (event) {
12521254
case NETDEV_CHANGEADDR:
@@ -1257,7 +1259,14 @@ static int arp_netdev_event(struct notifier_block *this, unsigned long event,
12571259
change_info = ptr;
12581260
if (change_info->flags_changed & IFF_NOARP)
12591261
neigh_changeaddr(&arp_tbl, dev);
1260-
if (!netif_carrier_ok(dev))
1262+
1263+
in_dev = __in_dev_get_rtnl(dev);
1264+
if (!in_dev)
1265+
evict_nocarrier = true;
1266+
else
1267+
evict_nocarrier = IN_DEV_ARP_EVICT_NOCARRIER(in_dev);
1268+
1269+
if (evict_nocarrier && !netif_carrier_ok(dev))
12611270
neigh_carrier_down(&arp_tbl, dev);
12621271
break;
12631272
default:

net/ipv4/devinet.c

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -75,6 +75,7 @@ static struct ipv4_devconf ipv4_devconf = {
7575
[IPV4_DEVCONF_SHARED_MEDIA - 1] = 1,
7676
[IPV4_DEVCONF_IGMPV2_UNSOLICITED_REPORT_INTERVAL - 1] = 10000 /*ms*/,
7777
[IPV4_DEVCONF_IGMPV3_UNSOLICITED_REPORT_INTERVAL - 1] = 1000 /*ms*/,
78+
[IPV4_DEVCONF_ARP_EVICT_NOCARRIER - 1] = 1,
7879
},
7980
};
8081

@@ -87,6 +88,7 @@ static struct ipv4_devconf ipv4_devconf_dflt = {
8788
[IPV4_DEVCONF_ACCEPT_SOURCE_ROUTE - 1] = 1,
8889
[IPV4_DEVCONF_IGMPV2_UNSOLICITED_REPORT_INTERVAL - 1] = 10000 /*ms*/,
8990
[IPV4_DEVCONF_IGMPV3_UNSOLICITED_REPORT_INTERVAL - 1] = 1000 /*ms*/,
91+
[IPV4_DEVCONF_ARP_EVICT_NOCARRIER - 1] = 1,
9092
},
9193
};
9294

@@ -2532,6 +2534,8 @@ static struct devinet_sysctl_table {
25322534
DEVINET_SYSCTL_RW_ENTRY(ARP_IGNORE, "arp_ignore"),
25332535
DEVINET_SYSCTL_RW_ENTRY(ARP_ACCEPT, "arp_accept"),
25342536
DEVINET_SYSCTL_RW_ENTRY(ARP_NOTIFY, "arp_notify"),
2537+
DEVINET_SYSCTL_RW_ENTRY(ARP_EVICT_NOCARRIER,
2538+
"arp_evict_nocarrier"),
25352539
DEVINET_SYSCTL_RW_ENTRY(PROXY_ARP_PVLAN, "proxy_arp_pvlan"),
25362540
DEVINET_SYSCTL_RW_ENTRY(FORCE_IGMP_VERSION,
25372541
"force_igmp_version"),

0 commit comments

Comments
 (0)