Bug 704770 – NMPlatform caches out-of-sync on resume and enable/disable

After an evaluation, GNOME has moved from Bugzilla to GitLab. Learn more about GitLab.
No new issues can be reported in GNOME Bugzilla anymore.
To report an issue in a GNOME project, go to GNOME GitLab.
Do not go to GNOME Gitlab for: Bluefish, Doxygen, GnuCash, GStreamer, java-gnome, LDTP, NetworkManager, Tomboy.

Bug 704770 - NMPlatform caches out-of-sync on resume and enable/disable


Summary:	NMPlatform caches out-of-sync on resume and enable/disable


Status:	RESOLVED FIXED

Product:	NetworkManager
Classification:	Platform
Component:	IP and DNS config
Version:	git master
Hardware:	Other Linux

Importance:	Normal normal
Target Milestone:	---
Assigned To:	NetworkManager maintainer(s)
QA Contact:	NetworkManager maintainer(s)

URL:
Whiteboard:

Depends on:
Blocks:

Reported:	2013-07-23 21:58 UTC by Dan Williams
Modified:	2013-07-24 14:51 UTC

See Also:
GNOME target:	---
GNOME version:	---

Attachments
Fix cache sync issues in platform (1.52 KB, patch) 2013-07-23 22:40 UTC, Dan Williams	none	Details \| Review
handle address/route deletion upon link actions (4.02 KB, patch) 2013-07-24 08:46 UTC, Pavel Simerda	none	Details \| Review

Description Dan Williams 2013-07-23 21:58:39 UTC

First question, is there a reason that event_notification() doesn't handle RTM_DELROUTE/RTM_NEWROUTE the same way it does for links and addresses?  Wouldn't that be necessary to discover routing changes made underneath NM?  If doing that, wouldn't we have to call nl_socket_add_membership (priv->nlh, RTNLGRP_IPV4_ROUTE); to get those events?  I tried just adding RTM_DELROUTE/RTM_NEWROUTE to the switch and didn't get any new events when deleting routes.

Anyway, the core problem here is on suspend and enable/disable (like "nmcli radio wifi off"), NetworkManager sets devices down, and if the default route is through one of those devices the kernel automatically deletes the default route, but the platform doesn't know about it because it doesn't listen for signals from the kernel, or the kernel doesn't send the RTM_DELROUTE, or both.

Then, on resume (or "nmcli radio wifi on"), after the device connects, when NetworkManager tries to add the new default route through the same device, the platform rejects that with EEXIST because it thinks the default route is still present in the route cache.  Thus the default route never gets re-added.

A hack to work around this would be calling nl_cache_refill (priv->nlh, priv->route_cache); from nm-manager.c's do_sleep_wake() function to refresh the route cache on resume.

Alternatively, the platform could listen to RTM_DELROUTE events.

Thoughts?

Comment 1 Dan Williams 2013-07-23 22:29:40 UTC

Ok, so do we have a kernel issue here too?  Obviously we need to work around this in NetworkManager, but it appears the kernel does not send RTM_DELROUTE for a routes that are removed as a result of the interface being set down.

Running libnl's "nl-monitor ipv4-route" tool and taking wlan0 down with /sbin/ip, no RTM_DELROUTE is ever received:

$ ip -4 route
default via 192.168.1.1 dev wlan0 
192.168.1.0/24 dev wlan0  proto kernel  scope link  src 192.168.1.175 
$ sudo ip link set dev wlan0 down
<<< expect RTM_DELROUTEs here, but none come >>>
$ ip -4 route
$ sudo ip link set dev wlan0 up
$ ip -4 route
192.168.1.0/24 dev wlan0  proto kernel  scope link  src 192.168.1.175 
$ 

So I guess we just need to ensure that when a device gets taken down in the platform, that we refill the route cache...  That seems to be what the kernel expects us to do.  Still, would be nice to get notifications that the routes have been removed on !IFF_UP, since the automagic ones for the interface's subnet (eg 192.168.1.0/24) *do* get announced via RTM_NEWROUTE when you set the interface back up.

Comment 2 Dan Williams 2013-07-23 22:33:15 UTC

This makes things work:

@@ -1277,7 +1304,12 @@ link_set_up (NMPlatform *platform, int ifindex)
 static gboolean
 link_set_down (NMPlatform *platform, int ifindex)
 {
-       return link_change_flags (platform, ifindex, IFF_UP, FALSE);
+       NMLinuxPlatformPrivate *priv = NM_LINUX_PLATFORM_GET_PRIVATE (platform);
+       gboolean success;
+
+       success = link_change_flags (platform, ifindex, IFF_UP, FALSE);
+       nl_cache_refill (priv->nlh, priv->route_cache);
+       return success;
 }
 
 static gboolean

though we should probably also listen for kernel routing events and respond to them correctly too.

Comment 3 Dan Williams 2013-07-23 22:40:02 UTC

Created attachment 249966 [details] [review]
Fix cache sync issues in platform

Comment 4 Pavel Simerda 2013-07-24 08:01:16 UTC

(In reply to comment #2)
> This makes things work:
> 
> @@ -1277,7 +1304,12 @@ link_set_up (NMPlatform *platform, int ifindex)
>  static gboolean
>  link_set_down (NMPlatform *platform, int ifindex)
>  {
> -       return link_change_flags (platform, ifindex, IFF_UP, FALSE);
> +       NMLinuxPlatformPrivate *priv = NM_LINUX_PLATFORM_GET_PRIVATE
> (platform);
> +       gboolean success;
> +
> +       success = link_change_flags (platform, ifindex, IFF_UP, FALSE);
> +       nl_cache_refill (priv->nlh, priv->route_cache);
> +       return success;
>  }
> 
>  static gboolean
> 
> though we should probably also listen for kernel routing events and respond to
> them correctly too.

This indeed works only for interfaces taken down by NetworkManager.

(In reply to comment #3)
> Created an attachment (id=249966) [details] [review]
> Fix cache sync issues in platform

Ah, it looks like we're missing some important bits. I'll push a patch with just the left-overs, so that we can concentrate on the issue.

Comment 5 Pavel Simerda 2013-07-24 08:46:55 UTC

Created attachment 250005 [details] [review]
handle address/route deletion upon link actions

This patch handles the issue as late as possible, in order to cover *all* cases when link down/deletion is noticed by nm-platform, including both internal and external changes.

Comment 6 Dan Winship 2013-07-24 11:44:29 UTC

(In reply to comment #1)
> Ok, so do we have a kernel issue here too?

In the past, the kernel was known to not announce certain routing changes (in particular, any change to the IPv6 default route), and this was considered not a bug, because netlink sockets don't guarantee reliable delivery, so you have to manually resync your caches now and then to ensure they're accurate anyway.

I don't know if this is still true. (Well, the theoretical part still is. I don't know there are still specific things it intentionally doesn't announce.)

Comment 7 Pavel Simerda 2013-07-24 12:08:43 UTC

(In reply to comment #6)
> (In reply to comment #1)
> > Ok, so do we have a kernel issue here too?
> 
> In the past, the kernel was known to not announce certain routing changes (in
> particular, any change to the IPv6 default route), and this was considered not
> a bug, because netlink sockets don't guarantee reliable delivery, so you have
> to manually resync your caches now and then to ensure they're accurate anyway.

Any reliable source for that, apart from just a feeling?

> I don't know if this is still true. (Well, the theoretical part still is. I
> don't know there are still specific things it intentionally doesn't announce.)

Well, my tests didn't show such behavior but they were not very thorough. But we abandoned the kernel IPv6 autoconf which was the most problematic in various respects.

Therefore, our current situation is:

1) Most of the time we don't rely on kernel netlink notifications, as NetworkManager is the originator of the link/address/route records. This applies to static configuration, DHCP and router discovery, regardless of the IP protocol version.

AFAIK this also applies to this particular case as it's NetworkManager, which sets down interfaces on suspend, unless I'm mistaken.

2) *When* we depend on the kernel at all, it's only for external tools making changes to the kernel tables. That is a brand new feature and therefore it should be OK to request kernel fixes for it to work properly. Also, it is mostly a convenience feature, rarely a critical one.

That said, I don't see a need to workaround kernel problems any more and it sounds like a valid behavior to only accept external changes actually announced by the kernel. But that's not really about this particular bug.

Comment 8 Dan Winship 2013-07-24 13:23:30 UTC

(In reply to comment #7)
> Any reliable source for that, apart from just a feeling?

I discovered this when writing GNetworkMonitor. OK, actually, it's the IPv4 default route, not IPv6, and only when it's removed. (bug 620932 comment 45). I seem to recall someone pointing me to a lkml post where someone said that this was known and wasn't going to be fixed, but I can't find that now.

Comment 9 Pavel Simerda 2013-07-24 13:57:16 UTC

Thanks for more information. I still believe we should ignore the external default IPv4 route issue for now until it becomes a real problem and then at least try to get it fixed in kernel first. If we ever need to work around it in NetworkManager, I would like to have the workaround properly commented including the source of information, etc, as those are really ugly hacks around kernel's bugs and/or missing features.

Comment 10 Dan Winship 2013-07-24 14:19:57 UTC

Well, the theoretical point still applies: netlink sockets have a fixed-size buffer, so if too many notifications get sent out all at once (or, say, if you're running NM in gdb, and you leave it stopped at a breakpoint too long), then some notifications might get dropped.

I don't know how likely this is in practice.

Comment 11 Pavel Simerda 2013-07-24 14:40:32 UTC

(In reply to comment #10)
> Well, the theoretical point still applies: netlink sockets have a fixed-size
> buffer, so if too many notifications get sent out all at once (or, say, if
> you're running NM in gdb, and you leave it stopped at a breakpoint too long),
> then some notifications might get dropped.

But then you must be talking about NetworkManager, or more precisely libnl3, side of communication, if you're mentioning gdb. That buffer size size is specified by NetworkManager.

See for example:

commit 38a9ac5cc22a39506453a238406bd7f9d9205943
Author: Pavel Šimerda <psimerda@redhat.com>
Date:   Mon May 27 17:45:55 2013 +0200

    netlink: enlarge netlink buffer to 128k

+       nle = nl_socket_set_buffer_size (priv->nlh_event, 131072, 0);

As far as I remember, more weird things happen, and techniques like explicit refill won't help much.

> I don't know how likely this is in practice.

Actually it first happened to me with nm-platform tests and then later someone noted that it happened in a running NetworkManager. We will have to keep the size of the buffer large enough even if it requires guesswork using number of interfaces, IP addresses or whatever. We will learn more when NM is used in some complicated environments.

Comment 12 Pavel Simerda 2013-07-24 14:41:37 UTC

I forgot the important point that any netlink socket error conditions are noted in the logs.

Comment 13 Dan Williams 2013-07-24 14:51:55 UTC

Pushed the patch.