GNOME Bugzilla – Bug 739274
File notifications do not kick the main loop
Last modified: 2014-10-28 21:20:27 UTC
Hi. I was looking into an issue in emacs, where file notifications that use gio would not fire without any user input in emacs. This turned out to be a bug in glib that pops up if you use file notifications with your own event loop, which is what emacs does. Emacs does not call gtk_main(), but instead retrieves a list of active file descriptors with g_main_context_query(), and then uses pselect() to wait on those file descriptors and others. The issue is that the file descriptors returned by g_main_context_query() are not given data by the notification system, so using select() on them does nothing. I'm attaching a sample program that demonstrates this issue. It registers a file monitor on /tmp/dat, and runs a main loop in the way emacs does, as described above. When a notification event fires, this program prints something. The select() call has a timeout of 5 seconds. After select() returns, I process the glib events. So as glib stands now, select() doesn't work right so glib notification events are always delayed until the next time select() times out: 5 seconds in this case. This can be fixed by kicking the main loop. For instance, I can call g_main_context_wakeup(NULL); right after the g_file_monitor_emit_event() call in ih_event_callback() in inotify-helper.c. This isn't the right solution, since it assumes inotify, but something like this should be done. After I make that change, the select() actually works in my test program. There is still a 1 second delay between when the OS tells glib about the modification (ik_read_callback() call) and when the main loop gets the event, but that's a different bug, I suspect.
Created attachment 289495 [details] test case
I am able to reproduce the problem using the given test case. However, g_main_context_query requires that g_main_context_acquire is called first. The problem goes away if the test case is modified so that g_main_context_acquire is called just before g_main_context_query. Can you check if calling g_main_context_acquire solves the problem in Emacs? If it does then this is most likely not a bug in glib.
Aha. Thank you very much. The emacs code in question is this: https://github.com/emacs-mirror/emacs/blob/master/src/xgselect.c Essentially it does context = g_main_context_default (); g_main_context_query(context, ...); If I change it to context = g_main_context_default (); g_main_context_acquire(context); g_main_context_query(context, ...); g_main_context_release(context); then the select() works, as you say. Is this what you suggest? Thanks again!
Yes. You might want to also check the return value from g_main_context_acquire to make sure that the operation succeeded, but that is essentially what I suggest.
OK. Thank you very much. I'll create a separate bug for the 1-second delay I'm observing.