After an evaluation, GNOME has moved from Bugzilla to GitLab. Learn more about GitLab.
No new issues can be reported in GNOME Bugzilla anymore.
To report an issue in a GNOME project, go to GNOME GitLab.
Do not go to GNOME Gitlab for: Bluefish, Doxygen, GnuCash, GStreamer, java-gnome, LDTP, NetworkManager, Tomboy.
Bug 674184 - Trying to create new email causes Evolution to crash on Debian Unstable
Trying to create new email causes Evolution to crash on Debian Unstable
Status: RESOLVED NOTGNOME
Product: evolution
Classification: Applications
Component: Mailer
3.2.x (obsolete)
Other Linux
: High critical
: ---
Assigned To: evolution-mail-maintainers
Evolution QA team
Depends on:
Blocks:
 
 
Reported: 2012-04-16 09:00 UTC by Russel Winder
Modified: 2012-04-18 16:27 UTC
See Also:
GNOME target: ---
GNOME version: ---


Attachments
DMesg of machine for which fails happen (66.88 KB, text/plain)
2012-04-18 14:36 UTC, Russel Winder
Details

Description Russel Winder 2012-04-16 09:00:53 UTC
Whenever I try to create a new email or to reply to an email on one of my Debian Unstable machiens, Evolution crashes and all open terminals receive:


Message from syslogd@anglides at Apr 16 09:49:05 ...
 kernel:[  436.682966] Oops: 0000 [#1] SMP 

Message from syslogd@anglides at Apr 16 09:49:05 ...
 kernel:[  436.683113] Stack:

Message from syslogd@anglides at Apr 16 09:49:05 ...
 kernel:[  436.683133] Call Trace:

Message from syslogd@anglides at Apr 16 09:49:05 ...
 kernel:[  436.683185] Code: df e8 a7 4c ff ff 48 89 ef 89 44 24 08 e8 d8 fc ff ff 8b 44 24 08 48 83 c4 38 5b 5d 41 5c 41 5d 41 5e 41 5f c3 f0 80 4f 48 04 c3 <48> 8b 7f b8 31 c0 48 85 ff 74 16 8b 57 30 83 e6 03 21 f2 39 f2 

Message from syslogd@anglides at Apr 16 09:49:05 ...
 kernel:[  436.683229] CR2: ffffffffffffffb8


On the other machines things are fine. The difference between the machiens is that the one that crashes has an NVIDIA card. I am using all the standard Debian NVIDIA subsystems.  The machine is fully up to date with Debian Unstable as at 2012-04-16T10:00+01:00.

If people can guide me as to what data to provide to help investigate this issue I can provide it.
Comment 1 André Klapper 2012-04-16 11:26:53 UTC
Thanks for taking the time to report this bug.
Without a stack trace from the crash it's very hard to determine what caused it.
Can you get us a stack trace? Please see http://live.gnome.org/GettingTraces for more information on how to do so. Thanks in advance!
Comment 2 Russel Winder 2012-04-16 12:02:48 UTC
I installed the evolution-dbg package and then ran evolution from within gdb.  After waiting for the sync of all the folder caches so there was no activity I clicked on the "new message" button, got the kernel messages, but Evolution has deadlocked rather than crashed. So I ^C to get the prompt and then asked for the backtrace:

 Message from syslogd@anglides at Apr 16 12:51:44 ...
 kernel:[10017.745037] Oops: 0000 [#2] SMP 

Message from syslogd@anglides at Apr 16 12:51:44 ...
 kernel:[10017.745170] Stack:

Message from syslogd@anglides at Apr 16 12:51:44 ...
 kernel:[10017.745186] Call Trace:

Message from syslogd@anglides at Apr 16 12:51:44 ...
 kernel:[10017.745237] Code: df e8 a7 4c ff ff 48 89 ef 89 44 24 08 e8 d8 fc ff ff 8b 44 24 08 48 83 c4 38 5b 5d 41 5c 41 5d 41 5e 41 5f c3 f0 80 4f 48 04 c3 <48> 8b 7f b8 31 c0 48 85 ff 74 16 8b 57 30 83 e6 03 21 f2 39 f2 

Message from syslogd@anglides at Apr 16 12:51:44 ...
 kernel:[10017.745281] CR2: ffffffffffffffb8
^C
Program received signal SIGINT, Interrupt.

Thread 140736980678400 (LWP 4505)

  • #0 poll
    from /lib/x86_64-linux-gnu/libc.so.6
  • #1 ??
    from /lib/x86_64-linux-gnu/libglib-2.0.so.0
  • #2 g_main_loop_run
    from /lib/x86_64-linux-gnu/libglib-2.0.so.0
  • #3 ??
    from /usr/lib/x86_64-linux-gnu/libgio-2.0.so.0
  • #4 ??
    from /lib/x86_64-linux-gnu/libglib-2.0.so.0
  • #5 start_thread
    from /lib/x86_64-linux-gnu/libpthread.so.0
  • #6 clone
    from /lib/x86_64-linux-gnu/libc.so.6
  • #7 ??


Which doesn't seem entirely helpful since this is all libc threads stuff and not a lot to do with Evolution per se.
Comment 3 André Klapper 2012-04-16 12:22:54 UTC
What does "deadlocked" exactly mean for you?
And after hitting Ctrl+C once, can you enter "thread apply all bt" and post the complete gdb output here, please?
Comment 4 Russel Winder 2012-04-16 18:33:10 UTC
Message from syslogd@anglides at Apr 16 19:29:01 ...
 kernel:[23455.103786] Oops: 0000 [#2] SMP 

Message from syslogd@anglides at Apr 16 19:29:01 ...
 kernel:[23455.103927] Stack:

Message from syslogd@anglides at Apr 16 19:29:01 ...
 kernel:[23455.103943] Call Trace:

Message from syslogd@anglides at Apr 16 19:29:01 ...
 kernel:[23455.103994] Code: df e8 a7 4c ff ff 48 89 ef 89 44 24 08 e8 d8 fc ff ff 8b 44 24 08 48 83 c4 38 5b 5d 41 5c 41 5d 41 5e 41 5f c3 f0 80 4f 48 04 c3 <48> 8b 7f b8 31 c0 48 85 ff 74 16 8b 57 30 83 e6 03 21 f2 39 f2 

Message from syslogd@anglides at Apr 16 19:29:01 ...
 kernel:[23455.104039] CR2: ffffffffffffffb8
^C
Program received signal SIGINT, Interrupt.
[Switching to Thread 0x7fffe1bd7700 (LWP 5690)]
0x00007fffee6b7cc3 in poll () from /lib/x86_64-linux-gnu/libc.so.6
gdb>thread apply all bt
Cannot find new threads: generic error
gdb>
Comment 5 André Klapper 2012-04-16 19:49:06 UTC
This looks like a problem that is WAY deeper in the stack than in Evolution. 
You mention that the only difference is the NVIDIA card.

Before I close this as NOTGNOME and ask you to file a report in Debian's bugtracker:
Do you use kernel 3.3.1/.2 or 3.4.0?
   If yes: Does this also happen with 3.3.0?
Do you use NFS as your home directory?
   If yes: Can you provide dmesg output?

I have https://bugzilla.redhat.com/show_bug.cgi?id=811138 in mind, but probably I'm totally wrong. :)
Comment 6 Matthew Barnes 2012-04-16 20:11:35 UTC
Or might be something like bug 670478, which turned out to be an NVIDIA driver bug.  (libnvidia-tls.so.295.20 in the backtrace is the telltale sign.)  Impossible to tell without a usable backtrace though.
Comment 7 Russel Winder 2012-04-18 14:34:58 UTC
André,

I am using the kernel 3.2.0-2-amd64 series as per Debian Unstable.  Currently 3.2.15-1. I have no idea if it fails with other kernels.
 
I am definitely using NFS for my home directory -- it's the only sane way of having multiple machines guaranteed to be using the same filestore -- though, sadly, it is increasingly the case that GNOME assumes every machine is working with a separate filestore for ~ just like in the Windows 3.1 days :-((

dmesg result being appended shortly.  It looks suspiciously like NFS4 is involved, which is strange as I thought I was using NFS3.

Matthew,

The NVIDIA driver has been a source of great problems recently, causing irregular and terminal X crashes. Each time there is a new kernel or a new NVIDIA driver I give things a whirl but get the X terminations. I was trying a new driver which seemed to be holding up when this Evolution issue hit and so I have been using other machines as not being ale to send email is not an option.
Comment 8 Russel Winder 2012-04-18 14:36:52 UTC
Created attachment 212298 [details]
DMesg of machine for which fails happen

This is the dmesg as is without special preparation. If a more directed dmesg is needed, please let me know.
Comment 9 Matthew Barnes 2012-04-18 14:57:15 UTC
I agree with Andre in comment #5, this is almost certainly an NFS issue.  Gonna go ahead and close this as NOTGNOME until a gdb stack trace of the evolution process is provided.
Comment 10 Russel Winder 2012-04-18 16:27:24 UTC
I have posted a bug report in the Debian bugzilla:  669270: http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=669270