Bug 639621 – Evolution freezes after restore

After an evaluation, GNOME has moved from Bugzilla to GitLab. Learn more about GitLab.
No new issues can be reported in GNOME Bugzilla anymore.
To report an issue in a GNOME project, go to GNOME GitLab.
Do not go to GNOME Gitlab for: Bluefish, Doxygen, GnuCash, GStreamer, java-gnome, LDTP, NetworkManager, Tomboy.

Bug 639621 - Evolution freezes after restore


Summary:	Evolution freezes after restore


Status:	RESOLVED OBSOLETE

Product:	evolution
Classification:	Applications
Component:	Mailer
Version:	2.32.x (obsolete)
Hardware:	Other Linux

Importance:	Normal normal
Target Milestone:	---
Assigned To:	evolution-mail-maintainers
QA Contact:	Evolution QA team

URL:
Whiteboard:

Depends on:
Blocks:

Reported:	2011-01-15 21:27 UTC by chernoff
Modified:	2013-12-20 22:52 UTC

See Also:
GNOME target:	---
GNOME version:	---

Attachments
gdb output (8.06 KB, text/plain) 2011-01-15 22:08 UTC, chernoff	Details
compressed output of CAMEL_DEBUG=all evolution >& evo.log (477.21 KB, application/x-gzip) 2011-01-15 22:11 UTC, chernoff	Details
valgrind output (2.07 KB, text/plain) 2011-01-16 23:23 UTC, chernoff	Details
gdb with symbols (8.13 KB, text/plain) 2011-01-16 23:25 UTC, chernoff	Details
2 records from folders.db (658 bytes, text/plain) 2011-01-24 17:40 UTC, chernoff	Details

Description chernoff 2011-01-15 21:27:57 UTC

A full backup was made under evolution 2.24.5 and an attempt to restore under 2.32.1. The backup was read successfully by the new version, the data directories created but some problem occurs in reading back the information. Attached is the result of running CAMEL_DEBUG=all evolution >& evo.log
The evolution window is completely unresponsive and had to be killed. It appears (this is just a guess from examining the end of evo.log) that the sequence of messages being read/inserted into the database has a jump in number just before the freeze.

Comment 1 chernoff 2011-01-15 22:08:32 UTC

Created attachment 178407 [details]
gdb output

Comment 2 chernoff 2011-01-15 22:11:35 UTC

Created attachment 178409 [details]
compressed output of CAMEL_DEBUG=all evolution >& evo.log

Comment 3 Akhil Laddha 2011-01-16 05:07:45 UTC

possible dupe of bug 617038

Comment 4 chernoff 2011-01-16 17:58:43 UTC

(In reply to comment #3)
> possible dupe of bug 617038

This appears to be different from 617038 but I am no expert so perhaps this will help:

The restore operation completes normally (I specify that evolution NOT restart automatically after the restore in the dialog). 

After the restore I do a telinit 3 then telinit 5 to make sure that we're starting everything fresh.

I run evolution via gdb and/or with the CAMEL_DEBUG=all. The outputs are above.

Evolution hangs and there IS an evolution-alarm-notify process present. It looks like there is some sort of interaction with the sqlite that is blocked.

Comment 5 chernoff 2011-01-16 23:21:25 UTC

Attachments for valgind and gdb traceback with debugging symbols follow. These are from running evolution after the restore operation.

Valgrind finds an invalid read and a weird calloc call in icaltzutil_fetch_timeszone.

The gdb traceback shows the explicit operations on the sqlite db at the time things freeze.

Comment 6 chernoff 2011-01-16 23:23:23 UTC

Created attachment 178465 [details]
valgrind output

Comment 7 chernoff 2011-01-16 23:25:01 UTC

Created attachment 178466 [details]
gdb with symbols

Comment 8 chernoff 2011-01-17 15:51:05 UTC

(In reply to comment #0)
> A full backup was made under evolution 2.24.5 and an attempt to restore under
> 2.32.1. The backup was read successfully by the new version, the data
> directories created but some problem occurs in reading back the information.
> Attached is the result of running CAMEL_DEBUG=all evolution >& evo.log
> The evolution window is completely unresponsive and had to be killed. It
> appears (this is just a guess from examining the end of evo.log) that the
> sequence of messages being read/inserted into the database has a jump in number
> just before the freeze.

There is no jump in sequence number at the end. My mistake in scanning the output.

Comment 9 Milan Crha 2011-01-18 07:48:38 UTC

Is this a move from 32 bit system to 64 bit system or vice versa? I mean, the backup was made on a 32 bit system and you are about to run evolution after restore on a 64 bit system or vice versa?

Comment 10 chernoff 2011-01-18 14:57:47 UTC

(In reply to comment #9)
> Is this a move from 32 bit system to 64 bit system or vice versa? I mean, the
> backup was made on a 32 bit system and you are about to run evolution after
> restore on a 64 bit system or vice versa?

No. This was 64 bit before and after the migration from Fedora 10 to 14.

Howerver: The mail files have been rolled over from one machine to another
previously.

Briefly, I know they were on a 32 bit machine running evolution 2.22.3.1
and migrated about 2 years ago to a 64 bit machine. I didn't have any
particular problem doing so. If it would be helpful I could try to
produce a backup from the files on the original 32 bit machine and
feed them into the current 64 bit evolution to see if the problem is present.

Comment 11 Milan Crha 2011-01-19 08:12:54 UTC

(In reply to comment #10)
> If it would be helpful I could try to
> produce a backup from the files on the original 32 bit machine and
> feed them into the current 64 bit evolution to see if the problem is present.

Thanks, but there is no need to waste your time, the issue with a move from 32bit to 64bit system is still there with backups from evolution before 2.32.0.

From your backtrace with symbols I see it's stuck in sqlite3. Please try to go to ~/.local/share/evolution/mail/local and move away a folders.db file from there, and then run evolution. It may recreate it and possibly fix the issue. Note there are more folders.db files in ~/.local/share/evolution/mail subfolders, which might cause similar issue (but maybe not). It depends how many mail accounts you have configures, what type they are, and if it'll be stuck even after moving away the mentioned folders.db file, then how the backtrace changes. I see in that yours that it's working with "On This Computer" trash (".#evolution/Trash").

+ Trace 225583

Thread 1 (Thread 0x7ffff7fae980 (LWP 2673))

#0 nanosleep
from /lib64/libc.so.6
#1 usleep
from /lib64/libc.so.6
#2 unixSleep
at sqlite3.c line 26515
#3 sqlite3OsSleep
at sqlite3.c line 12647
#4 sqliteDefaultBusyCallback
at sqlite3.c line 31759
#5 sqlite3InvokeBusyHandler
at sqlite3.c line 97318
#6 btreeInvokeBusyHandler
at sqlite3.c line 40214
#7 pager_wait_on_lock
at sqlite3.c line 34743
#8 sqlite3PagerSharedLock
at sqlite3.c line 35779
#9 lockBtree
at sqlite3.c line 40803
#10 sqlite3BtreeBeginTrans
at sqlite3.c line 41055
#11 sqlite3VdbeExec
at sqlite3.c line 56074
#12 sqlite3Step
at sqlite3.c line 51732
#13 sqlite3_step
at sqlite3.c line 51792
#14 sqlite3_exec
at sqlite3.c line 76845
#15 camel_db_select
at camel-db.c line 963
#16 camel_db_read_message_info_record_with_uid
at camel-db.c line 1893
#17 message_info_from_uid
at camel-folder-summary.c line 1275
#18 camel_vee_summary_add
at camel-vee-summary.c line 483
#19 vee_folder_add_uid
at camel-vee-folder.c line 89
#20 folder_added_uid
at camel-vee-folder.c line 719
#21 g_hash_table_foreach
at ghash.c line 1328
#22 vee_folder_rebuild_folder
at camel-vee-folder.c line 1781
#23 store_get_special
at camel-store.c line 95
#24 local_get_trash
at camel-local-store.c line 226
#25 camel_store_get_folder
at camel-store.c line 469
#26 store_info_new
at e-mail-store.c line 89
#27 mail_store_add
at e-mail-store.c line 177
#28 do_async_event
at mail-mt.c line 621
#29 idle_async_event
at mail-mt.c line 633
#30 g_main_dispatch
at gmain.c line 2149
#31 g_main_context_dispatch
at gmain.c line 2702
#32 g_main_context_iterate
at gmain.c line 2780
#33 g_main_loop_run
at gmain.c line 2988
#34 IA__gtk_main
at gtkmain.c line 1237
#35 main
at main.c line 679

Comment 12 chernoff 2011-01-19 15:36:00 UTC

(In reply to comment #11)
> (In reply to comment #10)
> > If it would be helpful I could try to
> > produce a backup from the files on the original 32 bit machine and
> > feed them into the current 64 bit evolution to see if the problem is present.
> 
> Thanks, but there is no need to waste your time, the issue with a move from
> 32bit to 64bit system is still there with backups from evolution before 2.32.0.
> 
> From your backtrace with symbols I see it's stuck in sqlite3. Please try to go
> to ~/.local/share/evolution/mail/local and move away a folders.db file from
> there, and then run evolution. It may recreate it and possibly fix the issue.
> Note there are more folders.db files in ~/.local/share/evolution/mail
> subfolders, which might cause similar issue (but maybe not). It depends how
> many mail accounts you have configures, what type they are, and if it'll be
> stuck even after moving away the mentioned folders.db file, then how the
> backtrace changes. I see in that yours that it's working with "On This
> Computer" trash (".#evolution/Trash").
> 
> 

I moved the folders.db out and the new one was created; everything seems to
work, no freezes, everything intact. Thanks!

FYI: the new folders.db is roughly 1/2 the size of the old one. And there was
only one folders.db in my setup. Also, the valgrind error remains.

Comment 13 Milan Crha 2011-01-20 08:33:44 UTC

valgrind output is OK, there's a bug #633967 for it.

I'm wondering what to do now. You've it fixed and it works for me as expected (most likely because of sequential updates on my side). I'm not sure whether sharing your broken folders.db file would be of any help, because apart of sharing pretty sensitive information one should have the same folder structure with messages on the machine too, what I really do not consider a good thing to do.

Comment 14 chernoff 2011-01-20 13:02:35 UTC

3 possibilities:

1. If the *.db files can be accessed by a tool like mysql
I can characterize the differences between the good and bad db.

2. I rebuilt 2.31.1 from source (same as on my FC 14; I tried to
specify no-optimization via CFLAGS=-g -O0 but that part didn't
seem to do what I hoped -- I still see some variables optimized out)
and I could go in and "debug" if there's something specific to examine.

Of course, I know nothing about the insides of evolution so
this is probably pretty inefficient unless you can give me some
general instructions.

3. I can just wait.

One last discovery: the address book had disappeared but thats not
too important in my case.

In any case, thanks for your help.

Comment 15 Milan Crha 2011-01-21 11:55:36 UTC

(In reply to comment #14)
> 1. If the *.db files can be accessed by a tool like mysql
> I can characterize the differences between the good and bad db.

It's using sqlite3 databases, and the command is named sqlite3 too

> 2. I rebuilt 2.31.1 from source (same as on my FC 14; I tried to

2.31.1 is not in F14, it's 2.32.1. There about half-year difference between these two versions ;)

> specify no-optimization via CFLAGS=-g -O0 but that part didn't
> seem to do what I hoped -- I still see some variables optimized out)
> and I could go in and "debug" if there's something specific to examine.
> 
> Of course, I know nothing about the insides of evolution so
> this is probably pretty inefficient unless you can give me some
> general instructions.

There are none I'm aware of, this is too general issue, we may try to find out what is wrong with the folders.db file, maybe some version update between sqlite3, but doing it either through bugzilla or anyhow "offline" is too much time consuming for all interested.

> 3. I can just wait.

Wait for what? I understood that removing the old folders.db file fixed the issue for you so you are fine now.

> One last discovery: the address book had disappeared but thats not
> too important in my case.

This was reported and is fixed for 2.32.2.

Comment 16 chernoff 2011-01-24 17:38:27 UTC

(In reply to comment #15)
> (In reply to comment #14)
> > 1. If the *.db files can be accessed by a tool like mysql
> > I can characterize the differences between the good and bad db.
> 
> It's using sqlite3 databases, and the command is named sqlite3 too
> 
> > 2. I rebuilt 2.31.1 from source (same as on my FC 14; I tried to
> 
> 2.31.1 is not in F14, it's 2.32.1. There about half-year difference between
> these two versions ;)
> 
> > specify no-optimization via CFLAGS=-g -O0 but that part didn't
> > seem to do what I hoped -- I still see some variables optimized out)
> > and I could go in and "debug" if there's something specific to examine.
> > 
> > Of course, I know nothing about the insides of evolution so
> > this is probably pretty inefficient unless you can give me some
> > general instructions.
> 
> There are none I'm aware of, this is too general issue, we may try to find out
> what is wrong with the folders.db file, maybe some version update between
> sqlite3, but doing it either through bugzilla or anyhow "offline" is too much
> time consuming for all interested.
> 
> > 3. I can just wait.
> 
> Wait for what? I understood that removing the old folders.db file fixed the
> issue for you so you are fine now.
> 
> > One last discovery: the address book had disappeared but thats not
> > too important in my case.
> 
> This was reported and is fixed for 2.32.2.

Yes I recompiled 2.32.1 and yes its working for me. 

Here are a few more clues:

I examined the database integrity with sqlite3 (folders.db) prior to starting the restore and its OK. After the restore it has an unreferenced page.

I traced to the point at which the database gets locked in evolution.
It occurs for a particular message uid = 35296. I dumped the folders.db
and found TWO statements that appear to refer to the message;

INSERT INTO ".#evolution/Trash" VALUES('feFI23dL35296');
INSERT INTO ".#evolution/Trash" VALUES('KhCGGnx35296');

There is only one occurrence of this sort of statement for each
message handled correctly before 35296.

So the hint is that the folders.db file contains evidence that
the same message was deleted more than once.

The schematic form of record 35296 looks identical to the one
successfully moved to trash in the previous operation. I will attach
examples.

Comment 17 chernoff 2011-01-24 17:40:49 UTC

Created attachment 179194 [details]
2 records from folders.db

Comment 18 Alexandre Franke 2013-12-20 22:52:47 UTC

I don't expect this report to move anywhere further and I see you managed to get your environment back in a working state, so I'm closing this as obsolete. Feel free to reopen if you think there's actually a need to do anything else regarding the issue.