After an evaluation, GNOME has moved from Bugzilla to GitLab. Learn more about GitLab.
No new issues can be reported in GNOME Bugzilla anymore.
To report an issue in a GNOME project, go to GNOME GitLab.
Do not go to GNOME Gitlab for: Bluefish, Doxygen, GnuCash, GStreamer, java-gnome, LDTP, NetworkManager, Tomboy.
Bug 333407 - Extremely slow import of plain text datafile
Extremely slow import of plain text datafile
Status: RESOLVED FIXED
Product: Gnumeric
Classification: Applications
Component: import/export Text
1.6.x
Other All
: Normal normal
: ---
Assigned To: Morten Welinder
Jody Goldberg
Depends on:
Blocks:
 
 
Reported: 2006-03-04 22:33 UTC by Luke Hutchison
Modified: 2006-03-05 02:49 UTC
See Also:
GNOME target: ---
GNOME version: ---


Attachments
The file, gzipped (238.99 KB, application/x-gzip)
2006-03-04 22:36 UTC, Luke Hutchison
Details

Description Luke Hutchison 2006-03-04 22:33:21 UTC
The attached sheet is extremely slow to import.  It has 110 columns and 952 rows, but even stripping it down to 10 rows using "head" it takes forever for Gnumeric 1.6.2 to import it.  This is true using "fixed width" with column autodiscovery, and field-delimited with "see two adjacent separators as one" checked.

It should not take this long, which leads me to think that there is some very inefficient algorithm being employed at some point.  Probably a quadratic-in-number-of-columns algo or something.
Comment 1 Luke Hutchison 2006-03-04 22:36:05 UTC
Created attachment 60662 [details]
The file, gzipped

The file.  It's not huge:

$ wc walk.txt
    952  103768 1862112 walk.txt
Comment 2 Morten Welinder 2006-03-04 23:32:40 UTC
Import is almost instantaneous.

But going from the fixed/csv page to the format page takes a close
approximation to forever.
Comment 3 Morten Welinder 2006-03-05 02:37:24 UTC
Fixed in cvs.

Initialization was triggering the format page's setup and it created a
treeview with ~1350 columns.  GTK+ was spending a long time removing
columns from that one by one.
Comment 4 Morten Welinder 2006-03-05 02:49:18 UTC
And for good measure, the gtk+ slowness is now bug 333429.