Bug 664227 - [0.10.x] & [0.12.x] Strange behaviour with Libreoffice files
Status: RESOLVED FIXED
Product: tracker
Classification: Core
Component: Extractor
Version: 0.10.x
Hardware: Other Linux
Importance: Normal normal
Target Milestone: ---
Assigned To: tracker-extractor
QA Contact: Jamie McCracken
Duplicates: 668032
Depends on:
Blocks:
 
 
Reported: 2011-11-16 21:35 UTC by Guido
Modified: 2016-05-13 23:55 UTC
See Also:
GNOME target: ---
GNOME version: ---


Attachments
doesnt work (8.58 KB, application/vnd.oasis.opendocument.text)
2011-11-17 11:45 UTC, Guido
works (8.73 KB, application/vnd.oasis.opendocument.text)
2011-11-17 11:45 UTC, Guido

Description Guido 2011-11-16 21:35:22 UTC
Tracker doesn't index LibreOffice files correctly. If I write plain text, it is not indexed but, strangely, if I apply a heading style to the text, Tracker indexes it.

The same problem occurs on 0.10.x and 0.12.x, while 0.8.x seems to work fine.
Comment 1 Aleksander Morgado 2011-11-17 08:23:26 UTC
0.8 used odt2txt underneath, while newer versions have our own extractor. Could you attach the simple file so we can test with it?
Comment 2 Guido 2011-11-17 11:45:26 UTC
Created attachment 201584
doesnt work
Comment 3 Guido 2011-11-17 11:45:59 UTC
Created attachment 201585
works
Comment 4 Guido 2011-11-17 11:47:01 UTC
esempio.odt (searching for "australopitecus" doesn't work)
esempio2.odt (works)




dpkg -l | grep libgsf
ii  libgsf-1-114         1.14.21-1  Structured File Library - runtime version
ii  libgsf-1-common      1.14.21-1  Structured File Library - common files
ii  libgsf-gnome-1-114   1.14.21-1  Structured File Library - runtime version for GNOME

dpkg -l | grep tracker

ii  libtracker-extract-0.10-0   0.10.35-1~0guiodic1 tracker extractor library
ii  libtracker-miner-0.10-0     0.10.35-1~0guiodic1 tracker data miner library
ii  libtracker-sparql-0.10-0    0.10.35-1~0guiodic1 metadata database, indexer and search tool - library
ii  tracker                     0.10.35-1~0guiodic1 metadata database, indexer and search tool
ii  tracker-extract             0.10.35-1~0guiodic1 metadata database, indexer and search tool - metadata extractors
ii  tracker-gui                 0.10.35-1~0guiodic1 metadata database, indexer and search tool - GNOME frontends
ii  tracker-miner-fs            0.10.35-1~0guiodic1 metadata database, indexer and search tool - filesystem indexer
ii  tracker-utils               0.10.35-1~0guiodic1 metadata database, indexer and search tool - commandline tools
Comment 5 Guido 2011-11-28 13:37:31 UTC
up :-)
Comment 6 Guido 2011-12-02 20:28:21 UTC
I installed Tracker 0.10.36, but the problem is still there.

Thank you.
Comment 7 Martyn Russell 2011-12-06 16:39:57 UTC
(In reply to comment #4)
> esempio.odt (searching for "australopitecus" doesn't work)
> esempio2.odt (works)

You're right. It seems the nie:plainTextContent for esempio2.odt is found, but not for esempio.odt. I am testing this on master myself.

On the face of it, the files look identical when opened in LibreOffice. Do you know what the actual difference is?

I did check after some conversion and only found this difference between the two files:

--

martyn@prunus:~/Downloads$ odt2txt /home/martyn/Downloads/esempio2.odt > 2.txt
martyn@prunus:~/Downloads$ odt2txt /home/martyn/Downloads/esempio.odt > 1.txt
martyn@prunus:~/Downloads$ diff -u 1.txt 2.txt 
--- 1.txt	2011-12-06 16:38:46.753843101 +0000
+++ 2.txt	2011-12-06 16:38:39.873843166 +0000
@@ -2,4 +2,5 @@
 Testo di esempio.
 
 Australopitecus Africanus
+=========================
 
--

I will do some more testing tomorrow to see if that's related at all.
Comment 8 Guido 2011-12-06 17:33:32 UTC
(In reply to comment #7)
> On the face of it, the files look identical when opened in LibreOffice. Do
> you know what the actual difference is?

In esempio2.odt, "Australopitecus Africanus" is in a heading style, while in esempio.odt it isn't.
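
For anyone who wants to look at this difference at the markup level, here is a minimal sketch in Python. It assumes the heading-styled line is stored in content.xml as a text:h element and the plain line as a text:p element, which is how the OpenDocument format represents headings and paragraphs. The attachments themselves were not inspected here, and the in-memory sample files built by make_sample_odt are hypothetical stand-ins, not real ODT documents. This only lists what is in the file; it is not Tracker's extractor and does not show where the extractor goes wrong.

```python
import io
import zipfile
import xml.etree.ElementTree as ET

# Namespaces defined by the OpenDocument format.
TEXT_NS = "urn:oasis:names:tc:opendocument:xmlns:text:1.0"
OFFICE_NS = "urn:oasis:names:tc:opendocument:xmlns:office:1.0"


def extract_text(odt_bytes):
    """Return (kind, text) pairs for each paragraph and heading in content.xml."""
    with zipfile.ZipFile(io.BytesIO(odt_bytes)) as archive:
        root = ET.fromstring(archive.read("content.xml"))
    wanted = {
        f"{{{TEXT_NS}}}p": "paragraph",
        f"{{{TEXT_NS}}}h": "heading",
    }
    results = []
    for element in root.iter():
        kind = wanted.get(element.tag)
        if kind:
            results.append((kind, "".join(element.itertext()).strip()))
    return results


def make_sample_odt(body_xml):
    """Hypothetical helper: build a stripped-down ODT-like zip in memory."""
    content = (
        f'<office:document-content xmlns:office="{OFFICE_NS}" '
        f'xmlns:text="{TEXT_NS}">'
        f"<office:body><office:text>{body_xml}</office:text></office:body>"
        "</office:document-content>"
    )
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        archive.writestr("content.xml", content)
    return buffer.getvalue()


# Stand-in for esempio.odt: both lines are plain paragraphs.
plain = make_sample_odt(
    "<text:p>Testo di esempio.</text:p>"
    "<text:p>Australopitecus Africanus</text:p>"
)
# Stand-in for esempio2.odt: the second line uses a heading element.
styled = make_sample_odt(
    "<text:p>Testo di esempio.</text:p>"
    '<text:h text:outline-level="1">Australopitecus Africanus</text:h>'
)

for name, data in (("plain", plain), ("styled", styled)):
    print(name, extract_text(data))
```

To try it on the real attachments, read the file with open("esempio.odt", "rb").read() and pass the bytes to extract_text.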
Comment 9 below 2012-02-13 10:04:30 UTC
*** Bug 668032 has been marked as a duplicate of this bug. ***
Comment 10 Martyn Russell 2014-08-21 09:19:34 UTC
We've made some improvements to the extractor over the releases. Is this still an issue?
Comment 11 Carlos Garnacho 2016-05-13 23:55:13 UTC
This is fixed.