GNOME Bugzilla – Bug 664227
[0.10.x] & [0.12.x] Strange behaviour with Libreoffice files
Last modified: 2016-05-13 23:55:13 UTC
Tracker doesnt index correctly Libreoffice files. If I write a simple text it is not indexed but, strangely, if I apply to the text an intestation style, tracker works. The same problem is on 0.10.x and 0.12.x while 0.8.x seems to work fine.
0.8 used odt2txt below; while in newer versions we have our own extractor. Could you attach the simple file to test with it?
Created attachment 201584 [details] doesnt work
Created attachment 201585 [details] works
esempio.odt (searching for "australopitecus" it doesnt work) esempio2.odt (works) dpkg -l | grep libgsf ii libgsf-1-114 1.14.21-1 Structured File Library - runtime version ii libgsf-1-common 1.14.21-1 Structured File Library - common files ii libgsf-gnome-1-114 1.14.21-1 Structured File Library - runtime version for GNOME dpkg -l | grep tracker ii libtracker-extract-0.10-0 0.10.35-1~0guiodic1 tracker extractor library ii libtracker-miner-0.10-0 0.10.35-1~0guiodic1 tracker data miner library ii libtracker-sparql-0.10-0 0.10.35-1~0guiodic1 metadata database, indexer and search tool - library ii tracker 0.10.35-1~0guiodic1 metadata database, indexer and search tool ii tracker-extract 0.10.35-1~0guiodic1 metadata database, indexer and search tool - metadata extractors ii tracker-gui 0.10.35-1~0guiodic1 metadata database, indexer and search tool - GNOME frontends ii tracker-miner-fs 0.10.35-1~0guiodic1 metadata database, indexer and search tool - filesystem indexer ii tracker-utils 0.10.35-1~0guiodic1 metadata database, indexer and search tool - commandline tools
up :-)
I installed tracker 0.10.36 but the problem is still here. Thak you.
(In reply to comment #4) > esempio.odt (searching for "australopitecus" it doesnt work) > esempio2.odt (works) You're right. It seems the nie:plainTextContent for esempio2.odt is found, but not for esempio.odt. I am testing this on master myself. On the face of it, the files look identical when opening in LibreOffice. What actual difference is there do you know? I did check after some conversion and only found this difference between the two files: -- martyn@prunus:~/Downloads$ odt2txt /home/martyn/Downloads/esempio2.odt > 2.txt martyn@prunus:~/Downloads$ odt2txt /home/martyn/Downloads/esempio.odt > 1.txt martyn@prunus:~/Downloads$ diff -u 1.txt 2.txt --- 1.txt 2011-12-06 16:38:46.753843101 +0000 +++ 2.txt 2011-12-06 16:38:39.873843166 +0000 @@ -2,4 +2,5 @@ Testo di esempio. Australopitecus Africanus +========================= -- I will do some more testing tomorrow to see if that's related at all.
On the face of it, the files look identical when opening in LibreOffice. What actual difference is there do you know? In esempio2.odt, "Australopitecus Africanus" is in header style, while in esempio it isnt.
*** Bug 668032 has been marked as a duplicate of this bug. ***
We've had some improvements to the extractor over the releases, is this still an issue?
This is fixed.