After an evaluation, GNOME has moved from Bugzilla to GitLab. Learn more about GitLab.
No new issues can be reported in GNOME Bugzilla anymore.
To report an issue in a GNOME project, go to GNOME GitLab.
Do not go to GNOME Gitlab for: Bluefish, Doxygen, GnuCash, GStreamer, java-gnome, LDTP, NetworkManager, Tomboy.
Bug 439655 - Keywords are not tags
Keywords are not tags
Status: RESOLVED NOTABUG
Product: tracker
Classification: Core
Component: General
unspecified
Other All
: Normal minor
: ---
Assigned To: Jamie McCracken
Jamie McCracken
Depends on:
Blocks:
 
 
Reported: 2007-05-19 11:36 UTC by Jonas De Vuyst
Modified: 2010-05-17 13:32 UTC
See Also:
GNOME target: ---
GNOME version: ---



Description Jonas De Vuyst 2007-05-19 11:36:07 UTC
Tracker automatically adds tags to office documents and PDFs for all keywords stored in these files. This behaviour is annoying.

Let me make my point with an example. Document keywords will often contain names. Now although I wouldn't mind too badly if a document was tagged 'Kuhn', it's really silly if it's also tagged with this philosopher's very common first name, 'Thomas'. Even worse is when names have a Von-part (to use BibTeX terminology). This clutters the tag list with words like 'Van' (translates to: 'of') and 'De' ('the').

Populating the tag database with document keywords is annoying for a second reason also. Some keyword-enabled documents that I have come from the Internet. This means that some of the tags in my tracker DB fall outside my control. If tags are intended to replace directories for organising files, this clearly is undesirable behaviour.

Keywords are a great feature. They make searching for documents easier. They are not, however, tags.

Other information:
Comment 1 Luca Ferretti 2007-07-07 10:59:09 UTC
I agree.

Keywords (special metadata extracted from files) should be used to set the relevance of items when showing search results and should not (directly) appear to users.
For example, lets me assume you are searching for a document with the word "financial" and only results are 2 ODFs; the first one is a spreadsheet containing "financial" in as keyword, the second one only in its contents. The fist document should be more relevant then the other.

Tags, instead should be fully user-controllable (edit, apply, remove...) providing a similiar relevance behavior, but they should be orthogonal to keywords.

I hope this distinction could land on Xesam.
Comment 2 Mikkel Kamstrup Erlandsen 2007-07-20 11:43:17 UTC
Luca: These things are taken into account in the upcoming xesam ontology, so no worries.

Another thing is if the indexers will repsect the ontology and do sane hit ranking :-)
Comment 3 Jamie McCracken 2007-07-21 22:48:20 UTC
thats right the kweywords are not stored against userKeywords so this is not a bug
Comment 4 Martyn Russell 2010-05-17 13:32:33 UTC
Moving "Indexer" component bugs to "General" since "Indexer" refers to the old 0.6 architecture