After an evaluation, GNOME has moved from Bugzilla to GitLab. Learn more about GitLab.
No new issues can be reported in GNOME Bugzilla anymore.
To report an issue in a GNOME project, go to GNOME GitLab.
Do not go to GNOME Gitlab for: Bluefish, Doxygen, GnuCash, GStreamer, java-gnome, LDTP, NetworkManager, Tomboy.
Bug 491850 - Allow option to ignore stopwords
Allow option to ignore stopwords
Status: RESOLVED FIXED
Product: tracker
Classification: Core
Component: Store
git master
Other All
: Normal normal
: ---
Assigned To: tracker-general
Jamie McCracken
: 526344 (view as bug list)
Depends on:
Blocks:
 
 
Reported: 2007-10-30 20:38 UTC by dpr
Modified: 2010-05-20 16:34 UTC
See Also:
GNOME target: ---
GNOME version: ---



Description dpr 2007-10-30 20:38:11 UTC
Please describe the problem:
It is really hard to find files or narrow down the result count because of the use of stopwords in file names.

Steps to reproduce:
1. Have an album from Amon Tobin with file amon_tobin-always.mp3
2. Try to find it with two keywords: tobin and always.


Actual results:
All Amon Tobin songs are displayed.

Expected results:
Only one file should be displayed.

Does this happen every time?
Yes

Other information:
I can see two possible solutions:
1) Use stopwords only when indexing text content of the file.
For file names and other meta-data the stopword list should not be used and only the three character minimum may be enforced.
2) Use shorter stopword list.

#1 would be my favorite.
Comment 1 dpr 2007-10-30 20:41:21 UTC
tracker-0.6.3, Ubuntu Gutsy final.
Comment 2 Tshepang Lekhonkhobe 2008-04-14 18:58:29 UTC
Wouldn't adding an option to disable stopwords be superior?

* http://bugzilla.gnome.org/show_bug.cgi?id=526344
Comment 3 Martyn Russell 2010-03-11 15:22:21 UTC
Can the reporter of this bug set the _version_ so we can target bugs to work on more easily please. We can also have an idea about if these bugs are likely to be obsolete too.

If you don't set the version, we are less likely to look at them.

Sorry for the spam, but I don't want to say this 88 times on each bug :)
Comment 4 Martyn Russell 2010-03-16 15:12:42 UTC
*** Bug 526344 has been marked as a duplicate of this bug. ***
Comment 5 bgruber 2010-04-10 08:04:03 UTC
it seems like nowadays, the result is that rather than getting all the results, you get none, which is even worse.

martyn, the reporter of this bug only filed this one bug, ever, and that was 2.5 years ago. i think it's unlikely that he or she is going to come back and look at your request to set the version. as i am not the reporter, i cannot set the version, but i am using tracker 0.6.95 right now.
Comment 6 Nikolaus 2010-05-17 19:26:19 UTC
I was stupefied by this as well. How about adding a warning to tracker-search and the GUI if some words of the search query are ignored?
Comment 7 Martyn Russell 2010-05-17 21:02:33 UTC
(In reply to comment #6)
> I was stupefied by this as well. How about adding a warning to tracker-search
> and the GUI if some words of the search query are ignored?

We do this for tracker-search in 0.8+, we didn't do this for 0.6 though.
The tracker-search-tool could use this technique too.
Comment 8 Aleksander Morgado 2010-05-18 18:38:45 UTC
I added a new "IgnoreStopWords" option in tracker-fts.cfg, by default TRUE (meaning that stopwords are not considered, as currently being done). If set to FALSE, stopwords are indexed as any other word.

This patch is in the parser-unicode-libs-review branch.
Comment 9 Martyn Russell 2010-05-20 16:34:13 UTC
This problem has been fixed in the development version. The fix will be available in the next major software release. Thank you for your bug report.