Bug 328162 - Diacritics in topic keywords
Status: RESOLVED FIXED
Product: epiphany
Classification: Core
Component: Bookmarks
Version: git master
Hardware: Other All
Importance: Normal normal
Target Milestone: ---
Assigned To: Epiphany Maintainers
Epiphany Maintainers
Depends on:
Blocks:
 
 
Reported: 2006-01-22 13:29 UTC by Thomas HAMEL
Modified: 2008-08-14 22:56 UTC
See Also:
GNOME target: ---
GNOME version: ---


Attachments
Works for me (33.68 KB, image/png)
2007-09-26 23:11 UTC, Diego Escalante Urrelo (not reading bugmail)

Description Thomas HAMEL 2006-01-22 13:29:21 UTC
The problem here is for French, but I suppose some other languages are affected too.

My use case:

I defined a bookmark topic called "Vidéo" (I think you can guess the
translation). When I want to access this topic in the location bar, I type
"vidéo" or "Vidéo" and I get the list of related bookmarks, fine. But I'm
sometimes lazy and type "video" without the accent, and then the bookmarks
don't appear anymore. The comparison should be done without taking diacritics
into account.

After some testing I found out the problem is even more subtle. If I type "vid",
my bookmarks are there. If the next letter I type is "é", the bookmarks stay; if
I type an "a", for example, they disappear (no problem); but if I type an "e"
they stay too (great). The problem is what happens when I finish the word: with
"éo" the bookmarks are still listed, and with "eo" they disappear. This is not
consistent. Maybe a weird effect of UTF-8 encoding on two bytes...

Other information:
Comment 1 Christian Persch 2006-02-15 21:36:51 UTC
Yes, this is a side effect of the UTF-8 representation.

We should definitely do a more sophisticated search.
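
To illustrate the representation difference mentioned here, the following is a minimal Python sketch, not Epiphany's code. The helper name `utf8_hex` is hypothetical. It shows only that a precomposed "é" occupies two bytes in UTF-8 while "e" occupies one, so a plain prefix comparison does not treat them as equal. It does not attempt to reproduce the partial matching described in the report.

```python
# Illustrative only: compare the UTF-8 bytes of the accented and plain words.
accented = "vid\u00e9o"  # "vidéo" with a precomposed é (U+00E9)
plain = "video"


def utf8_hex(text):
    """Return the UTF-8 encoding of text as space-separated hex bytes."""
    return " ".join(f"{byte:02x}" for byte in text.encode("utf-8"))


print(utf8_hex(accented))            # 76 69 64 c3 a9 6f
print(utf8_hex(plain))               # 76 69 64 65 6f
print(accented.startswith("vid"))    # True
print(accented.startswith("vide"))   # False: "é" and "e" differ
```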
Comment 2 Diego Escalante Urrelo (not reading bugmail) 2006-10-03 10:36:54 UTC
I can't reproduce this anymore, and there's even a bug (#343906) complaining about the solution to this bug (making no difference between, for example, é and e).

I'm closing this.
Comment 3 Christian Persch 2006-10-03 11:32:28 UTC
Bug 343906 is only about the location entry though, while this also applies to the completion in the bookmark properties dialogue.
Comment 4 Thomas HAMEL 2006-10-29 20:55:13 UTC
The bug still exists for me in 2.16.1 from Ubuntu.
Comment 5 Reinout van Schouwen 2006-10-30 15:16:12 UTC
Does this bug need to be reopened?
Comment 6 Thomas HAMEL 2007-04-11 23:01:15 UTC
Oops, forgot to respond to this one.

Yes, it's still present, and for me it should be reopened.
Comment 7 Diego Escalante Urrelo (not reading bugmail) 2007-09-26 23:11:24 UTC
Created attachment 96265
Works for me

This works for me: if the topic is named "eeeé" and I search for "eeee", my bookmark is shown.
Comment 8 Christian Persch 2007-09-27 12:12:58 UTC
Is that é just a U+00E9 character, or the sequence U+0065 U+0301?
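
For readers unfamiliar with the distinction being asked about, here is a short Python sketch of the two spellings of "é". The helper `code_points` is a hypothetical name used only for illustration. The two strings render identically but compare unequal until both are brought to the same Unicode normalization form.

```python
import unicodedata

precomposed = "\u00e9"   # é as the single code point U+00E9
decomposed = "e\u0301"   # e (U+0065) followed by COMBINING ACUTE ACCENT (U+0301)


def code_points(text):
    """List the code points of text in U+XXXX notation."""
    return [f"U+{ord(ch):04X}" for ch in text]


print(code_points(precomposed))   # ['U+00E9']
print(code_points(decomposed))    # ['U+0065', 'U+0301']
print(precomposed == decomposed)  # False

# After normalization to a common form, the two spellings compare equal.
print(unicodedata.normalize("NFC", decomposed) == precomposed)  # True
print(unicodedata.normalize("NFD", precomposed) == decomposed)  # True
```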
Comment 9 Thomas HAMEL 2007-09-27 22:05:16 UTC
I don't have an SVN build to check if the bug is still present. But it was still in 2.18.1. 

Your use case always worked: "video" matched "vidéo" until the "o" was typed. The correct test would be to check whether the search "eeeee" matches the topic "eeeée".
Comment 10 Simos Xenitellis 2008-07-14 16:44:58 UTC
I do not think that the current western layouts produce characters with diacritics, but rather produce precomposed characters.

I think that the big question is whether to decompose the names and tags of bookmarks, as well as the search terms, before applying a search.
This is a general issue, and it is possible that it has already been addressed elsewhere, such as in tracker or beagle.
It would be good to aim for a consistent solution.
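
The decomposition approach suggested here could be sketched as follows in Python. This is an assumption about one possible design, not the fix that was eventually committed, and the names `fold` and `topic_matches` are hypothetical. Both the topic and the query are decomposed (NFD), combining marks are dropped, and case is folded before a prefix comparison.

```python
import unicodedata


def fold(text):
    """Decompose text, drop combining marks, and fold case."""
    decomposed = unicodedata.normalize("NFD", text)
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return stripped.casefold()


def topic_matches(query, topic):
    """Return True if the folded topic starts with the folded query."""
    return fold(topic).startswith(fold(query))


# Every prefix of the topic matches, with or without the accent.
for query in ("vid", "vidé", "vide", "vidéo", "video", "vida"):
    print(query, topic_matches(query, "Vidéo"))

# The test case proposed in comment 9.
print(topic_matches("eeeee", "eeeée"))  # True
```

Note that this folding deliberately erases the difference between é and e, which is the behaviour bug #343906 complains about for the location entry, so whether to apply it may depend on the context.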

Comment 11 Diego Escalante Urrelo (not reading bugmail) 2008-08-14 22:56:08 UTC
Fixed, part of #517960.