Bug 77134 – Select-by-word should conform to LC_CTYPE

After an evaluation, GNOME has moved from Bugzilla to GitLab. Learn more about GitLab.
No new issues can be reported in GNOME Bugzilla anymore.
To report an issue in a GNOME project, go to GNOME GitLab.
Do not go to GNOME Gitlab for: Bluefish, Doxygen, GnuCash, GStreamer, java-gnome, LDTP, NetworkManager, Tomboy.

Bug 77134 - Select-by-word should conform to LC_CTYPE


Summary:	Select-by-word should conform to LC_CTYPE


Status:	RESOLVED FIXED

Product:	gnome-terminal
Classification:	Core
Component:	general
Version:	unspecified
Hardware:	Other other

Importance:	Normal normal
Target Milestone:	---
Assigned To:	GNOME Terminal Maintainers
QA Contact:	GNOME Terminal Maintainers

URL:
Whiteboard:

Duplicates:	90558 (view as bug list)
Depends on:
Blocks:

Reported:	2002-04-01 02:52 UTC by Dennis Haney
Modified:	2007-12-05 02:10 UTC

See Also:
GNOME target:	---
GNOME version:	2.9/2.10

Attachments
using fixed(misc) 12 (7.95 KB, image/png) 2002-04-29 20:08 UTC, Dennis Haney	Details
using fixed(misc) 13 (7.95 KB, image/png) 2002-04-29 20:08 UTC, Dennis Haney	Details
using andale mono 72 (8.82 KB, image/png) 2002-04-29 20:09 UTC, Dennis Haney	Details

Description Dennis Haney 2002-04-01 02:53:11 UTC

Package: gnome-terminal
Severity: normal
Version: 1.9.2
Synopsis: Select-by-word should conform to LC_CTYPE
Bugzilla-Product: gnome-terminal
Bugzilla-Component: general

Description:
Description of Problem:
Select-by-word is currently default -A-Za-z0-9,./?%&#, whereas my locale
says I have ÃÃÃÃ¦Ã¸Ã¥ as word chars too...

Steps to reproduce the problem:
1. 
2. 
3. 

Actual Results:


Expected Results:


How often does this happen?


Additional Information:




------- Bug moved to this database by unknown@bugzilla.gnome.org 2002-03-31 21:53 -------

Reassigning to the default owner of the component, hp@redhat.com.

Comment 1 Havoc Pennington 2002-04-01 03:48:04 UTC

I see no way to do this while also making it configurable as it
currently is.

Comment 2 Dennis Haney 2002-04-01 03:50:17 UTC

When making new profile read LC_* thats all...

Comment 3 Telsa Gwynne 2002-04-14 10:38:37 UTC

Not a fix, but a work-around:

gnome-terminal -> preferences -> select-by-word-characters

-A-Za-z0-9,./?%&#_жи-іј-џ

Doesn't catch quite everything, but it's a start. Make sure
it's in ~/.gnome/Terminal as 'wordclass' under Config : I
once did this only for tclasses, not for the default gnome-terminal
and spent ages why it only worked in transparent (or whatever
terminal class I had created) terminals. 

I'm mostly adding this for people whoi query bugzilla and need a
workaround in the interim :)

Comment 4 Dennis Haney 2002-04-29 20:08:26 UTC

Created attachment 8036 [details]
using fixed(misc) 12

Comment 5 Dennis Haney 2002-04-29 20:08:35 UTC

Created attachment 8037 [details]
using fixed(misc) 13

Comment 6 Dennis Haney 2002-04-29 20:09:00 UTC

Created attachment 8038 [details]
using andale mono 72

Comment 7 Dennis Haney 2002-04-29 20:11:00 UTC

ARG :( Wrong bug :( Should have been in bug 77130

Comment 8 Havoc Pennington 2002-09-22 21:46:53 UTC

Conceivably the right answer is to translate the .schemas file...

Comment 9 Havoc Pennington 2002-09-22 22:10:21 UTC

*** Bug 90558 has been marked as a duplicate of this bug. ***

Comment 10 Nalin Dahyabhai 2002-09-24 20:25:53 UTC

The locale system doesn't appear to provide useful "is this a word
character or not" information, so the closest you can I think you can
get is to approximate it using the various g_unichar_isXXXXX() functions.

Comment 11 Noah Levitt 2003-07-09 17:12:17 UTC

I'd suggest something like

  g_unichar_isgraph (c) && !g_unichar_ispunct (c)

And then change the select-by-word option to "additional"
select-by-word characters, and check for those too.

Comment 12 Havoc Pennington 2003-07-09 18:24:38 UTC

Of course, this is all totally broken; the idea of "word characters"
doesn't produce the right word boundaries in many languages, you 
really have to run the huge algorithm in Pango. What we're discussing 
here is the equivalent of "let's just assume all text is Latin-1"

Comment 13 Noah Levitt 2003-07-09 19:34:32 UTC

I wouldn't say "totally". Most languages use spacing characters to
separate words, and this method is pretty satisfactory for those. For
others, I don't think it's worth trying to solve the problem.

Comment 14 Behdad Esfahbod 2006-03-19 05:50:46 UTC

I totally agree with Noah.  What should we do with the preferences thing then?  Anybody willing to submit a patch?

Comment 15 Mariano Suárez-Alvarez 2007-01-26 05:16:19 UTC

We -could- remove the preference...

Comment 16 Behdad Esfahbod 2007-01-26 05:39:13 UTC

I think we can close this bug.  We now use g_unichar_* functions to detect word chars, and only use the preference option, if set, for the ASCII range.

Comment 17 Mariano Suárez-Alvarez 2007-01-26 06:19:08 UTC

We have to change the default in the /schemas/apps/gnome-terminal/profiles/Default/word_chars schema to be "" so that we get the Unicode thing by default, and that's it, no?

Comment 18 Behdad Esfahbod 2007-01-26 06:39:40 UTC

As I said, the setting is only used for the ASCII range.  So, something like "a-zA-Z_" is better than empty, because we really want '_' as a word char, at least in the terminal.