Bug 123406 - need a way to do unicode normalization in atomic steps
Status: RESOLVED FIXED
Product: glib
Classification: Platform
Component: i18n
Version: unspecified
Hardware: Other other
Importance: Normal enhancement
Target Milestone: ---
Assigned To: gtkdev
QA Contact: gtkdev
Depends on:
Blocks: 139950
Reported: 2003-09-28 05:58 UTC by Noah Levitt
Modified: 2011-09-05 02:00 UTC

See Also:
GNOME target: ---
GNOME version: ---


Attachments
the patch that led me to make this discovery (so it doesn't get lost) (17.00 KB, text/plain)
2003-11-14 16:19 UTC, Noah Levitt

Description Noah Levitt 2003-09-28 05:58:28 UTC
Need a way to do Unicode normalization one normalization unit at a time.
Pango wants this so that it can render canonically equivalent strings
identically.
Comment 1 Noah Levitt 2003-09-28 06:03:53 UTC
One of our assumptions was wrong: a normalization unit isn't
necessarily the same as a combining character sequence. For instance,
U+09CB decomposes canonically to U+09C7 + U+09BE, yet U+09C7 and
U+09BE both have combining
class zero. I'm not sure there's a simpler way to figure out the
atomic normalization units than to see what happens when you do the
whole normalization.
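
The observation can be checked with a small sketch. It uses Python's
standard unicodedata module purely for illustration; it is not glib
code, and combining_sequences is a hypothetical helper that encodes
the original (wrong) assumption that a unit is a starter followed by
marks with nonzero combining class.

```python
import unicodedata

composed = "\u09CB"          # BENGALI VOWEL SIGN O
decomposed = "\u09C7\u09BE"  # BENGALI VOWEL SIGN E + BENGALI VOWEL SIGN AA


def combining_sequences(text):
    """Hypothetical helper: split text into combining character sequences,
    i.e. a class-zero character plus any following characters whose
    canonical combining class is nonzero."""
    units = []
    for ch in text:
        if units and unicodedata.combining(ch) != 0:
            units[-1] += ch   # nonzero class: attach to the current unit
        else:
            units.append(ch)  # class zero: start a new unit
    return units


# U+09CB canonically decomposes to U+09C7 U+09BE.
print(unicodedata.normalize("NFD", composed) == decomposed)   # True

# Both pieces have combining class zero, so the split separates them.
print([unicodedata.combining(c) for c in decomposed])         # [0, 0]
print(len(combining_sequences(decomposed)))                   # 2

# Normalizing each sequence on its own misses the composition that
# normalizing the whole string performs.
piecewise = "".join(unicodedata.normalize("NFC", u)
                    for u in combining_sequences(decomposed))
whole = unicodedata.normalize("NFC", decomposed)
print(piecewise == whole)                                     # False
```

Because the piecewise result differs from the whole-string result, a
combining character sequence cannot be treated as an atomic unit for
NFC in this case.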
Comment 2 Noah Levitt 2003-11-14 16:19:24 UTC
Created attachment 21459
the patch that led me to make this discovery (so it doesn't get lost)
Comment 3 alexander.winston 2004-01-25 00:54:22 UTC
Adding the PATCH keyword and marking priority as high.
Comment 4 Noah Levitt 2004-01-30 22:05:35 UTC
The patch is informative, not meant to be used.
Comment 5 Matthias Clasen 2011-09-05 02:00:49 UTC
We have this now.
