Bug 123406 – need a way to do unicode normalization in atomic steps

After an evaluation, GNOME has moved from Bugzilla to GitLab. Learn more about GitLab.
No new issues can be reported in GNOME Bugzilla anymore.
To report an issue in a GNOME project, go to GNOME GitLab.
Do not go to GNOME Gitlab for: Bluefish, Doxygen, GnuCash, GStreamer, java-gnome, LDTP, NetworkManager, Tomboy.

Bug 123406 - need a way to do unicode normalization in atomic steps


Summary:	need a way to do unicode normalization in atomic steps


Status:	RESOLVED FIXED

Product:	glib
Classification:	Platform
Component:	i18n
Version:	unspecified
Hardware:	Other other

Importance:	Normal enhancement
Target Milestone:	---
Assigned To:	gtkdev
QA Contact:	gtkdev

URL:
Whiteboard:

Depends on:
Blocks:	139950

Reported:	2003-09-28 05:58 UTC by Noah Levitt
Modified:	2011-09-05 02:00 UTC

See Also:
GNOME target:	---
GNOME version:	---

Attachments
the patch that led me to make this discovery (so it doesn't get lost) (17.00 KB, text/plain) 2003-11-14 16:19 UTC, Noah Levitt	Details

Description Noah Levitt 2003-09-28 05:58:28 UTC

Need a way to do unicode normalization in one normalization unit at a time.
Pango wants this so that it can render canonically equivalent strings
identically.

Comment 1 Noah Levitt 2003-09-28 06:03:53 UTC

One of our assumptions was wrong-- a normalization unit isn't
necessarily the same as a combining character sequence. For instance,
U+09CB = U+09C7 + U+09BE, but U+09C7 and U+09BE both have combining
class zero. I'm not sure there's a simpler way to figure out the
atomic normalization units than to see what happens when you do the
whole normalization.

Comment 2 Noah Levitt 2003-11-14 16:19:24 UTC

Created attachment 21459 [details]
the patch that led me to make this discovery (so it doesn't get lost)

Comment 3 alexander.winston 2004-01-25 00:54:22 UTC

Adding the PATCH keyword and marking priority as high.

Comment 4 Noah Levitt 2004-01-30 22:05:35 UTC

The patch is informative, not meant to be used.

Comment 5 Matthias Clasen 2011-09-05 02:00:49 UTC

We have this now.