After an evaluation, GNOME has moved from Bugzilla to GitLab. Learn more about GitLab.
No new issues can be reported in GNOME Bugzilla anymore.
To report an issue in a GNOME project, go to GNOME GitLab.
Do not go to GNOME Gitlab for: Bluefish, Doxygen, GnuCash, GStreamer, java-gnome, LDTP, NetworkManager, Tomboy.
Bug 313583 - Update Unicode tables to Unicode 4.1
Update Unicode tables to Unicode 4.1
Status: RESOLVED FIXED
Product: glib
Classification: Platform
Component: general
2.8.x
Other All
: Normal normal
: ---
Assigned To: gtkdev
gtkdev
Depends on: 313857
Blocks: 313907
 
 
Reported: 2005-08-16 03:27 UTC by Behdad Esfahbod
Modified: 2011-02-18 15:49 UTC
See Also:
GNOME target: ---
GNOME version: Unversioned Enhancement


Attachments
Unicode 4.1 patch (856.29 KB, patch)
2005-08-16 03:28 UTC, Behdad Esfahbod
committed Details | Review
requested warning patch (412 bytes, patch)
2005-08-18 17:47 UTC, Behdad Esfahbod
committed Details | Review
warning patch for docs (792 bytes, patch)
2005-09-26 15:23 UTC, Behdad Esfahbod
committed Details | Review

Description Behdad Esfahbod 2005-08-16 03:27:08 UTC
I'm attaching patch to update glib's data and test tables to Unicode 4.1.
Pango 1.10 contains Unicode 4.1 data, so would be good to have Glib with 4.1 too.
Comment 1 Behdad Esfahbod 2005-08-16 03:28:37 UTC
Created attachment 50753 [details] [review]
Unicode 4.1 patch

The patch adds 5 entries to the Line Breaking enum, and updates the data files.
 No code changes other than the enum.
Comment 2 Behdad Esfahbod 2005-08-17 01:00:09 UTC
Note that this patch breaks Pango:

Pango-ERROR **: file break.c: line 780 (pango_default_break): assertion failed:
(IN_BREAK_TABLE (break_type))

I'm working on a patch for Pango, but the fact that it breaks older Pangos is a
bit, well...
Comment 3 Matthias Clasen 2005-08-18 14:10:53 UTC
Behdad, the pango patch should make sure that it prevents similar problems in
the future. And the glib patch should probably add a warning to the docs, that
the enumerations might grow due to additions in future Unicode versions.
Comment 4 Behdad Esfahbod 2005-08-18 17:32:12 UTC
Thanks Matthias.
Safety patch for pango: bug #313857
Comment 5 Behdad Esfahbod 2005-08-18 17:47:56 UTC
Created attachment 50933 [details] [review]
requested warning patch

Tiny patch to add a warning to the docs about future additions and recommending
to regard unknown values as G_UNICODE_BREAK_UNKNOWN.
Comment 6 Julien Gilli 2005-08-19 00:47:27 UTC
Thank you very much for your bug report ! However, I was not able to reproduce
this bug. I applied the unicode 4.1 patch to glib HEAD and it didn't break pango
as you describe in your comments. Can you please tell us what program trigger
the issue ? Also, could you please specify the glib version that is affected ?
Comment 7 Behdad Esfahbod 2005-08-19 01:00:33 UTC
All glib versions are affected.  The bug triggers whenever a Korean Hangul
character is tried to be rendered, since they are the characters that use the
newly defined G_UNICODE_BREAK_* types.  Just run ./pango-*view ./HELLO.utf8 in
pango/examples and you get the abort.

Bug 313857 contains a patch for Pango to not abort.  Bug 313907 contains a patch
for Pango to use the new line-breaking types.
Comment 8 Julien Gilli 2005-08-19 01:12:04 UTC
Thank you very much. I reproduced the bug, applied your fixes to my working
copies and it works great.
Comment 9 Julien Gilli 2005-08-19 19:48:16 UTC
I chose to set GNOME Version to 2.11/2.12 even if we are in string freeze since
there is no next version in the GNOME Version list box.
Comment 10 Julien Gilli 2005-08-19 19:51:34 UTC
Forget my GNOME Version problem, I was triaging two bugs at the same time, and
choose the wrong GNOME Version for this one. Sorry.
Comment 11 Matthias Clasen 2005-09-26 15:03:07 UTC
Behdad, whats the status of this ? 
I would actually like the warning to be added to the api docs, not just in a
comment in the header.
Comment 12 Matthias Clasen 2005-09-26 15:04:24 UTC
I guess we should probably not do this on the stable branch, to avoid breaking
pango, but we should do it soon in HEAD.
Comment 13 Behdad Esfahbod 2005-09-26 15:23:53 UTC
Created attachment 52681 [details] [review]
warning patch for docs

Ah, sorry, thought gtkdoc picks up the doc from the comments.  This patches
docs now.
Comment 14 Behdad Esfahbod 2005-09-26 15:25:25 UTC
The three patches together do the job.  We have already applied fix to pango to
not break (not released though), but yes, HEAD only should be fine.  Can be
applied IMO.
Comment 15 Matthias Clasen 2005-09-26 16:06:16 UTC
I would like to see a paragraph added in the long description of unicode.sgml
which spells out the supported Unicode versions. Something like

" GLib 2.8 supports Unicode 4.0, GLib 2.10 supports Unicode 4.1."

And maybe explain a little bit where these version differences may show
up in the API.

Can you commit it to had with that extra documentation, Behdad ?
Comment 16 Behdad Esfahbod 2005-09-26 16:08:42 UTC
Sure.  Later today.
Comment 17 Behdad Esfahbod 2005-10-01 12:35:16 UTC
Committed after reworking the documentation, and updating the enum values in the
docs too.

2005-10-01  Behdad Esfahbod  <behdad@gnome.org>

        * docs/reference/glib/tmpl/unicode.sgml:
        * glib/gen-unicode-tables.pl:
        * glib/gunibreak.h:
        * glib/gunichartables.h:
        * glib/gunicode.h:
        * tests/casefold.txt:
        * tests/casemap.txt: Updated to Unicode 4.1.  There are five new
        GUnicodeBreakType types.  That may break some applications, like
        Pango <= 1.10.