After an evaluation, GNOME has moved from Bugzilla to GitLab. Learn more about GitLab.
No new issues can be reported in GNOME Bugzilla anymore.
To report an issue in a GNOME project, go to GNOME GitLab.
Do not go to GNOME Gitlab for: Bluefish, Doxygen, GnuCash, GStreamer, java-gnome, LDTP, NetworkManager, Tomboy.
Bug 497367 - don't use US-ASCII for output encoding
don't use US-ASCII for output encoding
Status: RESOLVED FIXED
Product: gtk-doc
Classification: Platform
Component: general
1.9
Other All
: Normal normal
: 1.10
Assigned To: gtk-doc maintainers
gtk-doc maintainers
Depends on:
Blocks:
 
 
Reported: 2007-11-16 13:07 UTC by Kouhei Sutou
Modified: 2007-12-11 15:28 UTC
See Also:
GNOME target: ---
GNOME version: ---


Attachments
Ues UTF-8 instead of US-ASCII (643 bytes, patch)
2007-11-16 13:08 UTC, Kouhei Sutou
committed Details | Review

Description Kouhei Sutou 2007-11-16 13:07:04 UTC
Please describe the problem:
We can't use non-ASCII encoding (like Japanese) with gtk-doc because gtk-doc.xsl uses US-ASCII for default.encoding and chunker.output.encoding value.
Yes, we can get *.html but they has 'content="text/html; charset=US-ASCII"' and non-ASCII characters are replaced by character entity reference. e.g. 「あ」 -> 「あ」

Steps to reproduce:
1. % cd html
2. % gtkdoc-mkhtml sample ../sample-docs.sgml # sample-docs.sgml includes 「あ」
3. % grep 'あ' index.html > /dev/null && echo found





Actual results:
'found' is echoed.

Expected results:
'found' isn't echoed.

Does this happen every time?
Yes.

Other information:
Comment 1 Kouhei Sutou 2007-11-16 13:08:48 UTC
Created attachment 99207 [details] [review]
Ues UTF-8 instead of US-ASCII

Another (more flexible) option is that gtkdoc-mkhtml accepts '--encoding' option.
Comment 2 Kouhei Sutou 2007-12-10 01:20:11 UTC
generate.index template uses utf-8 as its encoding:

gtk-doc.xsl:
  <xsl:template name="generate.index">
    ...
    <xsl:with-param name="encoding" select="'utf-8'"/>
    ...
  </xsl:template>

It seems that all encodings are unified.
Comment 3 Stefan Sauer (gstreamer, gtkdoc dev) 2007-12-11 15:28:12 UTC
2007-12-11  Stefan Kost  <ensonic@users.sf.net>

	patch by: Kouhei Sutou <kou@cozmixng.org>

	* gtk-doc.xsl:
	  Consitently use UTF-8 instead of US-ASCII for output encoding.
	  Fixes #497367