GNOME Bugzilla – Bug 497367
don't use US-ASCII for output encoding
Last modified: 2007-12-11 15:28:23 UTC
Please describe the problem: We can't use non-ASCII encoding (like Japanese) with gtk-doc because gtk-doc.xsl uses US-ASCII for default.encoding and chunker.output.encoding value. Yes, we can get *.html but they has 'content="text/html; charset=US-ASCII"' and non-ASCII characters are replaced by character entity reference. e.g. 「あ」 -> 「あ」 Steps to reproduce: 1. % cd html 2. % gtkdoc-mkhtml sample ../sample-docs.sgml # sample-docs.sgml includes 「あ」 3. % grep 'あ' index.html > /dev/null && echo found Actual results: 'found' is echoed. Expected results: 'found' isn't echoed. Does this happen every time? Yes. Other information:
Created attachment 99207 [details] [review] Ues UTF-8 instead of US-ASCII Another (more flexible) option is that gtkdoc-mkhtml accepts '--encoding' option.
generate.index template uses utf-8 as its encoding: gtk-doc.xsl: <xsl:template name="generate.index"> ... <xsl:with-param name="encoding" select="'utf-8'"/> ... </xsl:template> It seems that all encodings are unified.
2007-12-11 Stefan Kost <ensonic@users.sf.net> patch by: Kouhei Sutou <kou@cozmixng.org> * gtk-doc.xsl: Consitently use UTF-8 instead of US-ASCII for output encoding. Fixes #497367