GNOME Bugzilla – Bug 114777
Text export of accented characters broken
Last modified: 2004-12-22 21:47:04 UTC
[Originally reported as http://bugs.debian.org/189281] From: Toni Mueller <support@oeko.net> Subject: gnumeric: exporting to text mangles LATIN1 characters to escape sequence Package: gnumeric Version: 1.1.16-6 Severity: normal Tags: upstream Hello, exporting a spreadsheet with accented characters, eg. german umlauts, yields a file with escape sequences consisting of octal 303 followed by another byte that should indicate what character was intended (ie, octal 174 for lower case u umlaut) in place for the accented characters. There doesn't appear to be a way to adjust how this is handled, and it prevents reimporting the resulting CSV file as a spread sheet.
Testing with 1.1.19, things seem to have gotten worse. I've tested using lines from latin1(8) and exported File -> Save As -> Text export (configurable) -> Character encoding: Locale: Western (ISO-8859-1) and in the resulting file the 8-bit characters have all been replaced by "ÿ¿¿¿¿¿".
This bug is not surprising: we convert the utf8 content of the cell into the appropriate encoding and then we check the string whether we need to quote anything assuming that we still have utf8...
fixed in cvs