GNOME Bugzilla – Bug 337317
Non-ascii characters are not always handled gracefully
Last modified: 2006-07-06 21:47:32 UTC
On the server news.daimi.au.dk (which has global read access), the following message ids are available: 1) Message-ID: <dv8gfk$oeo$1@news.net.uni-c.dk> 2) Message-ID: <wvxirqhxi5n.fsf@hadrian.daimi.au.dk> In the case of 1), non-ascii characters are displayed correctly in the body pane, whereas in 2) the first non-ascii character in the body causes the rest of the body to get lost. This is possibly due to the fact that 2) does not have a Content-Type header, but I can't be sure. This is illustrated in http://gallery.overgaard.org/misc/pan-0.91-error-1.jpg
Created attachment 62778 [details] Image showing how a portion of the message body gets lost
Created attachment 62810 [details] [review] possible patch to fix the encoding bug
Søren, could you test out the patch in comment #2?
Hi Charles, I'm having trouble with the patch: boll@andariel [sid]:~/sandbox/pan/pan-0.91/pan/usenet-utils$ patch < ~/sandbox/pan/encoding-patch.txt patching file utf8-utils.cc Hunk #3 FAILED at 252. 1 out of 3 hunks FAILED -- saving rejects to file utf8-utils.cc.rej boll@andariel [sid]:~/sandbox/pan/pan-0.91/pan/usenet-utils$ Was it possibly not made against 0.91, or did I do something completely wrong?
You're probably right. Here's the whole file.
Created attachment 62820 [details] patched version of utf8-utils.cc
Ok, i've recompiled it with your patch. It no longer chokes on non-ascii characters, although it doesn't actually display them. Nothing is displayed in their place, the text simply continues. If you want, I will be happy to provide a screenshot.
grumble. :) What group is <wvxirqhxi5n.fsf@hadrian.daimi.au.dk> in?
daimi.test (an internal group) on news.daimi.au.dk. You may not have read access to that server. Another message which triggers the error is <44314881$0$47005$edfadb0f@dread15.news.tele.dk> which is in the group dk.edb.programmering. That should be available on most news servers.
Created attachment 68497 [details] cached copy of the message cited above This is a copy of the message mentioned above. As of 0.102, Pan is trimming "nævnt" to "nvnt".
Created attachment 68501 [details] [review] patch against 0.102 that fixes the test case. This patch makes the test case work right and will be in 0.103. If you find any problems with this, feel free to reopen this ticket and let me know. Thank you!