After an evaluation, GNOME has moved from Bugzilla to GitLab. Learn more about GitLab.
No new issues can be reported in GNOME Bugzilla anymore.
To report an issue in a GNOME project, go to GNOME GitLab.
Do not go to GNOME Gitlab for: Bluefish, Doxygen, GnuCash, GStreamer, java-gnome, LDTP, NetworkManager, Tomboy.
Bug 80086 - utf8ize all the data for the first time of transferring
utf8ize all the data for the first time of transferring
Status: VERIFIED INCOMPLETE
Product: Pan
Classification: Other
Component: general
pre-0.12.0 betas
Other other
: High normal
: 0.12.1
Assigned To: Charles Kerr
Charles Kerr
: 80697 (view as bug list)
Depends on: 80002
Blocks:
 
 
Reported: 2002-04-27 23:52 UTC by kz
Modified: 2009-08-15 18:40 UTC
See Also:
GNOME target: ---
GNOME version: ---


Attachments
Kang's message (918 bytes, text/plain)
2002-10-16 19:30 UTC, Christophe Lambin
Details
screenshot (40.54 KB, image/png)
2002-10-16 19:40 UTC, Christophe Lambin
Details

Description kz 2002-04-27 23:53:05 UTC
Package: Pan
Severity: normal
Version: 0.11.91
Synopsis: multichar broken into each ascii byte in article view
Bugzilla-Product: Pan
Bugzilla-Component: general

Description:
each two byte of hangul broken into single byte, on article view.
but if I try followup to the article, the message window show me pretty
hangul well.

the single byte is <127 ascii. the one line sample is below:
"""
 ¿ì¼± ȍÃð¡ ¾ÃÃþü­ ±Ã»ç¶÷ÃÃ
ÃÃÃÃÃö¸¦ Ãôðí ÃýÃÃà µÃ½Ã´Ã¸é
ÃÿäÃÃÃÃû
"""




------- Bug moved to this database by unknown@bugzilla.gnome.org 2002-04-27 19:53 -------

Reassigning to the default owner of the component, charles@rebelbase.com.

Comment 1 Charles Kerr 2002-04-29 17:38:36 UTC
We should convert article bodies into UTF-8 at the point where they
are brought into Pan (in this case, nntp_body()) rather than in a
piecemeal fashion (works in text.c because it uf8izes before calling
update_body_pane, but fails in Reply because message-window.c doesn't
utf8ize).

This is another instance of the same problem described in 80002.

For discussion: should the .msg files in ~/.pan/cache/ be UTF-8?
It will make coding Pan simpler; do most editors handle UTF-8?
Comment 2 kz 2002-04-29 23:08:48 UTC
if pan would simple to utf8ize cache, I agree to do that.
simple is right.
we'll have native locale(non-utf8) article by 'save article'.

at this time gedit2 is absolutely support utf8.
vim6; I think I remember I've heard about utf8 support, not sure.
Comment 3 Charles Kerr 2002-05-03 19:35:33 UTC
*** Bug 80697 has been marked as a duplicate of this bug. ***
Comment 4 Charles Kerr 2002-05-25 16:42:31 UTC
Kang, does this work in 0.11.93 now?
Comment 5 Charles Kerr 2002-05-31 17:20:39 UTC
kang: ping
Comment 6 Charles Kerr 2002-06-04 04:08:18 UTC
punting NEEDINFO reports from 0.12.0 to 0.12.1.

If the bug reporters come through with more information,
they can be moved back into 0.12.0.
Comment 7 Christophe Lambin 2002-06-30 21:50:21 UTC
kang: I believe this is working correctly in 0.12.0.  Can you confirm
this ?
Comment 8 kz 2002-06-30 21:58:35 UTC
yeah, it seems to work with 0.12.1 that I've used.
(sorry to be late. my ISP drop the newsgroup service.. :(
Comment 9 kz 2002-08-13 17:36:43 UTC
somebody complained me about filter phrase not work,
thou article subject and filter phrase equivalent on display.
I think this is native<->utf8 problem inside,
like this bug 80086, and bug 80004.

tried a little dive from the beginning of phrase entry,
and lost where the filter phrase working to match subjects.

wanted not open another bug. so reopen this close one.
Comment 10 Christophe Lambin 2002-08-14 07:03:01 UTC
What release was this person using ?



Pan 0.12.94 *does* convert the subject to UTF-8 when it gets the header. The subject filter works fine on my end, even for 8bit characters.



So, could you 

1) validate that this is happening with 0.12.94 ?

2) provide a group + message_id + search criteria that is failing ?



Thanks.

Comment 11 kz 2002-08-14 17:51:48 UTC
1. I tried and failed with 0.12.94 (pan --version said me)

2. you mean you're giving a try to filter with a phrase?
a. server: news.nuri.net
b. group: han.test
c. phrase: '&#50672;&#49845;' (means 'trial' in Korean)

item c needs your attention a bit.
A. do you have Korean XIM client, ami installed?
if you have RedHat distro, ami provided may.
B. then, you turn on the im-xim module for gtk2, to allow ami to work
with gtk2.
C. launch pan and go to the specific group.
D. give focus filter entry and type this sequence. 'dustmq'. and
you'll see the Korean character building on entry.
E. press the enter and nothing left on filter.

3. I'm afraid that I misunderstood your order.
how can I send you the raw message that pan transferred?
Comment 12 Christophe Lambin 2002-08-19 22:35:17 UTC
I'll have a look to see if I can reproduce this using your steps 1&2.

Regarding 3: no need to send any raw messages. Just give us the
Message-ID of the message that you're trying to filter.

(you can see the article's Message-ID by looking at the headers -
shortcut 'h').
Comment 13 Charles Kerr 2002-10-16 16:12:55 UTC
From what I can tell, utf8 is working now.
Kang, what is the state of this bug? :)
Comment 14 kz 2002-10-16 16:30:08 UTC
<WP6r9.58501$v85.3761400@news.bora.net>
is a sample thing on han.binaries.photo.

and I think pan does not convert locale to utf8;
tried save article body and found the file is ok with full text.
but pan's article pane does not show me any text for them.
I suppose pan does not convert the text and gtk2 make the text of
native locale invisible.. :)

and, if it's possible, why don't you convert the filename
of save as.. by G_BROKEN_FILENAME=boolean?
Comment 15 Christophe Lambin 2002-10-16 19:30:17 UTC
Created attachment 11600 [details]
Kang's message
Comment 16 Christophe Lambin 2002-10-16 19:37:04 UTC
This bug report has changed contents so many times I can't work out
anymore what it is we're supposed to fix. :)

According to your last entry, you're having trouble displaying that
message. I can read it on my system. Here's what I did:

1. Set LANG=ko_KR.euckr
2. Start Pan
3. Set default charset of han.binaries.photo to 'Korean (euc-kr)'
4. Retrieve all headers for that group
5. Read message with Message-ID you mentioned.
6. Look very puzzled at strange Korean characters. :)

I'll post a screenshot, just so you can see it.
Comment 17 Christophe Lambin 2002-10-16 19:40:06 UTC
Created attachment 11603 [details]
screenshot
Comment 18 kz 2002-10-16 20:05:10 UTC
uhm.. yeah, it's nice shot with most ugly Korean font. :)
but I cannot reproduce your method.
there's no character visible, even broken.
I did GDK_USE_XFT=0|1 both.

or, this may be a ridiculous simple problem of mine.
but I cannot figure out what the problem is..
sorry for lack of info.
Comment 19 Christophe Lambin 2003-03-30 22:37:38 UTC
Haven't been able to reproduce this, but there's been lots of fixes 
in this area since 0.11.91. Could you:

1. upgrade to 0.13.92
2. verify the group's charset is right.
3. download all headers in the group.
4. see if the problem persists?

Thanks.
Comment 20 Christophe Lambin 2003-10-16 18:40:34 UTC
Closing stale bugreports. If you still see this on 0.14.2 or later,
feel free to reopen and provide more info.