Bug 634645 - Issue with non-ascii characters in ZIP files created in Windows system
Status: RESOLVED OBSOLETE
Product: file-roller
Classification: Applications
Component: general
Version: 2.32.x
Hardware: Other Linux
Importance: Normal normal
Target Milestone: ---
Assigned To: file-roller-maint
QA Contact: file-roller-maint
Depends on:
Blocks:
 
 
Reported: 2010-11-11 23:46 UTC by Luca Ferretti
Modified: 2020-11-11 19:13 UTC
See Also:
GNOME target: ---
GNOME version: ---


Attachments
Test zip file created on Windows 7 (304 bytes, application/zip)
2010-11-11 23:46 UTC, Luca Ferretti
Test ZIP files created on linux (ubuntu 10.10) (466 bytes, application/zip)
2010-11-11 23:52 UTC, Luca Ferretti
Zip created in Windows using 7zip (29.07 KB, image/png)
2010-12-19 11:46 UTC, Luca Ferretti

Description Luca Ferretti 2010-11-11 23:46:28 UTC
Created attachment 174282
Test zip file created on Windows 7

Attached is a simple ZIP file created with the built-in "compress folder" feature in Windows (7, but from my experience I suspect the same occurs with other versions). The ZIP file stores 3 empty files with the following names:

  * 1000 ¥.txt
  * Ænima.txt
  * cipì.txt

The Yen sign, Æ and ì appear in File Roller as unrecognized characters (i.e. as ?).
When extracting or mounting in Nautilus, the icon caption reports "invalid encoding" (this, of course, comes from GIO [1]).


[1] http://git.gnome.org/browse/glib/tree/gio/glocalfileinfo.c#n1568
Comment 1 Luca Ferretti 2010-11-11 23:50:41 UTC
It seems the same "issue" occurs when creating the ZIP in GNOME and trying to open it in Windows.

However, in this case, file-roller is still unable to recognize non-ascii characters.
Comment 2 Luca Ferretti 2010-11-11 23:52:47 UTC
Created attachment 174296
Test ZIP files created on linux (ubuntu 10.10)

Empty files using the same non-ascii characters. File Roller shows "??", and Windows (7) is unable to recognize them either. However, there is no issue extracting them in Linux itself.
Comment 3 vlog 2010-12-12 10:07:42 UTC
I am uncompressing files created under Windows with File Roller (automatically invoked by GNOME) under Ubuntu 10.04.
I confirm the problem with compressed files created under Windows (XP, Vista) with different archivers (unknown; the files come from my clients).
Each time there is a special character (like French éèàç...) in a file name, it is impossible to extract the file.
So it is a BLOCKING bug.

The same file is processed without any problem on the same computer using WinZip under XP (running in VirtualBox).

It seems to be the same bug as bug 559481 (well described in comment 2).
Comment 4 Luca Ferretti 2010-12-19 11:46:19 UTC
Created attachment 176698
Zip created in Windows using 7zip

I tried installing 7zip (Ubuntu 10.10) as suggested on bug #559481, but there are still issues...
Comment 5 Ma Hsiao-chun 2012-11-12 17:30:09 UTC
ZIP archives can be either Unicode enabled or not Unicode enabled.
(7Z archives are always Unicode enabled)

To determine whether your archive is Unicode enabled or not on Linux, do the following (tested on UnZip 6.0):
unzip -l foo.zip
unzip -lU foo.zip

If your archive is Unicode enabled, you should get some '?' in the first invocation and some escaped Unicode characters as #UXXXX or #LYYYYYY in the second invocation.

If you get the same result from both, then your archive is not Unicode enabled.
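
The same check can be approximated without UnZip. The following is a minimal Python sketch, assuming that "Unicode enabled" means an entry either has general-purpose flag bit 11 (0x800) set or carries an Info-ZIP Unicode Path extra field (header ID 0x7075). The helper names `has_unicode_path` and `unicode_report` are made up for this illustration and are not part of any tool mentioned here.

```python
import struct
import zipfile

UTF8_FLAG = 0x800         # general-purpose bit 11: name is stored as UTF-8
UNICODE_PATH_ID = 0x7075  # Info-ZIP Unicode Path extra field header ID


def has_unicode_path(extra: bytes) -> bool:
    """Walk the extra-field records (id, size, data) looking for 0x7075."""
    pos = 0
    while pos + 4 <= len(extra):
        header_id, size = struct.unpack_from("<HH", extra, pos)
        if header_id == UNICODE_PATH_ID:
            return True
        pos += 4 + size
    return False


def unicode_report(zip_path):
    """Return a list of (name as decoded by zipfile, unicode_enabled) pairs."""
    with zipfile.ZipFile(zip_path) as zf:
        return [
            (info.filename,
             bool(info.flag_bits & UTF8_FLAG) or has_unicode_path(info.extra))
            for info in zf.infolist()
        ]
```

Entries whose names are pure ASCII are usually reported as not Unicode enabled as well; that is harmless, since ASCII names read the same in every encoding involved here.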

p7zip treats file names in ZIP archives literally, so it works for Unicode enabled archives only.

To extract ZIP archives that are not Unicode enabled, you can use a patched UnZip (found in Arch and Ubuntu) that supports character encoding conversion during extraction.

You may also try unar from The Unarchiver project, which supports encoding conversion directly.
http://code.google.com/p/theunarchiver/

Some people even write their own Python scripts based on the 'zipfile' module to extract ZIP archives that are not Unicode enabled.
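
A rough sketch of what such a script might look like is given below. It relies on the fact that Python 3's 'zipfile' decodes names without the UTF-8 flag as CP437, so encoding the name back to CP437 recovers the original bytes, which can then be decoded with the real code page. The function names `fix_name` and `extract_legacy_zip` and the default encoding `cp850` are assumptions for this example only; the correct encoding depends on the Windows locale that created the archive and has to be supplied by the user.

```python
import shutil
import zipfile
from pathlib import Path

UTF8_FLAG = 0x800  # general-purpose bit 11: name is stored as UTF-8


def fix_name(name: str, encoding: str) -> str:
    """Undo zipfile's CP437 decoding and decode the raw bytes as `encoding`."""
    try:
        raw = name.encode("cp437")
    except UnicodeEncodeError:
        return name  # not a CP437-decoded name, leave it untouched
    return raw.decode(encoding)


def extract_legacy_zip(zip_path, dest, encoding="cp850"):
    """Extract zip_path into dest, converting non-UTF-8 names from `encoding`."""
    dest = Path(dest).resolve()
    extracted = []
    with zipfile.ZipFile(zip_path) as zf:
        for info in zf.infolist():
            name = info.filename
            if not info.flag_bits & UTF8_FLAG:
                name = fix_name(name, encoding)
            target = (dest / name).resolve()
            # Refuse entries that would end up outside the destination.
            if target != dest and dest not in target.parents:
                raise ValueError(f"unsafe path in archive: {name!r}")
            if info.is_dir():
                target.mkdir(parents=True, exist_ok=True)
                continue
            target.parent.mkdir(parents=True, exist_ok=True)
            with zf.open(info) as src, open(target, "wb") as out:
                shutil.copyfileobj(src, out)
            extracted.append(name)
    return extracted
```

Usage would be along the lines of `extract_legacy_zip("foo.zip", "out", encoding="cp850")`. The sketch does not restore timestamps or permissions, and a wrong `encoding` silently produces wrong names rather than an error in most cases.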

Anyway, you currently cannot handle ZIP archives that are not Unicode enabled with the File Roller GUI.
Comment 6 André Klapper 2020-11-11 19:13:37 UTC
bugzilla.gnome.org is being replaced by gitlab.gnome.org. We are closing all old bug reports and feature requests in GNOME Bugzilla which have not seen updates for a long time.

If you still use file-roller and if you still see this bug / want this feature in a currently supported version of GNOME (currently that would be 3.38), then please feel free to report it at https://gitlab.gnome.org/GNOME/file-roller/-/issues/

Thank you for creating this report and we are sorry it could not be implemented (volunteer workforce and time is limited).