GNOME Bugzilla – Bug 630471
[tag] ligatures "Œ" and "œ" are not supported in freeform strings
Last modified: 2010-09-28 09:42:50 UTC
Created attachment 170966 [details] [review]
patch for supporting Ligatures Œ and œ.
The ligatures "Œ" and "œ" were removed from ISO-8859-1 encoding. So the are not supporting in tag extraction.I think we can change the final encoding fallback(if all the other assumptions about input string's encodings are failing) in gst_tag_freeform_string_to_utf8 (gst-plugins-base/gst-libs/gst/tag/tags.c ) from ISO-8859-1 to Windows-1252.Since Windows-1252 is the superset of ISO-8859-1 ,i think it wont create any issue.
Makes sense, committed, thanks for the patch (I kept a paranoia-fallback for ISO-8859-1 though):
Author: Tim-Philipp Müller <firstname.lastname@example.org>
Date: Mon Sep 27 14:36:17 2010 +0100
tags: try ISO-8859-1 as second fallback in case WINDOWS-1252 is not supported
Better safe than sorry. Some embedded systems may use crippled iconv
implementations or not support WINDOWS-1252 for other reasons.
Author: Sreerenj Balachandran <email@example.com>
Date: Thu Sep 23 23:53:48 2010 +0300
tags: when converting freeform strings try Windows-1252 as fallback instead of ISO-8859-1
Windows-1252 is a superset of ISO-8859-1, which uses some space
allocated to control characters for additional printable characters.