GNOME Bugzilla – Bug 647153
One bad entity discards any next valid entity when parsing.
Last modified: 2021-07-05 13:26:38 UTC
Created attachment 185512 [details] Code illustrating bug. When parsing very simple xml with one bad entity like: "<a><b>black &foo; white</b><c>red & green</c></a>" parser (lxml frontend) created with options: ns_clean=True,dtd_validation=False,recover=True,remove_pis=True Clears not only first _bad_ entity, but any other next _valid_ one. Example python lxml code in attachement.
GNOME is going to shut down bugzilla.gnome.org in favor of gitlab.gnome.org. As part of that, we are mass-closing older open tickets in bugzilla.gnome.org which have not seen updates for a longer time (resources are unfortunately quite limited so not every ticket can get handled). If you can still reproduce the situation described in this ticket in a recent and supported software version, then please follow https://wiki.gnome.org/GettingInTouch/BugReportingGuidelines and create a new ticket at https://gitlab.gnome.org/GNOME/libxml2/-/issues/ Thank you for your understanding and your help.