GNOME Bugzilla – Bug 706530
[PATCH] epub more metadata support
Last modified: 2013-09-13 11:27:39 UTC
Created attachment 252652 [details] [review] epub extract metadata Authors were stored with nco:Contact, now nco:PersonContact. Added nco:Role for authors, editors, and illustrators. add language, subject, description, uuid, and isbn. Example output of tracker-extract -f progit.epub: a nfo:TextDocument ; nco:publisher [ a nco:Contact ; nco:fullname "Springer"] ; nie:description "licensed under the Creative Commons Attribution-Non Commercial-Share Alike 3.0 license" ; nie:language "en" ; nco:creator [ a nco:PersonContact ; nco:nameFamily "Chacon" ; nco:nameGiven "Scott" , [ a nco:Role ; nco:role "aut"]] ; nie:title "Pro Git" ; nie:identifier "bf50c6e1-eb0a-4a1c-a2cd-ea8809ae086a" ; nie:identifier "9781430218333" . The two identifiers are UUID and ISBN. If something should be stored differently, let me know. Also, calibre adds a rating tag, should that be indexed? If so, how?
Previous patch broke something, tracker-info has no results for any epub file, although tracker-extract doesn't have any warnings or errors (not even the ones the old version had). Will post update later.
Hi Martin, do you have an ePub file to test with that you could attach here? It would help in my testing. Thanks for the patch too! :)
I've started a branch for this: https://git.gnome.org/browse/tracker/log/?h=epub-metadata-improvements I've cleaned up the patch and fixed a few things here and there, but currently the SPARQL isn't quite correct: (lt-tracker-miner-fs:16739): Tracker-CRITICAL **: Could not execute sparql: GDBus.Error:org.freedesktop.Tracker1.SparqlError.Parse: 19.39: syntax error, no support for nested anonymous blank nodes This needs fixing before merging to master. But thanks for making a start on this!
This problem has been fixed in the development version. The fix will be available in the next major software release. Thank you for your bug report.