After an evaluation, GNOME has moved from Bugzilla to GitLab. Learn more about GitLab.
No new issues can be reported in GNOME Bugzilla anymore.
To report an issue in a GNOME project, go to GNOME GitLab.
Do not go to GNOME Gitlab for: Bluefish, Doxygen, GnuCash, GStreamer, java-gnome, LDTP, NetworkManager, Tomboy.
Bug 706530 - [PATCH] epub more metadata support
[PATCH] epub more metadata support
Status: RESOLVED FIXED
Product: tracker
Classification: Core
Component: Extractor
git master
Other Linux
: Normal enhancement
: ---
Assigned To: tracker-extractor
Depends on:
Blocks:
 
 
Reported: 2013-08-21 19:14 UTC by Martin Franco
Modified: 2013-09-13 11:27 UTC
See Also:
GNOME target: ---
GNOME version: ---


Attachments
epub extract metadata (9.95 KB, patch)
2013-08-21 19:14 UTC, Martin Franco
none Details | Review

Description Martin Franco 2013-08-21 19:14:44 UTC
Created attachment 252652 [details] [review]
epub extract metadata

Authors were stored with nco:Contact, now nco:PersonContact.  Added nco:Role for authors, editors, and illustrators.

add language, subject, description, uuid, and isbn.

Example output of tracker-extract -f progit.epub:

a nfo:TextDocument ;
         nco:publisher [ a nco:Contact ;
         nco:fullname "Springer"] ;
         nie:description "licensed under the Creative Commons Attribution-Non Commercial-Share Alike 3.0 license" ;
         nie:language "en" ;
         nco:creator [ a nco:PersonContact ;
         nco:nameFamily "Chacon" ;
         nco:nameGiven "Scott" , [ a nco:Role ;
         nco:role "aut"]] ;
         nie:title "Pro Git" ;
         nie:identifier "bf50c6e1-eb0a-4a1c-a2cd-ea8809ae086a" ;
         nie:identifier "9781430218333" .

The two identifiers are UUID and ISBN.

If something should be stored differently, let me know.  Also, calibre adds a rating tag, should that be indexed?  If so, how?
Comment 1 Martin Franco 2013-08-21 22:18:41 UTC
Previous patch broke something, tracker-info has no results for any epub file, although tracker-extract doesn't have any warnings or errors (not even the ones the old version had).  Will post update later.
Comment 2 Martyn Russell 2013-08-30 08:14:44 UTC
Hi Martin, do you have an ePub file to test with that you could attach here? It would help in my testing.

Thanks for the patch too! :)
Comment 3 Martyn Russell 2013-09-11 16:32:27 UTC
I've started a branch for this:

  https://git.gnome.org/browse/tracker/log/?h=epub-metadata-improvements

I've cleaned up the patch and fixed a few things here and there, but currently the SPARQL isn't quite correct:

  (lt-tracker-miner-fs:16739): Tracker-CRITICAL **: Could not execute sparql: GDBus.Error:org.freedesktop.Tracker1.SparqlError.Parse: 19.39: syntax error, no support for nested anonymous blank nodes

This needs fixing before merging to master. But thanks for making a start on this!
Comment 4 Martyn Russell 2013-09-13 11:27:39 UTC
This problem has been fixed in the development version. The fix will be available in the next major software release. Thank you for your bug report.