GNOME Bugzilla – Bug 172294
RTF filter ignores some part of documents.
Last modified: 2006-07-28 04:07:27 UTC
Please describe the problem: Footnotes/endnotes are not getting extracted. Also, the filter extracts unwanted font details and other strings.
Font names and other unwanted strings are no longer extracted; this is fixed in CVS. Footnote/endnote extraction is yet to be done. :)
Also, font attributes such as bold, italics, and underline are not handled properly.
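To illustrate what handling font attributes involves: in RTF, \b, \i and \ul switch an attribute on, a 0 parameter (for example \b0) switches it off, \plain resets character formatting, and a closing brace restores the state that was active when the group opened. Below is a minimal Python sketch of that bookkeeping. It is not the Beagle filter code; the names TOKEN, FLAGS and styled_runs are made up for the example, and it ignores everything except these three attributes.

```python
import re

# Hypothetical tokenizer for this sketch only.
TOKEN = re.compile(
    r"\\([a-z]+)(-?\d+)? ?"   # control word, optional parameter, optional delimiter space
    r"|\\(.)"                 # control symbol (matched so \{ is not mistaken for a group)
    r"|([{}])"                # group delimiters
    r"|([^\\{}]+)",           # run of plain text
    re.DOTALL,
)

# Control words tracked by this sketch and the attribute each one toggles.
FLAGS = {"b": "bold", "i": "italic", "ul": "underline"}


def styled_runs(rtf):
    """Return a list of (text, frozenset of active attributes) pairs."""
    runs = []
    state = frozenset()   # attributes active at the current position
    stack = []            # saved states, one per open group
    for m in TOKEN.finditer(rtf):
        word, param, _symbol, brace, text = m.groups()
        if brace == "{":
            stack.append(state)          # remember formatting at group start
        elif brace == "}":
            if stack:
                state = stack.pop()      # formatting is scoped to the group
        elif word in FLAGS:
            if param == "0":
                state = state - {FLAGS[word]}   # e.g. \b0 turns bold off
            else:
                state = state | {FLAGS[word]}
        elif word == "ulnone":
            state = state - {"underline"}
        elif word == "plain":
            state = frozenset()          # reset character formatting
        elif text:
            runs.append((text, state))
    return runs


sample = r"{\rtf1 plain {\b bold {\i both}}\ul under\ul0 off}"
for run_text, attrs in styled_runs(sample):
    print(repr(run_text), sorted(attrs))
```

The group stack is the important part: without it, an attribute set inside a nested group would leak into the text that follows the group.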
*** Bug 307106 has been marked as a duplicate of this bug. ***
For this bug, you can consult the Microsoft RTF 1.5 specification and look at the DoPull() method to solve it.
I am currently working on this bug. I found the Microsoft MSDN RTF specification useful for understanding the MS Word RTF format.
Following are some current problems with the RTF filter regarding extraction of MS Word RTFs (w.r.t. beagle-0.0.11):
i. Header/footer not getting extracted.
ii. Date, time, and text inside text boxes not getting extracted.
However, footnotes and the bold, italics, and underline properties get extracted well as far as beagle-0.0.11 is concerned.
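For anyone reproducing this, the behaviour asked for in this bug can be sketched roughly as follows: skip metadata destinations such as the font table, but keep the text of \header, \footer and \footnote groups. This is a minimal Python illustration under stated assumptions, not the Beagle filter: the names SKIPPED, KEPT and extract_text are invented here, every \* (ignorable) destination is dropped wholesale, hex escapes are assumed to be Windows-1252, and text boxes, fields and Unicode escapes are not handled.

```python
import re

# Destinations whose contents are metadata rather than document text.
SKIPPED = {"fonttbl", "colortbl", "stylesheet", "info"}
# Destinations whose text should be kept for indexing.
KEPT = {"footnote", "header", "footer"}

TOKEN = re.compile(
    r"\\'([0-9a-fA-F]{2})"     # hex-escaped character, e.g. \'e9
    r"|\\([a-z]+)(-?\d+)? ?"   # control word, optional parameter, optional delimiter space
    r"|\\(.)"                  # control symbol, e.g. \* or \{
    r"|([{}])"                 # group delimiters
    r"|([^\\{}]+)",            # run of plain text
    re.DOTALL,
)


def extract_text(rtf):
    """Return indexable text from an RTF string (simplified sketch)."""
    out = []
    stack = []          # saved 'skipping' flag, one per open group
    skipping = False    # True while inside a destination we do not index
    ignorable = False   # True right after a \* control symbol
    for m in TOKEN.finditer(rtf):
        hexchar, word, _param, symbol, brace, text = m.groups()
        if brace == "{":
            stack.append(skipping)
        elif brace == "}":
            skipping = stack.pop() if stack else False
        elif word is not None:
            if ignorable or word in SKIPPED:
                skipping = True              # drop the rest of this group
            elif skipping:
                pass
            elif word in KEPT:
                out.append(" ")              # separate note text from body text
            elif word in ("par", "line"):
                out.append("\n")
            elif word == "tab":
                out.append("\t")
            ignorable = False
        elif symbol == "*":
            ignorable = True
        elif skipping:
            continue
        elif hexchar is not None:
            # Assumption: the document uses the Windows-1252 code page.
            out.append(bytes([int(hexchar, 16)]).decode("cp1252", errors="replace"))
        elif symbol is not None:
            if symbol in "{}\\":
                out.append(symbol)           # escaped literal brace or backslash
        elif text:
            # Raw line breaks in RTF source are not part of the text.
            out.append(text.replace("\r", "").replace("\n", ""))
    return "".join(out)


sample = (r"{\rtf1\ansi{\fonttbl{\f0\fswiss Arial;}}"
          r"\f0 Body text{\footnote Note text} end.\par}")
print(repr(extract_text(sample)))
```

With this sample, the font name "Arial" is dropped while the footnote text is kept next to the body text. A test file like the one requested for reopening the bug can be checked the same way.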
Is there any progress on this bug?
With my limited understanding of office terms like "header", "footer", and "textbox", I created RTF files with these features and tested against those files. They were all getting indexed. It looks like these features might have been added in CVS over time. Closing as fixed. If something is indeed broken, please reopen and _IMPORTANT_ attach an RTF file causing the problem.