https://bugs.kde.org/show_bug.cgi?id=438455

skierpage <skierp...@gmail.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
            Summary|Baloo doesn't index         |Baloo doesn't index some
                   |Microsoft Office .doc files |Microsoft Office .doc files

--- Comment #2 from skierpage <skierp...@gmail.com> ---
So it turns out Baloo did and can index contents of other .doc files, e.g.
external .doc files I received in 2016 and earlier, and `catdoc` displays their
contents; but catdoc doesn't display anything for the contents of the recent
.doc file I received or the .doc file generated by LibreOffice 7.1.3.2 that
Baloo doesn't index. I couldn't find any Linux utility that identifies the
version of the Word file format that a .doc file uses, or whether it's been
saved with Word's "Fast Save" feature. The two failing documents contain the
string "Microsoft Word-Dokument" near the front, whereas the working ones
contain "Microsoft Word 9.0" or "Microsoft Word 97-2004 Document" near the end.

So the problem here seems to be with KFileMetaData and its use of catdoc. I
couldn't find a bug that catdoc doesn't support some Word file formats; its
maintainer's CVStrac is dead, the most active bug list seems to be Debian's bug
tracker.

-- 
You are receiving this mail because:
You are watching all bug changes.

Reply via email to