https://bugs.kde.org/show_bug.cgi?id=438455

skierpage <skierp...@gmail.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
            Summary|Baloo doesn't index         |Baloo doesn't index some
                   |Microsoft Office .doc files |Microsoft Office .doc files

--- Comment #2 from skierpage <skierp...@gmail.com> ---
So it turns out Baloo did and can index the contents of other .doc files, e.g.
external .doc files I received in 2016 and earlier, and `catdoc` displays their
contents. But catdoc displays nothing for the two files that Baloo doesn't
index: the recent .doc file I received and the .doc file generated by
LibreOffice 7.1.3.2.

I couldn't find any Linux utility that identifies the version of the Word file
format that a .doc file uses, or whether it has been saved with Word's "Fast
Save" feature. The two failing documents contain the string "Microsoft
Word-Dokument" near the front, whereas the working ones contain "Microsoft Word
9.0" or "Microsoft Word 97-2004 Document" near the end.

So the problem here seems to be with KFileMetaData and its use of catdoc. I
couldn't find a bug report saying that catdoc doesn't support some Word file
formats; its maintainer's CVStrac is dead, and the most active bug list seems
to be Debian's bug tracker.
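
For anyone who wants to repeat the string check above on their own files, here is a rough sketch in Python. It only looks for the three marker strings quoted in this comment and reports how far into the file each one occurs. It is not a real Word format identifier, and whether these strings reliably predict catdoc's behaviour is untested. The names `MARKERS`, `find_markers` and `report` are made up for this example, and trying both a single-byte and a UTF-16LE encoding is an assumption about how the strings might be stored.

```python
from pathlib import Path

# Marker strings observed in the failing and working .doc files (from this comment).
MARKERS = (
    "Microsoft Word-Dokument",
    "Microsoft Word 9.0",
    "Microsoft Word 97-2004 Document",
)


def find_markers(data):
    """Return {marker: relative offset between 0 and 1} for each marker found."""
    found = {}
    for marker in MARKERS:
        # Assumption: the string may be stored as single-byte text or as UTF-16LE.
        for encoding in ("ascii", "utf-16-le"):
            pos = data.find(marker.encode(encoding))
            if pos != -1:
                found[marker] = pos / len(data)
                break
    return found


def report(path):
    """Print where each marker string occurs in the file at `path`."""
    data = Path(path).read_bytes()
    hits = find_markers(data)
    if not hits:
        print(f"{path}: none of the marker strings found")
    for marker, rel in hits.items():
        print(f"{path}: {marker!r} at {rel:.0%} of the file")
```

Calling `report("some-file.doc")` (a placeholder path) on one failing and one working document should show whether the "near the front" versus "near the end" pattern holds for them.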