>hsp wrote:
>> What could the warning message about LazyReader mean? Even after
>> rebuilding the indexes, I find only one .doc file: the one I added
>> separately (it is the same file that is inside the zip file, with the
>> same properties, inside an nt:resource node).
>
>It seems that the underlying text filter has difficulties reading the 
>word document. the warning indicates that the binary data could not be 
>converted into a text representation. hence a fulltext query for that 
>document will not work.
Thanks for the advice. The file was indeed being unzipped incorrectly in
some situations. That bug is now fixed, and the indexes are created for
all the files currently in the session. Thank you very much for your help.
>
>iirc we are using apache poi to extract text from word documents. to 
>further track down the problem I suggest you try to read the word 
>document using the plain poi library without jackrabbit. If it doesn't 
>work that way either it's not a jackrabbit core issue.
>
>regards
>  marcel
>
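
For anyone hitting the same warning after extracting files from an archive: a cheap sanity check, before involving POI at all, is to confirm that the extracted .doc still starts with the OLE2 compound-file signature that binary Word documents use. The sketch below is not the POI test marcel suggests, only a coarser pre-check that uses nothing but the JDK; the class and method names are made up for illustration. A file that passes can still fail text extraction, so the fuller test remains reading it with POI on its own (for example with WordExtractor from org.apache.poi.hwpf.extractor).

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.Arrays;

// Hypothetical helper, not part of Jackrabbit or POI.
public class DocHeaderCheck {

    // Eight-byte signature at the start of every OLE2 compound file,
    // the container format of binary (.doc) Word documents.
    private static final byte[] OLE2_MAGIC = {
        (byte) 0xD0, (byte) 0xCF, 0x11, (byte) 0xE0,
        (byte) 0xA1, (byte) 0xB1, 0x1A, (byte) 0xE1
    };

    // True if the given bytes begin with the OLE2 signature.
    public static boolean startsWithOle2Magic(byte[] data) {
        if (data.length < OLE2_MAGIC.length) {
            return false;
        }
        return Arrays.equals(Arrays.copyOf(data, OLE2_MAGIC.length), OLE2_MAGIC);
    }

    // Reads only the first eight bytes of the file and checks them.
    public static boolean looksLikeWordBinary(Path file) throws IOException {
        try (InputStream in = Files.newInputStream(file)) {
            byte[] head = new byte[OLE2_MAGIC.length];
            int read = 0;
            while (read < head.length) {
                int n = in.read(head, read, head.length - read);
                if (n < 0) {
                    break; // file is shorter than the signature
                }
                read += n;
            }
            return read == head.length && startsWithOle2Magic(head);
        }
    }

    public static void main(String[] args) throws IOException {
        for (String arg : args) {
            boolean ok = looksLikeWordBinary(Paths.get(arg));
            System.out.println(arg + ": " + (ok ? "OLE2 header present" : "not an OLE2 file"));
        }
    }
}
```

Running it against both the separately added file and the one extracted from the zip shows quickly whether the extraction step damaged the file before it ever reached the text filter.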
