Hi,

We have solved our problem in another way.
There is a file \org\apache\jackrabbit\core\query\lucene\tika-config.xml that is located in jackrabbit-core.jar

We added section:
<parser class="org.apache.tika.parser.EmptyParser">
<mime>application/vnd.openxmlformats-officedocument.spreadsheetml.sheet</mime>
</parser>

And commented this section:
<parser name="parse-pdf" class="org.apache.tika.parser.pdf.PDFParser">
<mime>application/pdf</mime>
</parser>

Is there a chance to configure it in another way? Otherwise we will have to change tika-config.xml manually each time we make a build.. Maybe your solution about parameters in workspace.xml will solve the problem?

Regards,
Anton

One thing more ...

If you have problems to start jackrabbit you could add following in the 
workspace.xml
in the failing workspace.

<SearchIndex class="org.apache.jackrabbit.core.query.lucene.SearchIndex">
...
<param name="forceConsistencyCheck" value="true"/>
<param name="autoRepair" value="true"/>
<param name="onWorkspaceInconsistency" value="log"/>
...


see also
https://issues.apache.org/jira/browse/JCR-2651

greets
claus

Reply via email to