[ 
https://issues.apache.org/jira/browse/OAK-6414?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chetan Mehrotra resolved OAK-6414.
----------------------------------
       Resolution: Fixed
    Fix Version/s: 1.7.4

Implemented the new approach as mentioned in description with 1800726

> Use Tika config to determine non indexed mimeTypes
> --------------------------------------------------
>
>                 Key: OAK-6414
>                 URL: https://issues.apache.org/jira/browse/OAK-6414
>             Project: Jackrabbit Oak
>          Issue Type: Technical task
>          Components: lucene
>            Reporter: Chetan Mehrotra
>            Assignee: Chetan Mehrotra
>             Fix For: 1.8, 1.7.4
>
>
> With OAK-2895 support was added to avoid loading of binary content whose 
> mimeType have been excluded from indexing via configuring EmptyParser against 
> them. That approach used a lazyInputStream and relied on the fact that Tika 
> would not access the stream if none of the parser is going to touch that file.
> However as seen while upgrading to Tika 1.15 now Tika would [check that the 
> InputStream support marking or 
> not|https://github.com/apache/tika/commit/896c46a0c652de436da0e4f25bfa53a7d83ae02f].
>  
> To support this change we need to change the logic on Oak side to explicit 
> check by reading tika-config.xml to see which all mimeType have been 
> configured with EmptyParser



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to