[ 
https://issues.apache.org/jira/browse/TIKA-888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13242397#comment-13242397
 ] 

Uwe Schindler commented on TIKA-888:
------------------------------------

{quote}
Couldn't you take the Parser out of the file:

org.apache.tika.parser.Parser

(e.g., the Service loading mechanism). If you remove the 
org.apache.tika.parser.netcdf.NetCDFParser and 
org.apache.tika.parser.hdf.HDFParser entries from that file, the user will 
never reach the NetCDF or HDF Parser, right? I think you guys can provide your 
own custom copy of this file, and make sure it's at the root of the classpath 
in Solr Cell and then it will take your guys version over the baked in one for 
the tika-parsers jar.
{quote}

That's exactly *not* possible. SPI collects all 
/META-INF/services/org.apache.tika.parser.Parser it can find on classpath and 
collects all Parsers it can find. Removin parsers from one file does not help 
(order of classpath does not matter), as the SPI builds a set of all collected 
parsers from all META-INF files.

That's the problem I ran into. By the way, Lucene 4.0 is now also using SPI for 
their codec/posting format support, so I know about what I am talking :-)

Because of this I replicated your single-SPI file parser and made a simple 
Collection<Parser> out of it, that I pass to AutoDetectParser. System SPI is 
then ignored.
                
> NetCDF parser uses Java 6 JAR file and test/compilation fails with Java 1.5, 
> although TIKA is Java 1.5
> ------------------------------------------------------------------------------------------------------
>
>                 Key: TIKA-888
>                 URL: https://issues.apache.org/jira/browse/TIKA-888
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.0
>            Reporter: Uwe Schindler
>            Assignee: Chris A. Mattmann
>
> Lucene/Solr developers ran this tool before releasing Lucene/Solr 3.6 (Solr 
> 3.6 is still required to run on Java 1.5, see SOLR-3295): 
> http://code.google.com/p/versioncheck/
> {noformat}
> Major.Minor Version : 50.0             JAVA compatibility : Java 1.6 
> platform: 45.3-50.0
> Number of classes : 60
> Classes are: 
> c:\Work\lucene-solr\.\solr\contrib\extraction\lib\netcdf-4.2-min.jar [:] 
> ucar/unidata/geoloc/Bearing.class
> ...
> {noformat}
> TIKA should use a 1.5 version of this class and especially do some Java 5 
> tests before releasing (as it's build dependencies says, it's minimum Java5). 
> I tried to compile and run TIKA tests with Java 1.5 -> crash (Invalid class 
> file format).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to