[jira] [Commented] (CONNECTORS-1699) Upgrade to Tika 2.x

2022-03-07 Thread Cihad Guzel (Jira)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17502576#comment-17502576
 ] 

Cihad Guzel commented on CONNECTORS-1699:
-

Maybe any student would like to develop this issue for GSoC 2022

> Upgrade to Tika 2.x
> ---
>
> Key: CONNECTORS-1699
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1699
> Project: ManifoldCF
>  Issue Type: Improvement
>  Components: Tika extractor
>Affects Versions: ManifoldCF 2.21
>Reporter: Cihad Guzel
>Priority: Major
> Fix For: ManifoldCF next
>
>
> Tika has a new version as 2.x . We can support the new version instead of 1.x 
>  . There is a migration document here: 
> [https://cwiki.apache.org/confluence/display/TIKA/Migrating+to+Tika+2.0.0]
> Tika has released 2.3.0. Changes can be found from here: 
> https://tika.apache.org/2.3.0/



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Commented] (CONNECTORS-1699) Upgrade to Tika 2.x

2022-02-22 Thread Karl Wright (Jira)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17496421#comment-17496421
 ] 

Karl Wright commented on CONNECTORS-1699:
-

Attempting to find the dependencies for 2.3.0 comes up with this error:

{code}
[ERROR] Failed to execute goal on project mcf-tika-connector: Could not resolve 
dependencies for project 
org.apache.manifoldcf:mcf-tika-connector:jar:2.21-SNAPSHOT: Could not find 
artifact org.apache.tika:tika-parsers:jar:2.3.0 in central 
(https://repo.maven.apache.org/maven2)
{code}

It looks like 2.x repackaged everything.  Somebody therefore needs to invest 
the basic time to figure out where everything went.  A port to 2.x will likely 
require substantial work and time and I certainly do not have it right now.  
Volunteers welcome.


> Upgrade to Tika 2.x
> ---
>
> Key: CONNECTORS-1699
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1699
> Project: ManifoldCF
>  Issue Type: Improvement
>  Components: Tika extractor
>Affects Versions: ManifoldCF 2.21
>Reporter: Cihad Guzel
>Priority: Major
> Fix For: ManifoldCF next
>
>
> Tika has a new version as 2.x . We can support the new version instead of 1.x 
>  . There is a migration document here: 
> [https://cwiki.apache.org/confluence/display/TIKA/Migrating+to+Tika+2.0.0]
> Tika has released 2.3.0. Changes can be found from here: 
> https://tika.apache.org/2.3.0/



--
This message was sent by Atlassian Jira
(v8.20.1#820001)