[jira] [Commented] (TIKA-3010) Tika needs service installation script

2020-02-04 Thread Hudson (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17030253#comment-17030253
 ] 

Hudson commented on TIKA-3010:
--

SUCCESS: Integrated in Jenkins build Tika-trunk #1768 (See 
[https://builds.apache.org/job/Tika-trunk/1768/])
TIKA-3010 Install and run Tika-Server as a Service (#305) (github: 
[https://github.com/apache/tika/commit/cba7149243b5d9d2df6b2bf6cbac21adef6aadec])
* (add) tika-server/bin/install_tika_service.sh
* (add) tika-server/bin/tika
* (add) tika-server/bin/tika.in.sh
* (edit) tika-server/README.md
* (add) tika-server/bin/init.d/tika
* (edit) tika-server/pom.xml
* (add) tika-server/assembly.xml


> Tika needs service installation script 
> ---
>
>                 Key: TIKA-3010
>                 URL: https://issues.apache.org/jira/browse/TIKA-3010
>             Project: Tika
>          Issue Type: Improvement
>          Components: server
>    Affects Versions: 1.23
>            Reporter: David Eric Pugh
>            Priority: Major
>             Fix For: 1.24
>
>
> With motion towards removing the tight integration of Tika into Solr, and the 
> fact that many folks deploy Tika-Server as a microservice, we should have a 
> community supported way of installing Tika.
> I'm thinking of something modeled on what Solr does: 
> https://lucene.apache.org/solr/guide/8_3/taking-solr-to-production.html#service-installation-script



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (TIKA-3010) Tika needs service installation script

2020-02-04 Thread Hudson (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17030252#comment-17030252
 ] 

Hudson commented on TIKA-3010:
--

SUCCESS: Integrated in Jenkins build tika-branch-1x #302 (See 
[https://builds.apache.org/job/tika-branch-1x/302/])
TIKA-3010 Install and run Tika-Server as a Service (#305) (tallison: 
[https://github.com/apache/tika/commit/6c62b22a19b51273009875b6a925322bea02c300])
* (add) tika-server/bin/tika.in.sh
* (edit) tika-server/pom.xml
* (add) tika-server/bin/tika
* (edit) tika-server/README.md
* (add) tika-server/bin/install_tika_service.sh
* (add) tika-server/assembly.xml
* (add) tika-server/bin/init.d/tika







[jira] [Created] (TIKA-3036) broken build: "group id is too large" on a Mac

2020-02-04 Thread Tim Allison (Jira)
Tim Allison created TIKA-3036:
-

             Summary: broken build: "group id is too large" on a Mac
                 Key: TIKA-3036
                 URL: https://issues.apache.org/jira/browse/TIKA-3036
             Project: Tika
          Issue Type: Task
            Reporter: Tim Allison
            Assignee: Tim Allison


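I recently got a failed build on a Mac with this problem: 
[https://issues.redhat.com/browse/KEYCLOAK-4563]

The fix looks straightforward. Add a {{tarLongFileMode}} configuration:
{code:xml}
<plugin>
  <groupId>org.apache.maven.plugins</groupId>
  <artifactId>maven-assembly-plugin</artifactId>
  <configuration>
    <tarLongFileMode>posix</tarLongFileMode>
  </configuration>
</plugin>
{code}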
Any objections?
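
For context, here is a minimal Python sketch of the limit that appears to be behind this error. It assumes the classic ustar tar header, which stores uid and gid in 8-byte fields holding at most seven octal digits, so an id above 2097151 cannot be written. The names USTAR_MAX_ID and fits_in_ustar are hypothetical and chosen only for illustration; this is not code from Tika or from the Maven assembly plugin.

```python
import os

# Assumption: classic ustar header, where uid/gid occupy 8-byte fields
# (seven octal digits plus a terminator), so the largest storable id
# is 0o7777777.
USTAR_MAX_ID = 0o7777777  # 2097151 in decimal


def fits_in_ustar(numeric_id):
    """Return True if a uid/gid can be stored in a classic ustar header field."""
    return 0 <= numeric_id <= USTAR_MAX_ID


# os.getgid() exists only on POSIX systems, so guard the lookup.
if hasattr(os, "getgid"):
    gid = os.getgid()
    verdict = "fits" if fits_in_ustar(gid) else "is too large"
    print(f"group id {gid} {verdict} for a classic tar header")
```

On a machine whose group id exceeds the limit, the sketch reports that the id is too large, which is the situation the posix setting is meant to work around.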





[jira] [Resolved] (TIKA-3010) Tika needs service installation script

2020-02-04 Thread Tim Allison (Jira)


 [ 
https://issues.apache.org/jira/browse/TIKA-3010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Allison resolved TIKA-3010.
---
Resolution: Fixed

Thank you, [~epugh]!

> Tika needs service installation script 
> ---
>
>                 Key: TIKA-3010
>                 URL: https://issues.apache.org/jira/browse/TIKA-3010
>             Project: Tika
>          Issue Type: Improvement
>          Components: server
>    Affects Versions: 1.23
>            Reporter: David Eric Pugh
>            Priority: Major
>
> With motion towards removing the tight integration of Tika into Solr, and the 
> fact that many folks deploy Tika-Server as a microservice, we should have a 
> community supported way of installing Tika.
> I'm thinking of something modeled on what Solr does: 
> https://lucene.apache.org/solr/guide/8_3/taking-solr-to-production.html#service-installation-script





[jira] [Updated] (TIKA-3010) Tika needs service installation script

2020-02-04 Thread Tim Allison (Jira)


 [ 
https://issues.apache.org/jira/browse/TIKA-3010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Allison updated TIKA-3010:
--
Fix Version/s: 1.24






[jira] [Commented] (TIKA-3010) Tika needs service installation script

2020-02-04 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17030196#comment-17030196
 ] 

ASF GitHub Bot commented on TIKA-3010:
--

tballison commented on pull request #305: TIKA-3010 Install and run Tika-Server 
as a Service
URL: https://github.com/apache/tika/pull/305
 
 
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org







[jira] [Commented] (TIKA-3010) Tika needs service installation script

2020-02-04 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17030197#comment-17030197
 ] 

ASF GitHub Bot commented on TIKA-3010:
--

tballison commented on issue #305: TIKA-3010 Install and run Tika-Server as a 
Service
URL: https://github.com/apache/tika/pull/305#issuecomment-582160103
 
 
   Thank you, @epugh!!!
 








[jira] [Commented] (TIKA-3010) Tika needs service installation script

2020-02-04 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17030191#comment-17030191
 ] 

ASF GitHub Bot commented on TIKA-3010:
--

epugh commented on issue #305: TIKA-3010 Install and run Tika-Server as a 
Service
URL: https://github.com/apache/tika/pull/305#issuecomment-582157359
 
 
   It’s ready for commit!!!
 








[jira] [Commented] (TIKA-3034) Detector always returns text/plain when scanning Mathematica files

2020-02-04 Thread Hudson (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17029796#comment-17029796
 ] 

Hudson commented on TIKA-3034:
--

UNSTABLE: Integrated in Jenkins build Tika-trunk #1767 (See 
[https://builds.apache.org/job/Tika-trunk/1767/])
TIKA-3034 Mathematica files don't have a unique magic, but try to detect (nick: 
[https://github.com/apache/tika/commit/f5571fa99ef6f178a16bd1bd3a3cded83c7b0013])
* (edit) tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml


> Detector always returns text/plain when scanning Mathematica files
> --
>
>                 Key: TIKA-3034
>                 URL: https://issues.apache.org/jira/browse/TIKA-3034
>             Project: Tika
>          Issue Type: Bug
>          Components: detector
>    Affects Versions: 1.23
>            Reporter: Tung Nguyen
>            Priority: Blocker
>             Fix For: 1.23
>
>
> We are using Tika to implement our MIME type detection module. The 
> library does not seem to detect Mathematica files, although the documentation 
> says it does [1]. The Tika detector always returns `text/plain` instead 
> of `application/mathematica`, contrary to the documentation and the 
> unit tests [2].
> Doing the same thing with the Python code below, we obtain the right 
> MIME type for any Mathematica file downloaded from the Wolfram Library 
> Archive [3]. 
> {code:python}
> #!/usr/bin/python3
> import mimetypes
> import sys
>
> # Path of the file to check, taken from the command line.
> test_file = sys.argv[1]
> # guess_type() returns (type, encoding); print only the type.
> print(mimetypes.MimeTypes().guess_type(test_file)[0])
> {code}
> We therefore suspect a bug in how the Tika detector guesses MIME types 
> for Mathematica files.
> References:
> [1] [https://tika.apache.org/1.23/formats.html]
> [2] 
> [https://github.com/apache/tika/blob/master/tika-core/src/test/java/org/apache/tika/TikaDetectionTest.java#L64]
> [3] [https://library.wolfram.com/infocenter/Courseware/4706/]
>  
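
A likely explanation for the difference is that Python's mimetypes module guesses from the file name alone and never reads the file. The following is a minimal sketch of that behaviour, assuming .nb as the Mathematica notebook extension and registering the mapping explicitly so the result does not depend on the host's mime.types files. guess_from_name is a hypothetical helper name used only for illustration.

```python
import mimetypes


def guess_from_name(path):
    """Return the MIME type that mimetypes infers from the file name only."""
    types = mimetypes.MimeTypes()
    # Assumption: register .nb explicitly so the result does not depend on
    # whatever mime.types files happen to be installed on the host.
    types.add_type("application/mathematica", ".nb")
    return types.guess_type(path)[0]


# The path does not need to exist: only its extension is inspected.
print(guess_from_name("/no/such/dir/notebook.nb"))
print(guess_from_name("notebook"))
```

A detector that is handed only a byte stream has no file name to inspect, so it cannot reproduce this result unless the name is supplied separately.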





[jira] [Commented] (TIKA-3034) Detector always returns text/plain when scanning Mathematica files

2020-02-04 Thread Nick Burch (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17029736#comment-17029736
 ] 

Nick Burch commented on TIKA-3034:
--

Mathematica does have a fairly unusual start-of-comment structure, so I've 
added some mime magic in {{f5571fa99}} for that, which will help if the file 
begins with a comment. It will only help for some files though, so passing the 
filename in as well, so that it can also be used, is the "best" option overall.
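
The combination described above can be sketched in Python as follows: try a content signature first, then fall back to the file name. The "(*" prefix is only an assumed stand-in for Mathematica's comment opener; the actual magic added in f5571fa99 is not shown in this thread. detect, MAGIC_PREFIX and EXTENSIONS are hypothetical names, not Tika APIs.

```python
import os

# Assumption: "(*" opens a Mathematica comment; the real Tika magic may differ.
MAGIC_PREFIX = b"(*"
# Assumption: extensions treated as Mathematica for this illustration.
EXTENSIONS = {".nb"}


def detect(head, filename=None):
    """Guess a MIME type from leading bytes, then from the file name."""
    if head.lstrip().startswith(MAGIC_PREFIX):
        return "application/mathematica"
    if filename is not None:
        ext = os.path.splitext(filename)[1].lower()
        if ext in EXTENSIONS:
            return "application/mathematica"
    # Neither hint matched: fall back to the generic text type.
    return "text/plain"


print(detect(b"(* a leading comment *)\nx = 1"))
print(detect(b"x = 1", "example.nb"))
print(detect(b"x = 1"))
```

Other languages also open comments with the same two characters, so a signature like this can misfire, which is one reason the file name remains the stronger hint.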



