[jira] [Commented] (TIKA-3010) Tika needs service installation script
[ https://issues.apache.org/jira/browse/TIKA-3010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17030253#comment-17030253 ]

Hudson commented on TIKA-3010:
------------------------------

SUCCESS: Integrated in Jenkins build Tika-trunk #1768 (See [https://builds.apache.org/job/Tika-trunk/1768/])
TIKA-3010 Install and run Tika-Server as a Service (#305) (github: [https://github.com/apache/tika/commit/cba7149243b5d9d2df6b2bf6cbac21adef6aadec])
* (add) tika-server/bin/install_tika_service.sh
* (add) tika-server/bin/tika
* (add) tika-server/bin/tika.in.sh
* (edit) tika-server/README.md
* (add) tika-server/bin/init.d/tika
* (edit) tika-server/pom.xml
* (add) tika-server/assembly.xml

> Tika needs service installation script
> --------------------------------------
>
>                 Key: TIKA-3010
>                 URL: https://issues.apache.org/jira/browse/TIKA-3010
>             Project: Tika
>          Issue Type: Improvement
>          Components: server
>    Affects Versions: 1.23
>            Reporter: David Eric Pugh
>            Priority: Major
>             Fix For: 1.24
>
>
> With motion towards removing the tight integration of Tika into Solr, and the
> fact that many folks deploy Tika-Server as a microservice, we should have a
> community supported way of installing Tika.
> I'm thinking of something modeled on what Solr does:
> https://lucene.apache.org/solr/guide/8_3/taking-solr-to-production.html#service-installation-script

--
This message was sent by Atlassian Jira
(v8.3.4#803005)
[jira] [Commented] (TIKA-3010) Tika needs service installation script
[ https://issues.apache.org/jira/browse/TIKA-3010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17030252#comment-17030252 ]

Hudson commented on TIKA-3010:
------------------------------

SUCCESS: Integrated in Jenkins build tika-branch-1x #302 (See [https://builds.apache.org/job/tika-branch-1x/302/])
TIKA-3010 Install and run Tika-Server as a Service (#305) (tallison: [https://github.com/apache/tika/commit/6c62b22a19b51273009875b6a925322bea02c300])
* (add) tika-server/bin/tika.in.sh
* (edit) tika-server/pom.xml
* (add) tika-server/bin/tika
* (edit) tika-server/README.md
* (add) tika-server/bin/install_tika_service.sh
* (add) tika-server/assembly.xml
* (add) tika-server/bin/init.d/tika
[jira] [Created] (TIKA-3036) broken build: "group id is too large" on a Mac
Tim Allison created TIKA-3036:
------------------------------

             Summary: broken build: "group id is too large" on a Mac
                 Key: TIKA-3036
                 URL: https://issues.apache.org/jira/browse/TIKA-3036
             Project: Tika
          Issue Type: Task
            Reporter: Tim Allison
            Assignee: Tim Allison

I recently got a failed build on a Mac with this problem: [https://issues.redhat.com/browse/KEYCLOAK-4563]

The fix looks straightforward: add a {{tarLongFileMode}} configuration to the assembly plugin:
{code:xml}
<plugin>
  <groupId>org.apache.maven.plugins</groupId>
  <artifactId>maven-assembly-plugin</artifactId>
  <configuration>
    <tarLongFileMode>posix</tarLongFileMode>
  </configuration>
</plugin>
{code}
Any objections?
[jira] [Resolved] (TIKA-3010) Tika needs service installation script
[ https://issues.apache.org/jira/browse/TIKA-3010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Tim Allison resolved TIKA-3010.
-------------------------------
    Resolution: Fixed

Thank you, [~epugh]!
[jira] [Updated] (TIKA-3010) Tika needs service installation script
[ https://issues.apache.org/jira/browse/TIKA-3010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Tim Allison updated TIKA-3010:
------------------------------
    Fix Version/s: 1.24
[jira] [Commented] (TIKA-3010) Tika needs service installation script
[ https://issues.apache.org/jira/browse/TIKA-3010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17030196#comment-17030196 ]

ASF GitHub Bot commented on TIKA-3010:
--------------------------------------

tballison commented on pull request #305: TIKA-3010 Install and run Tika-Server as a Service
URL: https://github.com/apache/tika/pull/305

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org
[jira] [Commented] (TIKA-3010) Tika needs service installation script
[ https://issues.apache.org/jira/browse/TIKA-3010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17030197#comment-17030197 ]

ASF GitHub Bot commented on TIKA-3010:
--------------------------------------

tballison commented on issue #305: TIKA-3010 Install and run Tika-Server as a Service
URL: https://github.com/apache/tika/pull/305#issuecomment-582160103

Thank you, @epugh!!!
[jira] [Commented] (TIKA-3010) Tika needs service installation script
[ https://issues.apache.org/jira/browse/TIKA-3010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17030191#comment-17030191 ]

ASF GitHub Bot commented on TIKA-3010:
--------------------------------------

epugh commented on issue #305: TIKA-3010 Install and run Tika-Server as a Service
URL: https://github.com/apache/tika/pull/305#issuecomment-582157359

It’s ready for commit!!!
[jira] [Commented] (TIKA-3034) Detector always returns text/plain when scanning Mathematica files
[ https://issues.apache.org/jira/browse/TIKA-3034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17029796#comment-17029796 ]

Hudson commented on TIKA-3034:
------------------------------

UNSTABLE: Integrated in Jenkins build Tika-trunk #1767 (See [https://builds.apache.org/job/Tika-trunk/1767/])
TIKA-3034 Mathematica files don't have a unique magic, but try to detect (nick: [https://github.com/apache/tika/commit/f5571fa99ef6f178a16bd1bd3a3cded83c7b0013])
* (edit) tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml

> Detector always returns text/plain when scanning Mathematica files
> ------------------------------------------------------------------
>
>                 Key: TIKA-3034
>                 URL: https://issues.apache.org/jira/browse/TIKA-3034
>             Project: Tika
>          Issue Type: Bug
>          Components: detector
>    Affects Versions: 1.23
>            Reporter: Tung Nguyen
>            Priority: Blocker
>             Fix For: 1.23
>
>
> We are working with Tika to implement our MIME type detection module. The
> library seemingly cannot detect Mathematica files, although the documentation
> says it can [1]. The Tika detector always returns `text/plain` instead
> of `application/mathematica`, which is what the documentation and the
> unit tests [2] describe.
> Doing the same with the Python code below, we obtain the right
> MIME type for any Mathematica file downloaded from the Wolfram Library
> Archive [3].
> {code:python}
> #!/usr/bin/python3
> import mimetypes
> import sys
> test_file = sys.argv[1]
> # guess_type() infers the type from the file name alone; it does not read the file
> print(mimetypes.MimeTypes().guess_type(test_file)[0])
> {code}
> Therefore, we suspect there is a bug in the Tika detector where it tries to
> guess MIME types for Mathematica files.
> References:
> [1] [https://tika.apache.org/1.23/formats.html]
> [2] [https://github.com/apache/tika/blob/master/tika-core/src/test/java/org/apache/tika/TikaDetectionTest.java#L64]
> [3] [https://library.wolfram.com/infocenter/Courseware/4706/]
[jira] [Commented] (TIKA-3034) Detector always returns text/plain when scanning Mathematica files
[ https://issues.apache.org/jira/browse/TIKA-3034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17029736#comment-17029736 ]

Nick Burch commented on TIKA-3034:
----------------------------------

Mathematica does have a fairly unusual start-of-comment structure, so I've added some mime magic in {{f5571fa99}} for that which would help if the file began with a comment. Will only help for some files though, so passing the filename in such that it can be used too is the "best" option overall
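
The trade-off in the comment above (content magic only helps when the file begins with a comment, while the file name helps whenever it is available) can be sketched in a few lines of Python. This is an illustration only, not Tika's implementation. The `(*` prefix check is an assumed stand-in for the magic added in f5571fa99, whose exact pattern is not shown in this thread. The explicit `.nb` registration is an assumption made so the result does not depend on the host's mime.types file. `sniff_magic` and `detect` are hypothetical helper names.

```python
import mimetypes

MATHEMATICA = "application/mathematica"

# Assumption: register the extension explicitly so the lookup does not
# depend on whatever mime.types database the host happens to have.
_types = mimetypes.MimeTypes()
_types.add_type(MATHEMATICA, ".nb")


def sniff_magic(head: bytes):
    """Content-only check: Wolfram Language comments open with '(*'.

    Hypothetical stand-in for the real magic; it can only succeed when
    the file happens to begin with a comment.
    """
    if head.lstrip().startswith(b"(*"):
        return MATHEMATICA
    return None


def detect(head: bytes, filename=None):
    """Try content magic, then the file name, then fall back to text/plain."""
    by_magic = sniff_magic(head)
    if by_magic:
        return by_magic
    if filename:
        by_name = _types.guess_type(filename)[0]
        if by_name:
            return by_name
    return "text/plain"


if __name__ == "__main__":
    # Starts with a comment: the magic alone is enough.
    print(detect(b"(* a leading comment *)\nx = 1"))
    # No leading comment and no name: indistinguishable from plain text.
    print(detect(b"x = 1"))
    # Same bytes, but the file name is passed in as well.
    print(detect(b"x = 1", "lecture.nb"))
```

The second and third calls use identical bytes; only the file name changes the result, which is why a name-based lookup like the reporter's Python script succeeds where content-only detection falls back to `text/plain`.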