Re: [VOTE] Release Apache Tika 1.13 Candidate #1

Konstantin Gribov Fri, 13 May 2016 08:07:34 -0700

All signatures are correct, builds fine on Linux with openjdk8. Tests pass
w/ tesseract OCR enabled.


RC1 doesn't contain fix for TIKA-1938[1]. We should either roll another RC
or mark it as fix for 1.14. Currently I've marked it as fix for 1.14.

[x] +1 Release this package as Apache Tika 1.13

[1]: https://issues.apache.org/jira/browse/TIKA-1938

чт, 12 мая 2016 г. в 1:14, Lewis John Mcgibbney <lewis.mcgibb...@gmail.com>:

> Hi David,
> Good job on the RC
> The .zip artifact contains 2015 in NOTICE
> Everything else looks great
> All Signatures good.
> Tests pass on MacOSX, Java 1.7
>
> [X] +1 Release this package as Apache Tika 1.13
>
> On Wed, May 11, 2016 at 6:50 AM, <dev-digest-h...@tika.apache.org> wrote:
>
> >
> > From: David Meikle <dmei...@apache.org>
> > To: dev@tika.apache.org, u...@tika.apache.org
> > Cc:
> > Date: Mon, 9 May 2016 20:34:32 +0100
> > Subject: [VOTE] Release Apache Tika 1.13 Candidate #1
> > A candidate for the Tika 1.13 release is available at:
> >   https://dist.apache.org/repos/dist/dev/tika/
> >
>
>
>
> >
> > The release candidate is a zip archive of the sources in:
> >
> >
> https://git-wip-us.apache.org/repos/asf?p=tika.git;a=tag;h=18fa8213438183a249df4f52535031670f0a3eef
> >
> > The SHA1 checksum of the archive is
> >   8a591e7ea29dca14d5f25b44b3a2a35425676c64.
> >
>
> > In addition, a staged maven repository is available here:
> >
> >
> https://repository.apache.org/content/repositories/orgapachetika-1019/org/apache/tika
> >
> > Please vote on releasing this package as Apache Tika 1.13.
> > The vote is open for the next 72 hours and passes if a majority of at
> > least three +1 Tika PMC votes are cast.
> >
> > [ ] +1 Release this package as Apache Tika 1.13
> > [ ] -1 Do not release this package because…
> >
> > Here is my +1 for the release.
> >
> > Cheers,
> > Dave
> >
> > P.S. For anyone looking to test using the Apache Tika Server I have put
> up
> > a branch that pulls down the RC at
> > https://github.com/LogicalSpark/docker-tikaserver/tree/1.13rc1
> >
> >
> > ---------- Forwarded message ----------
> > From: "Allison, Timothy B." <talli...@mitre.org>
> > To: "dev@tika.apache.org" <dev@tika.apache.org>, "u...@tika.apache.org"
> <
> > u...@tika.apache.org>
> > Cc:
> > Date: Wed, 11 May 2016 13:50:26 +0000
> > Subject: RE: [VOTE] Release Apache Tika 1.13 Candidate #1
> > +1
> >
> > Built on Windows and Linux.  I'm relying on earlier pre-release tests for
> > no surprises. :)
> >
> > Thank you, Dave!
> >
> > -----Original Message-----
> > From: David Meikle [mailto:loo...@gmail.com] On Behalf Of David Meikle
> > Sent: Monday, May 9, 2016 3:35 PM
> > To: dev@tika.apache.org; u...@tika.apache.org
> > Subject: [VOTE] Release Apache Tika 1.13 Candidate #1
> >
> > A candidate for the Tika 1.13 release is available at:
> >   https://dist.apache.org/repos/dist/dev/tika/
> >
> > The release candidate is a zip archive of the sources in:
> >
> >
> https://git-wip-us.apache.org/repos/asf?p=tika.git;a=tag;h=18fa8213438183a249df4f52535031670f0a3eef
> >
> > The SHA1 checksum of the archive is
> >   8a591e7ea29dca14d5f25b44b3a2a35425676c64.
> >
> > In addition, a staged maven repository is available here:
> >
> >
> https://repository.apache.org/content/repositories/orgapachetika-1019/org/apache/tika
> >
> > Please vote on releasing this package as Apache Tika 1.13.
> > The vote is open for the next 72 hours and passes if a majority of at
> > least three +1 Tika PMC votes are cast.
> >
> > [ ] +1 Release this package as Apache Tika 1.13 [ ] -1 Do not release
> this
> > package because…
> >
> > Here is my +1 for the release.
> >
> > Cheers,
> > Dave
> >
> > P.S. For anyone looking to test using the Apache Tika Server I have put
> up
> > a branch that pulls down the RC at
> > https://github.com/LogicalSpark/docker-tikaserver/tree/1.13rc1
> >
> >
> > ---------- Forwarded message ----------
> > From: "Zoltan Toth (JIRA)" <j...@apache.org>
> > To: dev@tika.apache.org
> > Cc:
> > Date: Tue, 10 May 2016 00:56:12 +0000 (UTC)
> > Subject: [jira] [Commented] (TIKA-1817) Extracts entire file content for
> > ASCII DXF files
> >
> >     [
> >
> https://issues.apache.org/jira/browse/TIKA-1817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15277426#comment-15277426
> > ]
> >
> > Zoltan Toth commented on TIKA-1817:
> > -----------------------------------
> >
> > Hi there.  I can see that some great progress has been made on this
> > issue.  Many thanks!  Could someone please let me know when the fix for
> > this issue is likely to be released to production?
> >
> > > Extracts entire file content for ASCII DXF files
> > > ------------------------------------------------
> > >
> > >                 Key: TIKA-1817
> > >                 URL: https://issues.apache.org/jira/browse/TIKA-1817
> > >             Project: Tika
> > >          Issue Type: Bug
> > >    Affects Versions: 1.11
> > >            Reporter: Zoltan Toth
> > >         Attachments: SMA-Controller.dxf, house design.dxf,
> > jcsample-screendump.jpg, jcsample.dxf
> > >
> > >
> > > By definition, ASCII DXF files are encoded in plain text.  However. the
> > vast majority of their content is not intended to be human readable (see
> > https://en.wikipedia.org/wiki/AutoCAD_DXF).  Unfortunately for these
> > files, Tika simply "extracts" the entire content of the file instead of
> the
> > human-readable portions (i.e. comments etc.) that a CAD tool would
> render.
> > This results in massive amounts of rubbish data being returned with dire
> > consequences for applications that rely on this.
> > > It would be nice if only the human-readable text fields were extracted.
> > Failing this, it would still be nice if no text was extracted from these
> > files at all.
> >
> >
> >
> > --
> > This message was sent by Atlassian JIRA
> > (v6.3.4#6332)
> >
> >
> >
> > ---------- Forwarded message ----------
> > From: "Joseph Naegele (JIRA)" <j...@apache.org>
> > To: dev@tika.apache.org
> > Cc:
> > Date: Tue, 10 May 2016 13:52:13 +0000 (UTC)
> > Subject: [jira] [Commented] (TIKA-1938) HtmlParser drops <script>
> elements
> > found inside <head>
> >
> >     [
> >
> https://issues.apache.org/jira/browse/TIKA-1938?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15278119#comment-15278119
> > ]
> >
> > Joseph Naegele commented on TIKA-1938:
> > --------------------------------------
> >
> > This is a very simple PR if anyone is able to review and merge. I had
> > hoped to get it into 1.13.
> >
> > > HtmlParser drops <script> elements found inside <head>
> > > ------------------------------------------------------
> > >
> > >                 Key: TIKA-1938
> > >                 URL: https://issues.apache.org/jira/browse/TIKA-1938
> > >             Project: Tika
> > >          Issue Type: Bug
> > >          Components: parser
> > >    Affects Versions: 1.12
> > >            Reporter: Joseph Naegele
> > >
> > > HtmlParser's HtmlHandler does not check for "SCRIPT" in
> {{startElement}}
> > when parsing <head> (i.e. {{bodylevel == 0 && discardLevel ==0}}). This
> > causes <script> elements found within <head> to be dropped altogether.
> They
> > should be treated in the same manner as "LINK" elements.
> > > Here is a sample test case that demonstrates the problem and can be run
> > from within {{HtmlParserTest.java}}, although it could be generalized to
> > check for <a>, <link> and <img> links by using the
> {{LinkContentHandler}}.
> > > {code}
> > > @Test
> > > public void testScriptSrc() throws Exception {
> > >     String url = "http://domain.com/logic.js";;
> > >     String scriptInBody =
> > >             "<html><body><script src=\"" + url +
> > "\"></script></body></html>";
> > >     String scriptInHead =
> > >             "<html><head><script src=\"" + url +
> > "\"></script></head></html>";
> > >     assertScriptLink(scriptInBody, url);
> > >     assertScriptLink(scriptInHead, url);
> > > }
> > > private void assertScriptLink(String html, String url) throws
> Exception {
> > >     // IdentityHtmlMapper needed to extract <script> tags
> > >     ParseContext context = new ParseContext();
> > >     context.set(HtmlMapper.class, IdentityHtmlMapper.INSTANCE);
> > >     Metadata metadata = new Metadata();
> > >     metadata.set(Metadata.CONTENT_TYPE, "text/html");
> > >     final List<String> links = new ArrayList<String>();
> > >     new HtmlParser().parse(
> > >             new ByteArrayInputStream(html.getBytes(UTF_8)),
> > >             new DefaultHandler() {
> > >                 @Override
> > >                 public void startElement(
> > >                         String u, String l, String name, Attributes
> > atts) {
> > >                     if (name.equals("script") && atts.getValue("",
> > "src") != null) {
> > >                         links.add(atts.getValue("", "src"));
> > >                     }
> > >                 }
> > >             },
> > >             metadata,
> > >             context);
> > >     assertEquals(1, links.size());
> > >     assertEquals(url, links.get(0));
> > > }
> > > {code}
> >
> >
> >
> > --
> > This message was sent by Atlassian JIRA
> > (v6.3.4#6332)
> >
> >
> >
> > ---------- Forwarded message ----------
> > From: "ASF GitHub Bot (JIRA)" <j...@apache.org>
> > To: dev@tika.apache.org
> > Cc:
> > Date: Tue, 10 May 2016 15:37:13 +0000 (UTC)
> > Subject: [jira] [Commented] (TIKA-1938) HtmlParser drops <script>
> elements
> > found inside <head>
> >
> >     [
> >
> https://issues.apache.org/jira/browse/TIKA-1938?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15278290#comment-15278290
> > ]
> >
> > ASF GitHub Bot commented on TIKA-1938:
> > --------------------------------------
> >
> > Github user asfgit closed the pull request at:
> >
> >     https://github.com/apache/tika/pull/111
> >
> >
> > > HtmlParser drops <script> elements found inside <head>
> > > ------------------------------------------------------
> > >
> > >                 Key: TIKA-1938
> > >                 URL: https://issues.apache.org/jira/browse/TIKA-1938
> > >             Project: Tika
> > >          Issue Type: Bug
> > >          Components: parser
> > >    Affects Versions: 1.12
> > >            Reporter: Joseph Naegele
> > >            Assignee: Ken Krugler
> > >
> > > HtmlParser's HtmlHandler does not check for "SCRIPT" in
> {{startElement}}
> > when parsing <head> (i.e. {{bodylevel == 0 && discardLevel ==0}}). This
> > causes <script> elements found within <head> to be dropped altogether.
> They
> > should be treated in the same manner as "LINK" elements.
> > > Here is a sample test case that demonstrates the problem and can be run
> > from within {{HtmlParserTest.java}}, although it could be generalized to
> > check for <a>, <link> and <img> links by using the
> {{LinkContentHandler}}.
> > > {code}
> > > @Test
> > > public void testScriptSrc() throws Exception {
> > >     String url = "http://domain.com/logic.js";;
> > >     String scriptInBody =
> > >             "<html><body><script src=\"" + url +
> > "\"></script></body></html>";
> > >     String scriptInHead =
> > >             "<html><head><script src=\"" + url +
> > "\"></script></head></html>";
> > >     assertScriptLink(scriptInBody, url);
> > >     assertScriptLink(scriptInHead, url);
> > > }
> > > private void assertScriptLink(String html, String url) throws
> Exception {
> > >     // IdentityHtmlMapper needed to extract <script> tags
> > >     ParseContext context = new ParseContext();
> > >     context.set(HtmlMapper.class, IdentityHtmlMapper.INSTANCE);
> > >     Metadata metadata = new Metadata();
> > >     metadata.set(Metadata.CONTENT_TYPE, "text/html");
> > >     final List<String> links = new ArrayList<String>();
> > >     new HtmlParser().parse(
> > >             new ByteArrayInputStream(html.getBytes(UTF_8)),
> > >             new DefaultHandler() {
> > >                 @Override
> > >                 public void startElement(
> > >                         String u, String l, String name, Attributes
> > atts) {
> > >                     if (name.equals("script") && atts.getValue("",
> > "src") != null) {
> > >                         links.add(atts.getValue("", "src"));
> > >                     }
> > >                 }
> > >             },
> > >             metadata,
> > >             context);
> > >     assertEquals(1, links.size());
> > >     assertEquals(url, links.get(0));
> > > }
> > > {code}
> >
> >
> >
> > --
> > This message was sent by Atlassian JIRA
> > (v6.3.4#6332)
> >
> >
> >
> > ---------- Forwarded message ----------
> > From: "Hudson (JIRA)" <j...@apache.org>
> > To: dev@tika.apache.org
> > Cc:
> > Date: Tue, 10 May 2016 17:00:16 +0000 (UTC)
> > Subject: [jira] [Commented] (TIKA-1938) HtmlParser drops <script>
> elements
> > found inside <head>
> >
> >     [
> >
> https://issues.apache.org/jira/browse/TIKA-1938?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15278455#comment-15278455
> > ]
> >
> > Hudson commented on TIKA-1938:
> > ------------------------------
> >
> > SUCCESS: Integrated in tika-trunk-jdk1.7 #988 (See [
> > https://builds.apache.org/job/tika-trunk-jdk1.7/988/])
> > fix for TIKA-1938 contributed by naegelejd (jnaegele: rev
> > b6d23c189e852fa2e41b441c18bfe3e66e3f67c4)
> > *
> >
> tika-parsers/src/test/java/org/apache/tika/parser/html/HtmlParserTest.java
> > * tika-parsers/src/main/java/org/apache/tika/parser/html/HtmlHandler.java
> >
> >
> > > HtmlParser drops <script> elements found inside <head>
> > > ------------------------------------------------------
> > >
> > >                 Key: TIKA-1938
> > >                 URL: https://issues.apache.org/jira/browse/TIKA-1938
> > >             Project: Tika
> > >          Issue Type: Bug
> > >          Components: parser
> > >    Affects Versions: 1.12
> > >            Reporter: Joseph Naegele
> > >            Assignee: Ken Krugler
> > >
> > > HtmlParser's HtmlHandler does not check for "SCRIPT" in
> {{startElement}}
> > when parsing <head> (i.e. {{bodylevel == 0 && discardLevel ==0}}). This
> > causes <script> elements found within <head> to be dropped altogether.
> They
> > should be treated in the same manner as "LINK" elements.
> > > Here is a sample test case that demonstrates the problem and can be run
> > from within {{HtmlParserTest.java}}, although it could be generalized to
> > check for <a>, <link> and <img> links by using the
> {{LinkContentHandler}}.
> > > {code}
> > > @Test
> > > public void testScriptSrc() throws Exception {
> > >     String url = "http://domain.com/logic.js";;
> > >     String scriptInBody =
> > >             "<html><body><script src=\"" + url +
> > "\"></script></body></html>";
> > >     String scriptInHead =
> > >             "<html><head><script src=\"" + url +
> > "\"></script></head></html>";
> > >     assertScriptLink(scriptInBody, url);
> > >     assertScriptLink(scriptInHead, url);
> > > }
> > > private void assertScriptLink(String html, String url) throws
> Exception {
> > >     // IdentityHtmlMapper needed to extract <script> tags
> > >     ParseContext context = new ParseContext();
> > >     context.set(HtmlMapper.class, IdentityHtmlMapper.INSTANCE);
> > >     Metadata metadata = new Metadata();
> > >     metadata.set(Metadata.CONTENT_TYPE, "text/html");
> > >     final List<String> links = new ArrayList<String>();
> > >     new HtmlParser().parse(
> > >             new ByteArrayInputStream(html.getBytes(UTF_8)),
> > >             new DefaultHandler() {
> > >                 @Override
> > >                 public void startElement(
> > >                         String u, String l, String name, Attributes
> > atts) {
> > >                     if (name.equals("script") && atts.getValue("",
> > "src") != null) {
> > >                         links.add(atts.getValue("", "src"));
> > >                     }
> > >                 }
> > >             },
> > >             metadata,
> > >             context);
> > >     assertEquals(1, links.size());
> > >     assertEquals(url, links.get(0));
> > > }
> > > {code}
> >
> >
> >
> > --
> > This message was sent by Atlassian JIRA
> > (v6.3.4#6332)
> >
> >
> >
> > ---------- Forwarded message ----------
> > From: "Ken Krugler (JIRA)" <j...@apache.org>
> > To: dev@tika.apache.org
> > Cc:
> > Date: Tue, 10 May 2016 15:15:13 +0000 (UTC)
> > Subject: [jira] [Assigned] (TIKA-1938) HtmlParser drops <script> elements
> > found inside <head>
> >
> >      [
> >
> https://issues.apache.org/jira/browse/TIKA-1938?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
> > ]
> >
> > Ken Krugler reassigned TIKA-1938:
> > ---------------------------------
> >
> >     Assignee: Ken Krugler
> >
> > > HtmlParser drops <script> elements found inside <head>
> > > ------------------------------------------------------
> > >
> > >                 Key: TIKA-1938
> > >                 URL: https://issues.apache.org/jira/browse/TIKA-1938
> > >             Project: Tika
> > >          Issue Type: Bug
> > >          Components: parser
> > >    Affects Versions: 1.12
> > >            Reporter: Joseph Naegele
> > >            Assignee: Ken Krugler
> > >
> > > HtmlParser's HtmlHandler does not check for "SCRIPT" in
> {{startElement}}
> > when parsing <head> (i.e. {{bodylevel == 0 && discardLevel ==0}}). This
> > causes <script> elements found within <head> to be dropped altogether.
> They
> > should be treated in the same manner as "LINK" elements.
> > > Here is a sample test case that demonstrates the problem and can be run
> > from within {{HtmlParserTest.java}}, although it could be generalized to
> > check for <a>, <link> and <img> links by using the
> {{LinkContentHandler}}.
> > > {code}
> > > @Test
> > > public void testScriptSrc() throws Exception {
> > >     String url = "http://domain.com/logic.js";;
> > >     String scriptInBody =
> > >             "<html><body><script src=\"" + url +
> > "\"></script></body></html>";
> > >     String scriptInHead =
> > >             "<html><head><script src=\"" + url +
> > "\"></script></head></html>";
> > >     assertScriptLink(scriptInBody, url);
> > >     assertScriptLink(scriptInHead, url);
> > > }
> > > private void assertScriptLink(String html, String url) throws
> Exception {
> > >     // IdentityHtmlMapper needed to extract <script> tags
> > >     ParseContext context = new ParseContext();
> > >     context.set(HtmlMapper.class, IdentityHtmlMapper.INSTANCE);
> > >     Metadata metadata = new Metadata();
> > >     metadata.set(Metadata.CONTENT_TYPE, "text/html");
> > >     final List<String> links = new ArrayList<String>();
> > >     new HtmlParser().parse(
> > >             new ByteArrayInputStream(html.getBytes(UTF_8)),
> > >             new DefaultHandler() {
> > >                 @Override
> > >                 public void startElement(
> > >                         String u, String l, String name, Attributes
> > atts) {
> > >                     if (name.equals("script") && atts.getValue("",
> > "src") != null) {
> > >                         links.add(atts.getValue("", "src"));
> > >                     }
> > >                 }
> > >             },
> > >             metadata,
> > >             context);
> > >     assertEquals(1, links.size());
> > >     assertEquals(url, links.get(0));
> > > }
> > > {code}
> >
> >
> >
> > --
> > This message was sent by Atlassian JIRA
> > (v6.3.4#6332)
> >
> >
> >
> > ---------- Forwarded message ----------
> > From: asfgit <g...@git.apache.org>
> > To: dev@tika.apache.org
> > Cc:
> > Date: Tue, 10 May 2016 15:36:53 +0000 (UTC)
> > Subject: [GitHub] tika pull request: fix for TIKA-1938 contributed by
> > naegelejd
> > Github user asfgit closed the pull request at:
> >
> >     https://github.com/apache/tika/pull/111
> >
> >
> > ---
> > If your project is set up for it, you can reply to this email and have
> your
> > reply appear on GitHub as well. If your project does not have this
> feature
> > enabled and wishes so, or if the feature is enabled but not working,
> please
> > contact infrastructure at infrastruct...@apache.org or file a JIRA
> ticket
> > with INFRA.
> > ---
> >
> >
> >
> > ---------- Forwarded message ----------
> > From: "Ken Krugler (JIRA)" <j...@apache.org>
> > To: dev@tika.apache.org
> > Cc:
> > Date: Tue, 10 May 2016 15:38:12 +0000 (UTC)
> > Subject: [jira] [Closed] (TIKA-1938) HtmlParser drops <script> elements
> > found inside <head>
> >
> >      [
> >
> https://issues.apache.org/jira/browse/TIKA-1938?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
> > ]
> >
> > Ken Krugler closed TIKA-1938.
> > -----------------------------
> >     Resolution: Fixed
> >
> > Fix with commit da5bbbe..46d5775.
> >
> > Thanks Joseph!
> >
> > > HtmlParser drops <script> elements found inside <head>
> > > ------------------------------------------------------
> > >
> > >                 Key: TIKA-1938
> > >                 URL: https://issues.apache.org/jira/browse/TIKA-1938
> > >             Project: Tika
> > >          Issue Type: Bug
> > >          Components: parser
> > >    Affects Versions: 1.12
> > >            Reporter: Joseph Naegele
> > >            Assignee: Ken Krugler
> > >
> > > HtmlParser's HtmlHandler does not check for "SCRIPT" in
> {{startElement}}
> > when parsing <head> (i.e. {{bodylevel == 0 && discardLevel ==0}}). This
> > causes <script> elements found within <head> to be dropped altogether.
> They
> > should be treated in the same manner as "LINK" elements.
> > > Here is a sample test case that demonstrates the problem and can be run
> > from within {{HtmlParserTest.java}}, although it could be generalized to
> > check for <a>, <link> and <img> links by using the
> {{LinkContentHandler}}.
> > > {code}
> > > @Test
> > > public void testScriptSrc() throws Exception {
> > >     String url = "http://domain.com/logic.js";;
> > >     String scriptInBody =
> > >             "<html><body><script src=\"" + url +
> > "\"></script></body></html>";
> > >     String scriptInHead =
> > >             "<html><head><script src=\"" + url +
> > "\"></script></head></html>";
> > >     assertScriptLink(scriptInBody, url);
> > >     assertScriptLink(scriptInHead, url);
> > > }
> > > private void assertScriptLink(String html, String url) throws
> Exception {
> > >     // IdentityHtmlMapper needed to extract <script> tags
> > >     ParseContext context = new ParseContext();
> > >     context.set(HtmlMapper.class, IdentityHtmlMapper.INSTANCE);
> > >     Metadata metadata = new Metadata();
> > >     metadata.set(Metadata.CONTENT_TYPE, "text/html");
> > >     final List<String> links = new ArrayList<String>();
> > >     new HtmlParser().parse(
> > >             new ByteArrayInputStream(html.getBytes(UTF_8)),
> > >             new DefaultHandler() {
> > >                 @Override
> > >                 public void startElement(
> > >                         String u, String l, String name, Attributes
> > atts) {
> > >                     if (name.equals("script") && atts.getValue("",
> > "src") != null) {
> > >                         links.add(atts.getValue("", "src"));
> > >                     }
> > >                 }
> > >             },
> > >             metadata,
> > >             context);
> > >     assertEquals(1, links.size());
> > >     assertEquals(url, links.get(0));
> > > }
> > > {code}
> >
> >
> >
> > --
> > This message was sent by Atlassian JIRA
> > (v6.3.4#6332)
> >
> >
> >
> > ---------- Forwarded message ----------
> > From: "Alin Turbut (JIRA)" <j...@apache.org>
> > To: dev@tika.apache.org
> > Cc:
> > Date: Wed, 11 May 2016 08:38:12 +0000 (UTC)
> > Subject: [jira] [Created] (TIKA-1969) The filename or extension is too
> long
> > Alin Turbut created TIKA-1969:
> > ---------------------------------
> >
> >              Summary: The filename or extension is too long
> >                  Key: TIKA-1969
> >                  URL: https://issues.apache.org/jira/browse/TIKA-1969
> >              Project: Tika
> >           Issue Type: Bug
> >     Affects Versions: 1.12, 1.10
> >             Reporter: Alin Turbut
> >
> >
> > After I add the Tika dependency to my project, I receive this error:
> > {code}
> > Caused by: net.rubygrapefruit.platform.NativeException: Could not start
> > 'C:\Program Files\Java\jdk1.8.0_91\bin\java.exe'
> >         at
> >
> net.rubygrapefruit.platform.internal.DefaultProcessLauncher.start(DefaultProcessLauncher.java:27)
> >         at
> >
> net.rubygrapefruit.platform.internal.WindowsProcessLauncher.start(WindowsProcessLauncher.java:22)
> >         at
> >
> net.rubygrapefruit.platform.internal.WrapperProcessLauncher.start(WrapperProcessLauncher.java:36)
> >         at
> >
> org.gradle.process.internal.ExecHandleRunner.run(ExecHandleRunner.java:68)
> >         ... 2 more
> > Caused by: java.io.IOException: Cannot run program "C:\Program
> > Files\Java\jdk1.8.0_91\bin\java.exe" (in directory
> > "C:\Users\aturbut\Projects\vcrawl"): Crea
> > eProcess error=206, The filename or extension is too long
> > {code}
> >
> >
> >
> > --
> > This message was sent by Atlassian JIRA
> > (v6.3.4#6332)
> >
> >
> >
> > ---------- Forwarded message ----------
> > From: "Alin Turbut (JIRA)" <j...@apache.org>
> > To: dev@tika.apache.org
> > Cc:
> > Date: Wed, 11 May 2016 08:38:13 +0000 (UTC)
> > Subject: [jira] [Updated] (TIKA-1969) The filename or extension is too
> long
> >
> >      [
> >
> https://issues.apache.org/jira/browse/TIKA-1969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
> > ]
> >
> > Alin Turbut updated TIKA-1969:
> > ------------------------------
> >     Description:
> > After I add the Tika dependency to my project, I receive this error:
> > {code}
> > Caused by: net.rubygrapefruit.platform.NativeException: Could not start
> > 'C:\Program Files\Java\jdk1.8.0_91\bin\java.exe'
> >         at
> >
> net.rubygrapefruit.platform.internal.DefaultProcessLauncher.start(DefaultProcessLauncher.java:27)
> >         at
> >
> net.rubygrapefruit.platform.internal.WindowsProcessLauncher.start(WindowsProcessLauncher.java:22)
> >         at
> >
> net.rubygrapefruit.platform.internal.WrapperProcessLauncher.start(WrapperProcessLauncher.java:36)
> >         at
> >
> org.gradle.process.internal.ExecHandleRunner.run(ExecHandleRunner.java:68)
> >         ... 2 more
> > Caused by: java.io.IOException: Cannot run program "C:\Program
> > Files\Java\jdk1.8.0_91\bin\java.exe" (in directory
> > "C:\Users\aturbut\Projects\vcrawl"): Crea
> > eProcess error=206, The filename or extension is too long
> > {code}
> >
> > Building&Running my project with Gradle, Spring Boot, if any of this
> > information helps.
> >
> >   was:
> > After I add the Tika dependency to my project, I receive this error:
> > {code}
> > Caused by: net.rubygrapefruit.platform.NativeException: Could not start
> > 'C:\Program Files\Java\jdk1.8.0_91\bin\java.exe'
> >         at
> >
> net.rubygrapefruit.platform.internal.DefaultProcessLauncher.start(DefaultProcessLauncher.java:27)
> >         at
> >
> net.rubygrapefruit.platform.internal.WindowsProcessLauncher.start(WindowsProcessLauncher.java:22)
> >         at
> >
> net.rubygrapefruit.platform.internal.WrapperProcessLauncher.start(WrapperProcessLauncher.java:36)
> >         at
> >
> org.gradle.process.internal.ExecHandleRunner.run(ExecHandleRunner.java:68)
> >         ... 2 more
> > Caused by: java.io.IOException: Cannot run program "C:\Program
> > Files\Java\jdk1.8.0_91\bin\java.exe" (in directory
> > "C:\Users\aturbut\Projects\vcrawl"): Crea
> > eProcess error=206, The filename or extension is too long
> > {code}
> >
> >
> > > The filename or extension is too long
> > > -------------------------------------
> > >
> > >                 Key: TIKA-1969
> > >                 URL: https://issues.apache.org/jira/browse/TIKA-1969
> > >             Project: Tika
> > >          Issue Type: Bug
> > >    Affects Versions: 1.10, 1.12
> > >            Reporter: Alin Turbut
> > >
> > > After I add the Tika dependency to my project, I receive this error:
> > > {code}
> > > Caused by: net.rubygrapefruit.platform.NativeException: Could not start
> > 'C:\Program Files\Java\jdk1.8.0_91\bin\java.exe'
> > >         at
> >
> net.rubygrapefruit.platform.internal.DefaultProcessLauncher.start(DefaultProcessLauncher.java:27)
> > >         at
> >
> net.rubygrapefruit.platform.internal.WindowsProcessLauncher.start(WindowsProcessLauncher.java:22)
> > >         at
> >
> net.rubygrapefruit.platform.internal.WrapperProcessLauncher.start(WrapperProcessLauncher.java:36)
> > >         at
> >
> org.gradle.process.internal.ExecHandleRunner.run(ExecHandleRunner.java:68)
> > >         ... 2 more
> > > Caused by: java.io.IOException: Cannot run program "C:\Program
> > Files\Java\jdk1.8.0_91\bin\java.exe" (in directory
> > "C:\Users\aturbut\Projects\vcrawl"): Crea
> > > eProcess error=206, The filename or extension is too long
> > > {code}
> > > Building&Running my project with Gradle, Spring Boot, if any of this
> > information helps.
> >
> >
> >
> > --
> > This message was sent by Atlassian JIRA
> > (v6.3.4#6332)
> >
> >
> > ...
> >
> > [Message clipped]
>
>
>
>
> --
> *Lewis*
>
-- 
Best regards,
Konstantin Gribov

Re: [VOTE] Release Apache Tika 1.13 Candidate #1

Reply via email to