All signatures are correct, builds fine on Linux with openjdk8. Tests pass w/ tesseract OCR enabled.
RC1 doesn't contain fix for TIKA-1938[1]. We should either roll another RC or mark it as fix for 1.14. Currently I've marked it as fix for 1.14. [x] +1 Release this package as Apache Tika 1.13 [1]: https://issues.apache.org/jira/browse/TIKA-1938 чт, 12 мая 2016 г. в 1:14, Lewis John Mcgibbney <lewis.mcgibb...@gmail.com>: > Hi David, > Good job on the RC > The .zip artifact contains 2015 in NOTICE > Everything else looks great > All Signatures good. > Tests pass on MacOSX, Java 1.7 > > [X] +1 Release this package as Apache Tika 1.13 > > On Wed, May 11, 2016 at 6:50 AM, <dev-digest-h...@tika.apache.org> wrote: > > > > > From: David Meikle <dmei...@apache.org> > > To: dev@tika.apache.org, u...@tika.apache.org > > Cc: > > Date: Mon, 9 May 2016 20:34:32 +0100 > > Subject: [VOTE] Release Apache Tika 1.13 Candidate #1 > > A candidate for the Tika 1.13 release is available at: > > https://dist.apache.org/repos/dist/dev/tika/ > > > > > > > > > The release candidate is a zip archive of the sources in: > > > > > https://git-wip-us.apache.org/repos/asf?p=tika.git;a=tag;h=18fa8213438183a249df4f52535031670f0a3eef > > > > The SHA1 checksum of the archive is > > 8a591e7ea29dca14d5f25b44b3a2a35425676c64. > > > > > In addition, a staged maven repository is available here: > > > > > https://repository.apache.org/content/repositories/orgapachetika-1019/org/apache/tika > > > > Please vote on releasing this package as Apache Tika 1.13. > > The vote is open for the next 72 hours and passes if a majority of at > > least three +1 Tika PMC votes are cast. > > > > [ ] +1 Release this package as Apache Tika 1.13 > > [ ] -1 Do not release this package because… > > > > Here is my +1 for the release. > > > > Cheers, > > Dave > > > > P.S. For anyone looking to test using the Apache Tika Server I have put > up > > a branch that pulls down the RC at > > https://github.com/LogicalSpark/docker-tikaserver/tree/1.13rc1 > > > > > > ---------- Forwarded message ---------- > > From: "Allison, Timothy B." <talli...@mitre.org> > > To: "dev@tika.apache.org" <dev@tika.apache.org>, "u...@tika.apache.org" > < > > u...@tika.apache.org> > > Cc: > > Date: Wed, 11 May 2016 13:50:26 +0000 > > Subject: RE: [VOTE] Release Apache Tika 1.13 Candidate #1 > > +1 > > > > Built on Windows and Linux. I'm relying on earlier pre-release tests for > > no surprises. :) > > > > Thank you, Dave! > > > > -----Original Message----- > > From: David Meikle [mailto:loo...@gmail.com] On Behalf Of David Meikle > > Sent: Monday, May 9, 2016 3:35 PM > > To: dev@tika.apache.org; u...@tika.apache.org > > Subject: [VOTE] Release Apache Tika 1.13 Candidate #1 > > > > A candidate for the Tika 1.13 release is available at: > > https://dist.apache.org/repos/dist/dev/tika/ > > > > The release candidate is a zip archive of the sources in: > > > > > https://git-wip-us.apache.org/repos/asf?p=tika.git;a=tag;h=18fa8213438183a249df4f52535031670f0a3eef > > > > The SHA1 checksum of the archive is > > 8a591e7ea29dca14d5f25b44b3a2a35425676c64. > > > > In addition, a staged maven repository is available here: > > > > > https://repository.apache.org/content/repositories/orgapachetika-1019/org/apache/tika > > > > Please vote on releasing this package as Apache Tika 1.13. > > The vote is open for the next 72 hours and passes if a majority of at > > least three +1 Tika PMC votes are cast. > > > > [ ] +1 Release this package as Apache Tika 1.13 [ ] -1 Do not release > this > > package because… > > > > Here is my +1 for the release. > > > > Cheers, > > Dave > > > > P.S. For anyone looking to test using the Apache Tika Server I have put > up > > a branch that pulls down the RC at > > https://github.com/LogicalSpark/docker-tikaserver/tree/1.13rc1 > > > > > > ---------- Forwarded message ---------- > > From: "Zoltan Toth (JIRA)" <j...@apache.org> > > To: dev@tika.apache.org > > Cc: > > Date: Tue, 10 May 2016 00:56:12 +0000 (UTC) > > Subject: [jira] [Commented] (TIKA-1817) Extracts entire file content for > > ASCII DXF files > > > > [ > > > https://issues.apache.org/jira/browse/TIKA-1817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15277426#comment-15277426 > > ] > > > > Zoltan Toth commented on TIKA-1817: > > ----------------------------------- > > > > Hi there. I can see that some great progress has been made on this > > issue. Many thanks! Could someone please let me know when the fix for > > this issue is likely to be released to production? > > > > > Extracts entire file content for ASCII DXF files > > > ------------------------------------------------ > > > > > > Key: TIKA-1817 > > > URL: https://issues.apache.org/jira/browse/TIKA-1817 > > > Project: Tika > > > Issue Type: Bug > > > Affects Versions: 1.11 > > > Reporter: Zoltan Toth > > > Attachments: SMA-Controller.dxf, house design.dxf, > > jcsample-screendump.jpg, jcsample.dxf > > > > > > > > > By definition, ASCII DXF files are encoded in plain text. However. the > > vast majority of their content is not intended to be human readable (see > > https://en.wikipedia.org/wiki/AutoCAD_DXF). Unfortunately for these > > files, Tika simply "extracts" the entire content of the file instead of > the > > human-readable portions (i.e. comments etc.) that a CAD tool would > render. > > This results in massive amounts of rubbish data being returned with dire > > consequences for applications that rely on this. > > > It would be nice if only the human-readable text fields were extracted. > > Failing this, it would still be nice if no text was extracted from these > > files at all. > > > > > > > > -- > > This message was sent by Atlassian JIRA > > (v6.3.4#6332) > > > > > > > > ---------- Forwarded message ---------- > > From: "Joseph Naegele (JIRA)" <j...@apache.org> > > To: dev@tika.apache.org > > Cc: > > Date: Tue, 10 May 2016 13:52:13 +0000 (UTC) > > Subject: [jira] [Commented] (TIKA-1938) HtmlParser drops <script> > elements > > found inside <head> > > > > [ > > > https://issues.apache.org/jira/browse/TIKA-1938?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15278119#comment-15278119 > > ] > > > > Joseph Naegele commented on TIKA-1938: > > -------------------------------------- > > > > This is a very simple PR if anyone is able to review and merge. I had > > hoped to get it into 1.13. > > > > > HtmlParser drops <script> elements found inside <head> > > > ------------------------------------------------------ > > > > > > Key: TIKA-1938 > > > URL: https://issues.apache.org/jira/browse/TIKA-1938 > > > Project: Tika > > > Issue Type: Bug > > > Components: parser > > > Affects Versions: 1.12 > > > Reporter: Joseph Naegele > > > > > > HtmlParser's HtmlHandler does not check for "SCRIPT" in > {{startElement}} > > when parsing <head> (i.e. {{bodylevel == 0 && discardLevel ==0}}). This > > causes <script> elements found within <head> to be dropped altogether. > They > > should be treated in the same manner as "LINK" elements. > > > Here is a sample test case that demonstrates the problem and can be run > > from within {{HtmlParserTest.java}}, although it could be generalized to > > check for <a>, <link> and <img> links by using the > {{LinkContentHandler}}. > > > {code} > > > @Test > > > public void testScriptSrc() throws Exception { > > > String url = "http://domain.com/logic.js"; > > > String scriptInBody = > > > "<html><body><script src=\"" + url + > > "\"></script></body></html>"; > > > String scriptInHead = > > > "<html><head><script src=\"" + url + > > "\"></script></head></html>"; > > > assertScriptLink(scriptInBody, url); > > > assertScriptLink(scriptInHead, url); > > > } > > > private void assertScriptLink(String html, String url) throws > Exception { > > > // IdentityHtmlMapper needed to extract <script> tags > > > ParseContext context = new ParseContext(); > > > context.set(HtmlMapper.class, IdentityHtmlMapper.INSTANCE); > > > Metadata metadata = new Metadata(); > > > metadata.set(Metadata.CONTENT_TYPE, "text/html"); > > > final List<String> links = new ArrayList<String>(); > > > new HtmlParser().parse( > > > new ByteArrayInputStream(html.getBytes(UTF_8)), > > > new DefaultHandler() { > > > @Override > > > public void startElement( > > > String u, String l, String name, Attributes > > atts) { > > > if (name.equals("script") && atts.getValue("", > > "src") != null) { > > > links.add(atts.getValue("", "src")); > > > } > > > } > > > }, > > > metadata, > > > context); > > > assertEquals(1, links.size()); > > > assertEquals(url, links.get(0)); > > > } > > > {code} > > > > > > > > -- > > This message was sent by Atlassian JIRA > > (v6.3.4#6332) > > > > > > > > ---------- Forwarded message ---------- > > From: "ASF GitHub Bot (JIRA)" <j...@apache.org> > > To: dev@tika.apache.org > > Cc: > > Date: Tue, 10 May 2016 15:37:13 +0000 (UTC) > > Subject: [jira] [Commented] (TIKA-1938) HtmlParser drops <script> > elements > > found inside <head> > > > > [ > > > https://issues.apache.org/jira/browse/TIKA-1938?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15278290#comment-15278290 > > ] > > > > ASF GitHub Bot commented on TIKA-1938: > > -------------------------------------- > > > > Github user asfgit closed the pull request at: > > > > https://github.com/apache/tika/pull/111 > > > > > > > HtmlParser drops <script> elements found inside <head> > > > ------------------------------------------------------ > > > > > > Key: TIKA-1938 > > > URL: https://issues.apache.org/jira/browse/TIKA-1938 > > > Project: Tika > > > Issue Type: Bug > > > Components: parser > > > Affects Versions: 1.12 > > > Reporter: Joseph Naegele > > > Assignee: Ken Krugler > > > > > > HtmlParser's HtmlHandler does not check for "SCRIPT" in > {{startElement}} > > when parsing <head> (i.e. {{bodylevel == 0 && discardLevel ==0}}). This > > causes <script> elements found within <head> to be dropped altogether. > They > > should be treated in the same manner as "LINK" elements. > > > Here is a sample test case that demonstrates the problem and can be run > > from within {{HtmlParserTest.java}}, although it could be generalized to > > check for <a>, <link> and <img> links by using the > {{LinkContentHandler}}. > > > {code} > > > @Test > > > public void testScriptSrc() throws Exception { > > > String url = "http://domain.com/logic.js"; > > > String scriptInBody = > > > "<html><body><script src=\"" + url + > > "\"></script></body></html>"; > > > String scriptInHead = > > > "<html><head><script src=\"" + url + > > "\"></script></head></html>"; > > > assertScriptLink(scriptInBody, url); > > > assertScriptLink(scriptInHead, url); > > > } > > > private void assertScriptLink(String html, String url) throws > Exception { > > > // IdentityHtmlMapper needed to extract <script> tags > > > ParseContext context = new ParseContext(); > > > context.set(HtmlMapper.class, IdentityHtmlMapper.INSTANCE); > > > Metadata metadata = new Metadata(); > > > metadata.set(Metadata.CONTENT_TYPE, "text/html"); > > > final List<String> links = new ArrayList<String>(); > > > new HtmlParser().parse( > > > new ByteArrayInputStream(html.getBytes(UTF_8)), > > > new DefaultHandler() { > > > @Override > > > public void startElement( > > > String u, String l, String name, Attributes > > atts) { > > > if (name.equals("script") && atts.getValue("", > > "src") != null) { > > > links.add(atts.getValue("", "src")); > > > } > > > } > > > }, > > > metadata, > > > context); > > > assertEquals(1, links.size()); > > > assertEquals(url, links.get(0)); > > > } > > > {code} > > > > > > > > -- > > This message was sent by Atlassian JIRA > > (v6.3.4#6332) > > > > > > > > ---------- Forwarded message ---------- > > From: "Hudson (JIRA)" <j...@apache.org> > > To: dev@tika.apache.org > > Cc: > > Date: Tue, 10 May 2016 17:00:16 +0000 (UTC) > > Subject: [jira] [Commented] (TIKA-1938) HtmlParser drops <script> > elements > > found inside <head> > > > > [ > > > https://issues.apache.org/jira/browse/TIKA-1938?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15278455#comment-15278455 > > ] > > > > Hudson commented on TIKA-1938: > > ------------------------------ > > > > SUCCESS: Integrated in tika-trunk-jdk1.7 #988 (See [ > > https://builds.apache.org/job/tika-trunk-jdk1.7/988/]) > > fix for TIKA-1938 contributed by naegelejd (jnaegele: rev > > b6d23c189e852fa2e41b441c18bfe3e66e3f67c4) > > * > > > tika-parsers/src/test/java/org/apache/tika/parser/html/HtmlParserTest.java > > * tika-parsers/src/main/java/org/apache/tika/parser/html/HtmlHandler.java > > > > > > > HtmlParser drops <script> elements found inside <head> > > > ------------------------------------------------------ > > > > > > Key: TIKA-1938 > > > URL: https://issues.apache.org/jira/browse/TIKA-1938 > > > Project: Tika > > > Issue Type: Bug > > > Components: parser > > > Affects Versions: 1.12 > > > Reporter: Joseph Naegele > > > Assignee: Ken Krugler > > > > > > HtmlParser's HtmlHandler does not check for "SCRIPT" in > {{startElement}} > > when parsing <head> (i.e. {{bodylevel == 0 && discardLevel ==0}}). This > > causes <script> elements found within <head> to be dropped altogether. > They > > should be treated in the same manner as "LINK" elements. > > > Here is a sample test case that demonstrates the problem and can be run > > from within {{HtmlParserTest.java}}, although it could be generalized to > > check for <a>, <link> and <img> links by using the > {{LinkContentHandler}}. > > > {code} > > > @Test > > > public void testScriptSrc() throws Exception { > > > String url = "http://domain.com/logic.js"; > > > String scriptInBody = > > > "<html><body><script src=\"" + url + > > "\"></script></body></html>"; > > > String scriptInHead = > > > "<html><head><script src=\"" + url + > > "\"></script></head></html>"; > > > assertScriptLink(scriptInBody, url); > > > assertScriptLink(scriptInHead, url); > > > } > > > private void assertScriptLink(String html, String url) throws > Exception { > > > // IdentityHtmlMapper needed to extract <script> tags > > > ParseContext context = new ParseContext(); > > > context.set(HtmlMapper.class, IdentityHtmlMapper.INSTANCE); > > > Metadata metadata = new Metadata(); > > > metadata.set(Metadata.CONTENT_TYPE, "text/html"); > > > final List<String> links = new ArrayList<String>(); > > > new HtmlParser().parse( > > > new ByteArrayInputStream(html.getBytes(UTF_8)), > > > new DefaultHandler() { > > > @Override > > > public void startElement( > > > String u, String l, String name, Attributes > > atts) { > > > if (name.equals("script") && atts.getValue("", > > "src") != null) { > > > links.add(atts.getValue("", "src")); > > > } > > > } > > > }, > > > metadata, > > > context); > > > assertEquals(1, links.size()); > > > assertEquals(url, links.get(0)); > > > } > > > {code} > > > > > > > > -- > > This message was sent by Atlassian JIRA > > (v6.3.4#6332) > > > > > > > > ---------- Forwarded message ---------- > > From: "Ken Krugler (JIRA)" <j...@apache.org> > > To: dev@tika.apache.org > > Cc: > > Date: Tue, 10 May 2016 15:15:13 +0000 (UTC) > > Subject: [jira] [Assigned] (TIKA-1938) HtmlParser drops <script> elements > > found inside <head> > > > > [ > > > https://issues.apache.org/jira/browse/TIKA-1938?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel > > ] > > > > Ken Krugler reassigned TIKA-1938: > > --------------------------------- > > > > Assignee: Ken Krugler > > > > > HtmlParser drops <script> elements found inside <head> > > > ------------------------------------------------------ > > > > > > Key: TIKA-1938 > > > URL: https://issues.apache.org/jira/browse/TIKA-1938 > > > Project: Tika > > > Issue Type: Bug > > > Components: parser > > > Affects Versions: 1.12 > > > Reporter: Joseph Naegele > > > Assignee: Ken Krugler > > > > > > HtmlParser's HtmlHandler does not check for "SCRIPT" in > {{startElement}} > > when parsing <head> (i.e. {{bodylevel == 0 && discardLevel ==0}}). This > > causes <script> elements found within <head> to be dropped altogether. > They > > should be treated in the same manner as "LINK" elements. > > > Here is a sample test case that demonstrates the problem and can be run > > from within {{HtmlParserTest.java}}, although it could be generalized to > > check for <a>, <link> and <img> links by using the > {{LinkContentHandler}}. > > > {code} > > > @Test > > > public void testScriptSrc() throws Exception { > > > String url = "http://domain.com/logic.js"; > > > String scriptInBody = > > > "<html><body><script src=\"" + url + > > "\"></script></body></html>"; > > > String scriptInHead = > > > "<html><head><script src=\"" + url + > > "\"></script></head></html>"; > > > assertScriptLink(scriptInBody, url); > > > assertScriptLink(scriptInHead, url); > > > } > > > private void assertScriptLink(String html, String url) throws > Exception { > > > // IdentityHtmlMapper needed to extract <script> tags > > > ParseContext context = new ParseContext(); > > > context.set(HtmlMapper.class, IdentityHtmlMapper.INSTANCE); > > > Metadata metadata = new Metadata(); > > > metadata.set(Metadata.CONTENT_TYPE, "text/html"); > > > final List<String> links = new ArrayList<String>(); > > > new HtmlParser().parse( > > > new ByteArrayInputStream(html.getBytes(UTF_8)), > > > new DefaultHandler() { > > > @Override > > > public void startElement( > > > String u, String l, String name, Attributes > > atts) { > > > if (name.equals("script") && atts.getValue("", > > "src") != null) { > > > links.add(atts.getValue("", "src")); > > > } > > > } > > > }, > > > metadata, > > > context); > > > assertEquals(1, links.size()); > > > assertEquals(url, links.get(0)); > > > } > > > {code} > > > > > > > > -- > > This message was sent by Atlassian JIRA > > (v6.3.4#6332) > > > > > > > > ---------- Forwarded message ---------- > > From: asfgit <g...@git.apache.org> > > To: dev@tika.apache.org > > Cc: > > Date: Tue, 10 May 2016 15:36:53 +0000 (UTC) > > Subject: [GitHub] tika pull request: fix for TIKA-1938 contributed by > > naegelejd > > Github user asfgit closed the pull request at: > > > > https://github.com/apache/tika/pull/111 > > > > > > --- > > If your project is set up for it, you can reply to this email and have > your > > reply appear on GitHub as well. If your project does not have this > feature > > enabled and wishes so, or if the feature is enabled but not working, > please > > contact infrastructure at infrastruct...@apache.org or file a JIRA > ticket > > with INFRA. > > --- > > > > > > > > ---------- Forwarded message ---------- > > From: "Ken Krugler (JIRA)" <j...@apache.org> > > To: dev@tika.apache.org > > Cc: > > Date: Tue, 10 May 2016 15:38:12 +0000 (UTC) > > Subject: [jira] [Closed] (TIKA-1938) HtmlParser drops <script> elements > > found inside <head> > > > > [ > > > https://issues.apache.org/jira/browse/TIKA-1938?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel > > ] > > > > Ken Krugler closed TIKA-1938. > > ----------------------------- > > Resolution: Fixed > > > > Fix with commit da5bbbe..46d5775. > > > > Thanks Joseph! > > > > > HtmlParser drops <script> elements found inside <head> > > > ------------------------------------------------------ > > > > > > Key: TIKA-1938 > > > URL: https://issues.apache.org/jira/browse/TIKA-1938 > > > Project: Tika > > > Issue Type: Bug > > > Components: parser > > > Affects Versions: 1.12 > > > Reporter: Joseph Naegele > > > Assignee: Ken Krugler > > > > > > HtmlParser's HtmlHandler does not check for "SCRIPT" in > {{startElement}} > > when parsing <head> (i.e. {{bodylevel == 0 && discardLevel ==0}}). This > > causes <script> elements found within <head> to be dropped altogether. > They > > should be treated in the same manner as "LINK" elements. > > > Here is a sample test case that demonstrates the problem and can be run > > from within {{HtmlParserTest.java}}, although it could be generalized to > > check for <a>, <link> and <img> links by using the > {{LinkContentHandler}}. > > > {code} > > > @Test > > > public void testScriptSrc() throws Exception { > > > String url = "http://domain.com/logic.js"; > > > String scriptInBody = > > > "<html><body><script src=\"" + url + > > "\"></script></body></html>"; > > > String scriptInHead = > > > "<html><head><script src=\"" + url + > > "\"></script></head></html>"; > > > assertScriptLink(scriptInBody, url); > > > assertScriptLink(scriptInHead, url); > > > } > > > private void assertScriptLink(String html, String url) throws > Exception { > > > // IdentityHtmlMapper needed to extract <script> tags > > > ParseContext context = new ParseContext(); > > > context.set(HtmlMapper.class, IdentityHtmlMapper.INSTANCE); > > > Metadata metadata = new Metadata(); > > > metadata.set(Metadata.CONTENT_TYPE, "text/html"); > > > final List<String> links = new ArrayList<String>(); > > > new HtmlParser().parse( > > > new ByteArrayInputStream(html.getBytes(UTF_8)), > > > new DefaultHandler() { > > > @Override > > > public void startElement( > > > String u, String l, String name, Attributes > > atts) { > > > if (name.equals("script") && atts.getValue("", > > "src") != null) { > > > links.add(atts.getValue("", "src")); > > > } > > > } > > > }, > > > metadata, > > > context); > > > assertEquals(1, links.size()); > > > assertEquals(url, links.get(0)); > > > } > > > {code} > > > > > > > > -- > > This message was sent by Atlassian JIRA > > (v6.3.4#6332) > > > > > > > > ---------- Forwarded message ---------- > > From: "Alin Turbut (JIRA)" <j...@apache.org> > > To: dev@tika.apache.org > > Cc: > > Date: Wed, 11 May 2016 08:38:12 +0000 (UTC) > > Subject: [jira] [Created] (TIKA-1969) The filename or extension is too > long > > Alin Turbut created TIKA-1969: > > --------------------------------- > > > > Summary: The filename or extension is too long > > Key: TIKA-1969 > > URL: https://issues.apache.org/jira/browse/TIKA-1969 > > Project: Tika > > Issue Type: Bug > > Affects Versions: 1.12, 1.10 > > Reporter: Alin Turbut > > > > > > After I add the Tika dependency to my project, I receive this error: > > {code} > > Caused by: net.rubygrapefruit.platform.NativeException: Could not start > > 'C:\Program Files\Java\jdk1.8.0_91\bin\java.exe' > > at > > > net.rubygrapefruit.platform.internal.DefaultProcessLauncher.start(DefaultProcessLauncher.java:27) > > at > > > net.rubygrapefruit.platform.internal.WindowsProcessLauncher.start(WindowsProcessLauncher.java:22) > > at > > > net.rubygrapefruit.platform.internal.WrapperProcessLauncher.start(WrapperProcessLauncher.java:36) > > at > > > org.gradle.process.internal.ExecHandleRunner.run(ExecHandleRunner.java:68) > > ... 2 more > > Caused by: java.io.IOException: Cannot run program "C:\Program > > Files\Java\jdk1.8.0_91\bin\java.exe" (in directory > > "C:\Users\aturbut\Projects\vcrawl"): Crea > > eProcess error=206, The filename or extension is too long > > {code} > > > > > > > > -- > > This message was sent by Atlassian JIRA > > (v6.3.4#6332) > > > > > > > > ---------- Forwarded message ---------- > > From: "Alin Turbut (JIRA)" <j...@apache.org> > > To: dev@tika.apache.org > > Cc: > > Date: Wed, 11 May 2016 08:38:13 +0000 (UTC) > > Subject: [jira] [Updated] (TIKA-1969) The filename or extension is too > long > > > > [ > > > https://issues.apache.org/jira/browse/TIKA-1969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel > > ] > > > > Alin Turbut updated TIKA-1969: > > ------------------------------ > > Description: > > After I add the Tika dependency to my project, I receive this error: > > {code} > > Caused by: net.rubygrapefruit.platform.NativeException: Could not start > > 'C:\Program Files\Java\jdk1.8.0_91\bin\java.exe' > > at > > > net.rubygrapefruit.platform.internal.DefaultProcessLauncher.start(DefaultProcessLauncher.java:27) > > at > > > net.rubygrapefruit.platform.internal.WindowsProcessLauncher.start(WindowsProcessLauncher.java:22) > > at > > > net.rubygrapefruit.platform.internal.WrapperProcessLauncher.start(WrapperProcessLauncher.java:36) > > at > > > org.gradle.process.internal.ExecHandleRunner.run(ExecHandleRunner.java:68) > > ... 2 more > > Caused by: java.io.IOException: Cannot run program "C:\Program > > Files\Java\jdk1.8.0_91\bin\java.exe" (in directory > > "C:\Users\aturbut\Projects\vcrawl"): Crea > > eProcess error=206, The filename or extension is too long > > {code} > > > > Building&Running my project with Gradle, Spring Boot, if any of this > > information helps. > > > > was: > > After I add the Tika dependency to my project, I receive this error: > > {code} > > Caused by: net.rubygrapefruit.platform.NativeException: Could not start > > 'C:\Program Files\Java\jdk1.8.0_91\bin\java.exe' > > at > > > net.rubygrapefruit.platform.internal.DefaultProcessLauncher.start(DefaultProcessLauncher.java:27) > > at > > > net.rubygrapefruit.platform.internal.WindowsProcessLauncher.start(WindowsProcessLauncher.java:22) > > at > > > net.rubygrapefruit.platform.internal.WrapperProcessLauncher.start(WrapperProcessLauncher.java:36) > > at > > > org.gradle.process.internal.ExecHandleRunner.run(ExecHandleRunner.java:68) > > ... 2 more > > Caused by: java.io.IOException: Cannot run program "C:\Program > > Files\Java\jdk1.8.0_91\bin\java.exe" (in directory > > "C:\Users\aturbut\Projects\vcrawl"): Crea > > eProcess error=206, The filename or extension is too long > > {code} > > > > > > > The filename or extension is too long > > > ------------------------------------- > > > > > > Key: TIKA-1969 > > > URL: https://issues.apache.org/jira/browse/TIKA-1969 > > > Project: Tika > > > Issue Type: Bug > > > Affects Versions: 1.10, 1.12 > > > Reporter: Alin Turbut > > > > > > After I add the Tika dependency to my project, I receive this error: > > > {code} > > > Caused by: net.rubygrapefruit.platform.NativeException: Could not start > > 'C:\Program Files\Java\jdk1.8.0_91\bin\java.exe' > > > at > > > net.rubygrapefruit.platform.internal.DefaultProcessLauncher.start(DefaultProcessLauncher.java:27) > > > at > > > net.rubygrapefruit.platform.internal.WindowsProcessLauncher.start(WindowsProcessLauncher.java:22) > > > at > > > net.rubygrapefruit.platform.internal.WrapperProcessLauncher.start(WrapperProcessLauncher.java:36) > > > at > > > org.gradle.process.internal.ExecHandleRunner.run(ExecHandleRunner.java:68) > > > ... 2 more > > > Caused by: java.io.IOException: Cannot run program "C:\Program > > Files\Java\jdk1.8.0_91\bin\java.exe" (in directory > > "C:\Users\aturbut\Projects\vcrawl"): Crea > > > eProcess error=206, The filename or extension is too long > > > {code} > > > Building&Running my project with Gradle, Spring Boot, if any of this > > information helps. > > > > > > > > -- > > This message was sent by Atlassian JIRA > > (v6.3.4#6332) > > > > > > ... > > > > [Message clipped] > > > > > -- > *Lewis* > -- Best regards, Konstantin Gribov