[jira] [Commented] (TIKA-2955) PDF parsing to XHTML results in tika attempting to write invalid HTML characters.

2019-10-04 Thread Luke Butters (Jira)
[ https://issues.apache.org/jira/browse/TIKA-2955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16944992#comment-16944992 ] Luke Butters commented on TIKA-2955: It will only be possible to see the failure if that XML lib is on

[jira] [Commented] (TIKA-2955) PDF parsing to XHTML results in tika attempting to write invalid HTML characters.

2019-10-04 Thread Tilman Hausherr (Jira)
[ https://issues.apache.org/jira/browse/TIKA-2955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16944983#comment-16944983 ] Tilman Hausherr commented on TIKA-2955: --- I can't answer that question because I'm new here as a

[jira] [Commented] (TIKA-2955) PDF parsing to XHTML results in tika attempting to write invalid HTML characters.

2019-10-04 Thread Luke Butters (Jira)
[ https://issues.apache.org/jira/browse/TIKA-2955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16944982#comment-16944982 ] Luke Butters commented on TIKA-2955: I tried it in "2.0.0-SNAPSHOT" which seemed to fail I did not try

[jira] [Comment Edited] (TIKA-2955) PDF parsing to XHTML results in tika attempting to write invalid HTML characters.

2019-10-04 Thread Tilman Hausherr (Jira)
[ https://issues.apache.org/jira/browse/TIKA-2955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16944970#comment-16944970 ] Tilman Hausherr edited comment on TIKA-2955 at 10/5/19 4:07 AM: Per your

[jira] [Comment Edited] (TIKA-2955) PDF parsing to XHTML results in tika attempting to write invalid HTML characters.

2019-10-04 Thread Tilman Hausherr (Jira)
[ https://issues.apache.org/jira/browse/TIKA-2955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16944970#comment-16944970 ] Tilman Hausherr edited comment on TIKA-2955 at 10/5/19 4:04 AM: Per your

[jira] [Commented] (TIKA-2955) PDF parsing to XHTML results in tika attempting to write invalid HTML characters.

2019-10-04 Thread Tilman Hausherr (Jira)
[ https://issues.apache.org/jira/browse/TIKA-2955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16944970#comment-16944970 ] Tilman Hausherr commented on TIKA-2955: --- Per your stack trace you are using tika 1.19. Does it

[jira] [Comment Edited] (TIKA-2955) PDF parsing to XHTML results in tika attempting to write invalid HTML characters.

2019-10-04 Thread Luke Butters (Jira)
[ https://issues.apache.org/jira/browse/TIKA-2955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16944074#comment-16944074 ] Luke Butters edited comment on TIKA-2955 at 10/4/19 10:04 PM: -- My guess is

[jira] [Commented] (TIKA-2957) Failed build w Java 11

2019-10-04 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-2957?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16944725#comment-16944725 ] Hudson commented on TIKA-2957: -- SUCCESS: Integrated in Jenkins build tika-branch-1x #238 (See

[jira] [Commented] (TIKA-2957) Failed build w Java 11

2019-10-04 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-2957?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16944687#comment-16944687 ] Hudson commented on TIKA-2957: -- SUCCESS: Integrated in Jenkins build Tika-trunk #1702 (See

ApacheCon Europe 2019 talks which are relevant to Apache Tika

2019-10-04 Thread myrle
Dear Apache Tika committers, In a little over 2 weeks time, ApacheCon Europe is taking place in Berlin. Join us from October 22 to 24 for an exciting program and lovely get-together of the Apache Community. We are also planning a hackathon.  If your project is interested in participating,

[jira] [Commented] (TIKA-2941) OSGI bundle and app are not self-contained

2019-10-04 Thread Rafa Espillaque (Jira)
[ https://issues.apache.org/jira/browse/TIKA-2941?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16944649#comment-16944649 ] Rafa Espillaque commented on TIKA-2941: --- Thanks [~bob] > OSGI bundle and app are not self-contained

[jira] [Commented] (TIKA-2941) OSGI bundle and app are not self-contained

2019-10-04 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-2941?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16944648#comment-16944648 ] Bob Paulin commented on TIKA-2941: -- Yeah I can take a look.  I reviewed some of the lists and it doesn't

Re: [ANNOUNCE] Welcome Tilman Hausherr as Tika PMC member and committer

2019-10-04 Thread Ken Krugler
Hi Tilman, Congratulations, and welcome! — Ken > On Oct 4, 2019, at 7:19 AM, Tim Allison wrote: > > All, > > The Tika PMC has elected to add Tilman Hausherr to our ranks. Tilman, > please feel free to introduce yourself, and welcome aboard! > > Cheers, > > Tim

Re: TabularFormatsTest test fails in Germany

2019-10-04 Thread Tilman Hausherr
Am 04.10.2019 um 17:32 schrieb Tim Allison: Would it work to set the expected String to something generated with the root locale? Yes, that makes sense. But I'm wondering whether this is a configuration problem - am I the first one outside the US who tried doing a build from source? Tilman

Re: TabularFormatsTest test fails in Germany

2019-10-04 Thread Tim Allison
Would it work to set the expected String to something generated with the root locale? On Fri, Oct 4, 2019 at 10:56 AM Tilman Hausherr wrote: > So I wanted to build tika from source, and failed: > > Failures: >TabularFormatsTest.testSAS7BDAT:229->assertContents:216 en_US Wrong > text in row

TabularFormatsTest test fails in Germany

2019-10-04 Thread Tilman Hausherr
So I wanted to build tika from source, and failed: Failures:   TabularFormatsTest.testSAS7BDAT:229->assertContents:216 en_US Wrong text in row 9 and column 7 - 03(MAR|Mar)(63|1963)[:\s]09:46:40(.00)? vs 03Mär1963:09:46:40.00   TabularFormatsTest.testXLS:236->assertContents:216 en_US Wrong text

[jira] [Commented] (TIKA-2941) OSGI bundle and app are not self-contained

2019-10-04 Thread Rafa Espillaque (Jira)
[ https://issues.apache.org/jira/browse/TIKA-2941?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16944565#comment-16944565 ] Rafa Espillaque commented on TIKA-2941: --- I agree with this.  I believe this commit is the cause 

[jira] [Commented] (TIKA-2957) Failed build w Java 11

2019-10-04 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-2957?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16944554#comment-16944554 ] Hudson commented on TIKA-2957: -- UNSTABLE: Integrated in Jenkins build tika-2.x-windows #456 (See

Re: [ANNOUNCE] Welcome Tilman Hausherr as Tika PMC member and committer

2019-10-04 Thread Tilman Hausherr
Am 04.10.2019 um 16:19 schrieb Tim Allison: All, The Tika PMC has elected to add Tilman Hausherr to our ranks. Tilman, please feel free to introduce yourself, and welcome aboard! Cheers, Tim Hello everybody, Thanks for the honor. A bit about me: I'm from Germany (coincidentally,

[ANNOUNCE] Welcome Tilman Hausherr as Tika PMC member and committer

2019-10-04 Thread Tim Allison
All, The Tika PMC has elected to add Tilman Hausherr to our ranks. Tilman, please feel free to introduce yourself, and welcome aboard! Cheers, Tim

[jira] [Commented] (TIKA-2957) Failed build w Java 11

2019-10-04 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-2957?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16944527#comment-16944527 ] Tim Allison commented on TIKA-2957: --- Thank you, Tilman! > Failed build w Java 11 >

[jira] [Commented] (TIKA-2957) Failed build w Java 11

2019-10-04 Thread Tilman Hausherr (Jira)
[ https://issues.apache.org/jira/browse/TIKA-2957?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16944493#comment-16944493 ] Tilman Hausherr commented on TIKA-2957: --- Code suggestion here, I think it's not in the tika dev

[jira] [Created] (TIKA-2957) Failed build w Java 11

2019-10-04 Thread Tim Allison (Jira)
Tim Allison created TIKA-2957: - Summary: Failed build w Java 11 Key: TIKA-2957 URL: https://issues.apache.org/jira/browse/TIKA-2957 Project: Tika Issue Type: Bug Reporter: Tim

JDK 14 - Early Access build 17 is available

2019-10-04 Thread Rory O'Donnell
Hi Tim, *OpenJDK builds *- JDK 14 - Early Access build 17 is available at http://jdk.java.net/14/ These early-access, open-source builds are provided under the GNU General Public License, version 2, with the Classpath Exception . * Schedule