[jira] [Commented] (TIKA-3164) Upgrade to POI 5.0.0 when available

2021-12-13 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17458683#comment-17458683 ] Bob Paulin commented on TIKA-3164: -- Hey [~tallison] .  See the mention but will likely not get

[jira] [Commented] (TIKA-3591) The Import-Package of commons.io is wrong in MANIFEST.MF

2021-11-17 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3591?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17445479#comment-17445479 ] Bob Paulin commented on TIKA-3591: -- Actually take that back it looks like commons-io is exporting at BOTH

[jira] [Comment Edited] (TIKA-3591) The Import-Package of commons.io is wrong in MANIFEST.MF

2021-11-17 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3591?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17445474#comment-17445474 ] Bob Paulin edited comment on TIKA-3591 at 11/17/21, 8:29 PM: -   {quote}I agree

[jira] [Commented] (TIKA-3591) The Import-Package of commons.io is wrong in MANIFEST.MF

2021-11-17 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3591?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17445474#comment-17445474 ] Bob Paulin commented on TIKA-3591: --   {quote}I agree that's what commons-io is telling tika-core to do

[jira] [Commented] (TIKA-3591) The Import-Package of commons.io is wrong in MANIFEST.MF

2021-11-17 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3591?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17445471#comment-17445471 ] Bob Paulin commented on TIKA-3591: -- Hey [~tallison] .  The commons-io library is what's saying hey you

Re: [VOTE] Release Apache Tika 2.0.0-ALPHA Candidate #1

2021-01-14 Thread Bob Paulin
Built Successfully Ran Tika Bundle Classic with latest version of Apache Felix (7.0.0) +1 [X ] +1 Release this package as Apache Tika 2.0.0-ALPHA [ ] -1 Do not release this package because... - Bob On 1/13/2021 7:19 PM, Tim Allison wrote: All, A candidate for the Tika 2.0.0-ALPHA release is

[jira] [Commented] (TIKA-3178) Tika 2.0.0 -- Add back OSGi bundles for Tika parsers

2020-12-17 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17251496#comment-17251496 ] Bob Paulin commented on TIKA-3178: -- It looks like the xerces issue is caused by the test harness. I added

[jira] [Commented] (TIKA-3178) Tika 2.0.0 -- Add back OSGi bundles for Tika parsers

2020-12-07 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17245495#comment-17245495 ] Bob Paulin commented on TIKA-3178: -- Hey [~tallison] just found some time to review this.  While

OSGi support in Tika 2.0

2020-08-26 Thread Bob Paulin
Hi, I wanted to discuss OSGi support in Tika 2.0.  My current thought is to start with the minimum support which is to add bundle packaging to each of the modules [1].  This will make the bundles usable in OSGi but will leave users on their own for putting the right dependencies together for

Re: Tests failed in windows but not in linux

2020-08-24 Thread Bob Paulin
! - Bob On 8/24/2020 8:05 AM, Peter Lee wrote: > Hi Bob, > > I think I have found out what's wrong. Seems there's an infinite loop. I have > pushed a PR, please have a look at : > https://github.com/apache/tika/pull/343 > > cheers, > Lee > > On 8 24 2020, at 8:54 , Bob Pau

Re: Tests failed in windows but not in linux

2020-08-24 Thread Bob Paulin
Hi Lee, I get the same error on windows with GeoParser and SentimentAnalysisParser on the main branch.  Removing the Logger fixes both and it builds cleanly.  Still not sure what the exact issue is but I can recreate the issue and your solution. - Bob On 8/24/2020 4:02 AM, Peter Lee wrote: >

[jira] [Resolved] (TIKA-3185) tika-parsers-integration-test fails on windows with File being used by another process.

2020-08-22 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bob Paulin resolved TIKA-3185. -- Resolution: Fixed > tika-parsers-integration-test fails on windows with File being used by > a

[jira] [Updated] (TIKA-3185) tika-parsers-integration-test fails on windows with File being used by another process.

2020-08-22 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bob Paulin updated TIKA-3185: - Fix Version/s: 2.0.0 Affects Version/s: 2.0.0 > tika-parsers-integration-test fails on wind

[jira] [Assigned] (TIKA-3185) tika-parsers-integration-test fails on windows with File being used by another process.

2020-08-22 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bob Paulin reassigned TIKA-3185: Assignee: Bob Paulin > tika-parsers-integration-test fails on windows with File being u

[jira] [Created] (TIKA-3185) tika-parsers-integration-test fails on windows with File being used by another process.

2020-08-22 Thread Bob Paulin (Jira)
Bob Paulin created TIKA-3185: Summary: tika-parsers-integration-test fails on windows with File being used by another process. Key: TIKA-3185 URL: https://issues.apache.org/jira/browse/TIKA-3185 Project

[jira] [Comment Edited] (TIKA-3178) Tika 2.0.0 -- Add back OSGi bundles for Tika parsers

2020-08-21 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17182112#comment-17182112 ] Bob Paulin edited comment on TIKA-3178 at 8/21/20, 7:50 PM: ok I get past

[jira] [Commented] (TIKA-3178) Tika 2.0.0 -- Add back OSGi bundles for Tika parsers

2020-08-21 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17182112#comment-17182112 ] Bob Paulin commented on TIKA-3178: -- ok I get past that part of the build now.  Thanks [~tallison

[jira] [Commented] (TIKA-3178) Tika 2.0.0 -- Add back OSGi bundles for Tika parsers

2020-08-21 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17182090#comment-17182090 ] Bob Paulin commented on TIKA-3178: -- Also I'm getting the following when building.  Seems like the maven

[jira] [Commented] (TIKA-3178) Tika 2.0.0 -- Add back OSGi bundles for Tika parsers

2020-08-21 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17182089#comment-17182089 ] Bob Paulin commented on TIKA-3178: -- Is this for recreating tika-bundle?  Or are we looking to create

Re: [EXTERNAL] Tika 2.0 modularization

2020-08-18 Thread Bob Paulin
rom the individual parser modules via test-jar. > > On Fri, Aug 14, 2020 at 3:30 PM Bob Paulin wrote: > >> +1 excited about this. >> >> - Bob >> On 8/14/2020 11:29 AM, Sergey Beryozkin wrote: >> >> +1  >> >> Cheers Sergey >>

Re: [EXTERNAL] Tika 2.0 modularization

2020-08-14 Thread Bob Paulin
+1 excited about this. - Bob On 8/14/2020 11:29 AM, Sergey Beryozkin wrote: > +1  > > Cheers Sergey > > On Fri 14 Aug 2020, 18:26 Chris Mattmann, wrote: > >> Haha I’m down and supportive! >> >> >> >> Time’s TIME FOR 2.x  >> >> >> >> >> >> >> >> From: Tim Allison >> Reply-To:

[jira] [Comment Edited] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-05-07 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17102137#comment-17102137 ] Bob Paulin edited comment on TIKA-3094 at 5/8/20, 1:02 AM: --- Looks like the jaxb

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-05-07 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17102137#comment-17102137 ] Bob Paulin commented on TIKA-3094: -- Looks like the jaxb error is not so much an issue with tika

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-05-05 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17099868#comment-17099868 ] Bob Paulin commented on TIKA-3094: -- Thanks [~tallison] .  For #2 JAXB was removed from the JDK

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-05-05 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17099848#comment-17099848 ] Bob Paulin commented on TIKA-3094: -- Hey [~tallison] I ran a build on Java 8 and Java 11 and I was unable

[jira] [Resolved] (TIKA-3095) tika-bundle tests fail on windows due to missing jcip-annotations

2020-04-29 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bob Paulin resolved TIKA-3095. -- Fix Version/s: 1.25 Resolution: Fixed > tika-bundle tests fail on windows due to missing j

[jira] [Created] (TIKA-3095) tika-bundle tests fail on windows due to missing jcip-annotations

2020-04-29 Thread Bob Paulin (Jira)
Bob Paulin created TIKA-3095: Summary: tika-bundle tests fail on windows due to missing jcip-annotations Key: TIKA-3095 URL: https://issues.apache.org/jira/browse/TIKA-3095 Project: Tika Issue

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-04-29 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17095917#comment-17095917 ] Bob Paulin commented on TIKA-3094: -- Fixed with https://github.com/apache/tika/commit

[jira] [Assigned] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-04-29 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bob Paulin reassigned TIKA-3094: Assignee: Bob Paulin > Apache Tika fails to extract text for pptx extens

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-04-29 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17095883#comment-17095883 ] Bob Paulin commented on TIKA-3094: -- Embedding SparseBitSet in Embed-Dependency fixes the issue

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-04-28 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17094614#comment-17094614 ] Bob Paulin commented on TIKA-3094: -- Thanks [~abchauha] .  The build process adds OSGi specific headers so

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-04-28 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17094517#comment-17094517 ] Bob Paulin commented on TIKA-3094: -- If SparseBitSet is embedded in the tika-bundle that the library

[jira] [Commented] (TIKA-2987) Extracting Metadata from JPEG Fails with Tika Bundle

2019-11-19 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-2987?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16977564#comment-16977564 ] Bob Paulin commented on TIKA-2987: -- I happened to have the source of your project on my pc from

[jira] [Commented] (TIKA-2941) OSGI bundle and app are not self-contained

2019-10-08 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-2941?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16947341#comment-16947341 ] Bob Paulin commented on TIKA-2941: -- Looks like there's a configuration option that does exactly what we

[jira] [Commented] (TIKA-2941) OSGI bundle and app are not self-contained

2019-10-07 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-2941?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16945889#comment-16945889 ] Bob Paulin commented on TIKA-2941: -- Just an update to provide some transparency around the "why"

[jira] [Commented] (TIKA-2941) OSGI bundle and app are not self-contained

2019-10-04 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-2941?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16944648#comment-16944648 ] Bob Paulin commented on TIKA-2941: -- Yeah I can take a look.  I reviewed some of the lists and it doesn't

[jira] [Commented] (TIKA-2882) Parsers should not include HTTP client code

2019-08-16 Thread Bob Paulin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16909126#comment-16909126 ] Bob Paulin commented on TIKA-2882: -- Happy to help review or answer questions on it. Seems like we're

[jira] [Commented] (TIKA-2882) Parsers should not include HTTP client code

2019-05-30 Thread Bob Paulin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16851811#comment-16851811 ] Bob Paulin commented on TIKA-2882: -- I think it might be a good idea to bring these ideas to the list to put

[jira] [Commented] (TIKA-2719) Java 9: Requiring tika-parsers from module-info.java fails with "module not found"

2018-08-30 Thread Bob Paulin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2719?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16597945#comment-16597945 ] Bob Paulin commented on TIKA-2719: -- {{org.apache.tika.core}} sounds more specific and better to me

[jira] [Commented] (TIKA-2719) Java 9: Requiring tika-parsers from module-info.java fails with "module not found"

2018-08-30 Thread Bob Paulin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2719?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16597897#comment-16597897 ] Bob Paulin commented on TIKA-2719: -- Yeah for some reason I thought we already released tika.core

[jira] [Commented] (TIKA-2719) Java 9: Requiring tika-parsers from module-info.java fails with "module not found"

2018-08-30 Thread Bob Paulin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2719?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16597674#comment-16597674 ] Bob Paulin commented on TIKA-2719: -- I would suggest adding {{Automatic-Module-Name:}} tika.parsers

[jira] [Resolved] (TIKA-2710) Set Tika to OSGi Execution Environment JavaSE-1.8

2018-08-17 Thread Bob Paulin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bob Paulin resolved TIKA-2710. -- Resolution: Fixed > Set Tika to OSGi Execution Environment JavaSE-

[jira] [Commented] (TIKA-2692) Blanket upgrades in prep for 1.19

2018-08-17 Thread Bob Paulin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16584249#comment-16584249 ] Bob Paulin commented on TIKA-2692: -- [~talli...@apache.org] create TIKA-2710 for the more improved

[jira] [Created] (TIKA-2710) Set Tika to OSGi Execution Environment JavaSE-1.8

2018-08-17 Thread Bob Paulin (JIRA)
Bob Paulin created TIKA-2710: Summary: Set Tika to OSGi Execution Environment JavaSE-1.8 Key: TIKA-2710 URL: https://issues.apache.org/jira/browse/TIKA-2710 Project: Tika Issue Type: Improvement

Re: Build with Java 10, but target 8 in Tika 2.0?

2018-06-20 Thread Bob Paulin
I'd also be a bit concerned with ONLY compiling with Java 10.  There are some changes to how resources are accessed across module boundaries that could break some existing functionality if folks decided to RUN with > Java 9 using the module system.  I covered some of these in my 2016 Apache Con

[jira] [Commented] (TIKA-2660) Prep Tika for Java 10

2018-06-04 Thread Bob Paulin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2660?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16500404#comment-16500404 ] Bob Paulin commented on TIKA-2660: -- Hey Tim.  I have some thoughts on this but I do think the parsers

Re: steps for Tika 2.0

2017-12-13 Thread Bob Paulin
Hey Tim, Happy to help with this effort.  I have a 4 week old branch that I've started applying changes to that I could push up called tika-2.0-demo-update that might provide a head start for you.  I think we do have to make some decisions on where the captioning, recognition, and sentiment

[jira] [Resolved] (TIKA-2506) Nullpointer in tika-dl test on windows

2017-11-17 Thread Bob Paulin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bob Paulin resolved TIKA-2506. -- Resolution: Fixed > Nullpointer in tika-dl test on wind

[jira] [Created] (TIKA-2506) Nullpointer in tika-dl test on windows

2017-11-17 Thread Bob Paulin (JIRA)
Bob Paulin created TIKA-2506: Summary: Nullpointer in tika-dl test on windows Key: TIKA-2506 URL: https://issues.apache.org/jira/browse/TIKA-2506 Project: Tika Issue Type: Bug

[jira] [Commented] (TIKA-2502) Upgrade OpenNLP to 1.8.3

2017-11-17 Thread Bob Paulin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16257726#comment-16257726 ] Bob Paulin commented on TIKA-2502: -- Ok so I should have a patch shortly to upgrade the maven-bundle-plugin

[jira] [Commented] (TIKA-2502) Upgrade OpenNLP to 1.8.3

2017-11-14 Thread Bob Paulin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16252146#comment-16252146 ] Bob Paulin commented on TIKA-2502: -- This looks like the error is from the felix maven-bundle-plugin

Re: Tika 2.0?

2017-09-11 Thread Bob Paulin
> We could also do the upgrade to jdk 8 with Tika 2.0. > > If this sounds reasonable, I propose creating a 1.x branch from trunk > for 1.x maintenance and then reworking trunk to the 2.x structure that Bob > Paulin so elegantly worked out. I figure we

Re: Tika 2.0?

2017-08-28 Thread Bob Paulin
ch from trunk for > 1.x maintenance and then reworking trunk to the 2.x structure that Bob Paulin > so elegantly worked out. I figure we can either copy/paste from trunk to the > current 2.x (and _hope_ we get all the updates) or use Bob's 2.0 as a model > for restructuring trunk. A

[jira] [Commented] (TIKA-2411) Clean up tika-bundle

2017-07-03 Thread Bob Paulin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2411?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16072715#comment-16072715 ] Bob Paulin commented on TIKA-2411: -- opennlp-maxent and jwnl used to be a transitive dependency pulled

Re: Tika 1.15.1?

2017-06-29 Thread Bob Paulin
> > Tim > > -Original Message----- > From: Tyler Bui-Palsulich [mailto:tbpalsul...@gmail.com] > Sent: Friday, June 2, 2017 8:39 PM > To: dev@tika.apache.org > S

Re: Tika 1.16?

2017-06-01 Thread Bob Paulin
+1 On 6/1/2017 6:50 AM, Allison, Timothy B. wrote: > Given the broken OSGi and the org.json issues with 1.15, does it make sense > to aim for 1.16 fairly soon, say 3-4 weeks? > > Cheers, > > Tim

[jira] [Resolved] (TIKA-2379) tika-bundle 1.15 has wrong import of org.sfl4j.event package which does not exists

2017-05-31 Thread Bob Paulin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bob Paulin resolved TIKA-2379. -- Resolution: Fixed > tika-bundle 1.15 has wrong import of org.sfl4j.event package which d

[jira] [Commented] (TIKA-2379) tika-bundle 1.15 has wrong import of org.sfl4j.event package which does not exists

2017-05-31 Thread Bob Paulin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2379?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16032423#comment-16032423 ] Bob Paulin commented on TIKA-2379: -- Looks like a lot changed in this bundle between 1.14. I couldn't find

[jira] [Commented] (TIKA-2379) tika-bundle 1.15 has wrong import of org.sfl4j.event package which does not exists

2017-05-31 Thread Bob Paulin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2379?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16031223#comment-16031223 ] Bob Paulin commented on TIKA-2379: -- Will take a look. I'm guessing there was a change in the dependency

[jira] [Assigned] (TIKA-2379) tika-bundle 1.15 has wrong import of org.sfl4j.event package which does not exists

2017-05-31 Thread Bob Paulin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bob Paulin reassigned TIKA-2379: Assignee: Bob Paulin > tika-bundle 1.15 has wrong import of org.sfl4j.event package which d

Re: Tika talk next week - help needed!

2017-05-16 Thread Bob Paulin
Quick slide on camel-tika. https://docs.google.com/presentation/d/1OUORiDwB4d0FkLZ0HIlQDLE30vvTniawdyzhQmLj1xE/edit?usp=sharing On 5/16/2017 10:31 AM, Nick Burch wrote: > On Tue, 16 May 2017, Eric Pugh wrote: >> It was great to read through >>

Re: [Q] reason for tika-parser-*-bundle to be separated from corresponding parser modules in 2.x

2017-03-29 Thread Bob Paulin
Hey Konstantin, Your observation is spot on and also is the reason why there is an advantage to having separate ones. The bundles are not meant to be used outside of OSGi. The current tika-bundle has many entries in the MANIFEST.MF due to the embedded dependencies. We also depend on maven to

Re: Happy 10th birthday, Apache Tika!

2017-03-22 Thread Bob Paulin
Great project. Even better community. Double digits is a big deal in software! - Bob On 3/22/2017 9:36 PM, Tyler Bui-Palsulich wrote: > Happy birthday! Thanks to everyone for creating such a great project. > > Tyler > > On Mar 22, 2017 8:35 AM, "Konstantin Gribov" wrote: >

Tika Component added to Apache Camel

2017-01-29 Thread Bob Paulin
Hi, Just wanted to let the list know there's a Tika component[1] integrated into the Apache Camel [2] project. Just wanted to let folks know it's there. Open to ideas of how to improve it. Also wanted to throw out there this could potentially provide a new way to look at some of the 2.0

[jira] [Commented] (TIKA-2245) Standardise logging

2017-01-19 Thread Bob Paulin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15830464#comment-15830464 ] Bob Paulin commented on TIKA-2245: -- [~grossws] In my experience OSGi is pretty unopinionated on logging

[jira] [Comment Edited] (TIKA-2245) Standardise logging

2017-01-19 Thread Bob Paulin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15830464#comment-15830464 ] Bob Paulin edited comment on TIKA-2245 at 1/19/17 7:30 PM: --- [~grossws] In my

Re: FW: ApacheCon Miami is coming in May.

2016-11-30 Thread Bob Paulin
I bet the Sling and Jackrabbit/Oak projects would be interested in such a track. Those projects have pretty strong corporate backing which could help with sponsorship. Would it make sense to cross post this to them to get the ball rolling? - Bob On 11/30/2016 2:15 PM, Tom Barber wrote: >

Re: [VOTE] Apache Tika 1.14 Release Candidate #1

2016-10-20 Thread Bob Paulin
+1 Builds and tests pass on Java 8 and Windows 10. - Bob On 10/19/2016 1:48 PM, Chris Mattmann wrote: Hi Folks, A first candidate for the Tika 1.14 release is available at: https://dist.apache.org/repos/dist/dev/tika/ The release candidate is a zip archive of the sources in:

Re: Plans for the first Tika 2.0 release

2016-09-19 Thread Bob Paulin
I think that could work! I've also created a custom filter that might help https://issues.apache.org/jira/browse/TIKA-2083?filter=12338448 Logic is as follows: project = TIKA AND affectedVersion = 2.0 AND priority >= Blocker AND status != Closed AND status != Fixed - Bob On 9/19/2016

[jira] [Created] (TIKA-2083) Tika 2.0 - Audit master branch against 2.x branch

2016-09-19 Thread Bob Paulin (JIRA)
Bob Paulin created TIKA-2083: Summary: Tika 2.0 - Audit master branch against 2.x branch Key: TIKA-2083 URL: https://issues.apache.org/jira/browse/TIKA-2083 Project: Tika Issue Type: Task

Re: Plans for the first Tika 2.0 release

2016-09-19 Thread Bob Paulin
. Thanks! Cheers, Tim -Original Message----- From: Bob Paulin [mailto:b...@bobpaulin.com] Sent: Monday, September 19, 2016 10:32 AM To: dev@tika.apache.org Subject: Re: Plans for the first Tika 2.0 release Hi, I think it's a good thing to discuss. I know there are othe

Re: Plans for the first Tika 2.0 release

2016-09-19 Thread Bob Paulin
Hi, I think it's a good thing to discuss. I know there are other features that are targeted for 2.0. Do we have a general sense of where those features are at? My concern is we have been dual maintaining 2 branches for about 9 months. I think the longer we do this the more risk there is

Re: PDF with embedded attachments and Tika 2.0 modularity

2016-09-16 Thread Bob Paulin
Hi Sergey, On 9/15/2016 3:33 PM, Sergey Beryozkin wrote: Hi Bob, Tim, All, On 15/09/16 18:06, Bob Paulin wrote: Hi Sergey, I definitely get the challenges. In fact recently we merged the PDF module into the Multimedia module due to the tight coupling around the TesseractOCR[1] [2]. We

Re: PDF with embedded attachments and Tika 2.0 modularity

2016-09-15 Thread Bob Paulin
Hi Sergey, I definitely get the challenges. In fact recently we merged the PDF module into the Multimedia module due to the tight coupling around the TesseractOCR[1] [2]. We could look into separating the PDF parser out again but I'm a bit short on a simple way to do it with TesseractOCR in

Re: A new Tika App in 2.0?

2016-09-13 Thread Bob Paulin
/ ++ On 9/13/16, 8:35 PM, "Bob Paulin" <b...@bobpaulin.com> wrote: Hey Nick, Thanks for the thoughts. Just to clear a few things up. The version of the app on my github does alr

Re: A new Tika App in 2.0?

2016-09-13 Thread Bob Paulin
-parser-bundle [2] https://issues.apache.org/jira/browse/TIKA-1285 On 9/13/2016 3:38 PM, Nick Burch wrote: On Sun, 11 Sep 2016, Bob Paulin wrote: I'd like to propose a new Tika App for the 2.0 branch. One of the reasons we broke apart the Tika parsers into modules was due to the complexity

A new Tika App in 2.0?

2016-09-11 Thread Bob Paulin
Hi, I'd like to propose a new Tika App for the 2.0 branch. One of the reasons we broke apart the Tika parsers into modules was due to the complexity of having to deal with all the parser dependencies and transitive dependencies. Now developers can use just the modules they want without

[jira] [Created] (TIKA-2076) Tika 2.0 - Tika App using bundles

2016-09-11 Thread Bob Paulin (JIRA)
Bob Paulin created TIKA-2076: Summary: Tika 2.0 - Tika App using bundles Key: TIKA-2076 URL: https://issues.apache.org/jira/browse/TIKA-2076 Project: Tika Issue Type: Improvement Affects

[jira] [Resolved] (TIKA-2070) Tika 2.0 - Add Encoding Detector and Language Detectors to Dynamic Service Loader

2016-09-09 Thread Bob Paulin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2070?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bob Paulin resolved TIKA-2070. -- Resolution: Fixed > Tika 2.0 - Add Encoding Detector and Language Detectors to Dynamic Service >

[jira] [Updated] (TIKA-2075) Tika 2.0 - Expose Additional TikaService methods

2016-09-09 Thread Bob Paulin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bob Paulin updated TIKA-2075: - Summary: Tika 2.0 - Expose Additional TikaService methods (was: Tika 2.0 - Expose Additonal TikaService

[jira] [Resolved] (TIKA-2072) Tika 2.0 - Create TikaServiceFactory for creating TikaService

2016-09-09 Thread Bob Paulin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bob Paulin resolved TIKA-2072. -- Resolution: Fixed > Tika 2.0 - Create TikaServiceFactory for creating TikaServ

[jira] [Created] (TIKA-2075) Tika 2.0 - Expose Additonal TikaService methods

2016-09-09 Thread Bob Paulin (JIRA)
Bob Paulin created TIKA-2075: Summary: Tika 2.0 - Expose Additonal TikaService methods Key: TIKA-2075 URL: https://issues.apache.org/jira/browse/TIKA-2075 Project: Tika Issue Type: Improvement

[jira] [Created] (TIKA-2074) Tika 2.0 - Allow ServiceLoader to use Class files loaded via dynamic loading

2016-09-09 Thread Bob Paulin (JIRA)
Bob Paulin created TIKA-2074: Summary: Tika 2.0 - Allow ServiceLoader to use Class files loaded via dynamic loading Key: TIKA-2074 URL: https://issues.apache.org/jira/browse/TIKA-2074 Project: Tika

[jira] [Created] (TIKA-2073) Tika 2.0 - Tika Language Detect Project should include Bundle Activator and packaging consistant with other modules

2016-09-09 Thread Bob Paulin (JIRA)
Bob Paulin created TIKA-2073: Summary: Tika 2.0 - Tika Language Detect Project should include Bundle Activator and packaging consistant with other modules Key: TIKA-2073 URL: https://issues.apache.org/jira/browse

[jira] [Created] (TIKA-2072) Tika 2.0 - Create TikaServiceFactory for creating TikaService

2016-09-09 Thread Bob Paulin (JIRA)
Bob Paulin created TIKA-2072: Summary: Tika 2.0 - Create TikaServiceFactory for creating TikaService Key: TIKA-2072 URL: https://issues.apache.org/jira/browse/TIKA-2072 Project: Tika Issue Type

[jira] [Updated] (TIKA-2071) Tika 2.0 - DefaultParser and CompositeParser does not filter excludedParsers from dynamic ServiceLoader Parsers

2016-09-09 Thread Bob Paulin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bob Paulin updated TIKA-2071: - Issue Type: Bug (was: Improvement) > Tika 2.0 - DefaultParser and CompositeParser does not fil

[jira] [Updated] (TIKA-2070) Tika 2.0 - Add Encoding Detector and Language Detectors to Dynamic Service Loader

2016-09-09 Thread Bob Paulin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2070?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bob Paulin updated TIKA-2070: - Affects Version/s: 2.0 > Tika 2.0 - Add Encoding Detector and Language Detectors to Dynamic Serv

[jira] [Created] (TIKA-2071) Tika 2.0 - DefaultParser and CompositeParser does not filter excludedParsers from dynamic ServiceLoader Parsers

2016-09-09 Thread Bob Paulin (JIRA)
Bob Paulin created TIKA-2071: Summary: Tika 2.0 - DefaultParser and CompositeParser does not filter excludedParsers from dynamic ServiceLoader Parsers Key: TIKA-2071 URL: https://issues.apache.org/jira/browse/TIKA

[jira] [Created] (TIKA-2070) Tika 2.0 - Add Encoding Detector and Language Detectors to Dynamic Service Loader

2016-09-09 Thread Bob Paulin (JIRA)
Bob Paulin created TIKA-2070: Summary: Tika 2.0 - Add Encoding Detector and Language Detectors to Dynamic Service Loader Key: TIKA-2070 URL: https://issues.apache.org/jira/browse/TIKA-2070 Project: Tika

[jira] [Reopened] (TIKA-2061) Tika 2.0 - Embed xmpcore dependency in tika-xmp bundle

2016-08-29 Thread Bob Paulin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2061?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bob Paulin reopened TIKA-2061: -- Would someone be able to review this to ensure I added the xmpcore BSD license correctly? Thanks! > T

[jira] [Resolved] (TIKA-2063) Tika 2.0 - Create Vorbis Parser bundle

2016-08-29 Thread Bob Paulin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2063?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bob Paulin resolved TIKA-2063. -- Resolution: Fixed Assignee: Bob Paulin > Tika 2.0 - Create Vorbis Parser bun

[jira] [Resolved] (TIKA-2060) Tika 2.0 - Add Toggle to Tika Batch ClassLoaderUtil to enable OSGi loading

2016-08-29 Thread Bob Paulin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bob Paulin resolved TIKA-2060. -- Resolution: Fixed > Tika 2.0 - Add Toggle to Tika Batch ClassLoaderUtil to enable OSGi load

[jira] [Created] (TIKA-2063) Tika 2.0 - Create Vorbis Parser bundle

2016-08-28 Thread Bob Paulin (JIRA)
Bob Paulin created TIKA-2063: Summary: Tika 2.0 - Create Vorbis Parser bundle Key: TIKA-2063 URL: https://issues.apache.org/jira/browse/TIKA-2063 Project: Tika Issue Type: Task

[jira] [Resolved] (TIKA-2061) Tika 2.0 - Embed xmpcore dependency in tika-xmp bundle

2016-08-28 Thread Bob Paulin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2061?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bob Paulin resolved TIKA-2061. -- Resolution: Fixed > Tika 2.0 - Embed xmpcore dependency in tika-xmp bun

[jira] [Resolved] (TIKA-2062) Tika 2.0 - Remove Inlining of Bouncy Castle jars in tika-bundle projects

2016-08-28 Thread Bob Paulin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2062?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bob Paulin resolved TIKA-2062. -- Resolution: Fixed > Tika 2.0 - Remove Inlining of Bouncy Castle jars in tika-bundle proje

[jira] [Created] (TIKA-2062) Tika 2.0 - Remove Inlining of Bouncy Castle jars in tika-bundle projects

2016-08-27 Thread Bob Paulin (JIRA)
Bob Paulin created TIKA-2062: Summary: Tika 2.0 - Remove Inlining of Bouncy Castle jars in tika-bundle projects Key: TIKA-2062 URL: https://issues.apache.org/jira/browse/TIKA-2062 Project: Tika

[jira] [Created] (TIKA-2061) Tika 2.0 - Embed xmpcore dependency in tika-xmp bundle

2016-08-27 Thread Bob Paulin (JIRA)
Bob Paulin created TIKA-2061: Summary: Tika 2.0 - Embed xmpcore dependency in tika-xmp bundle Key: TIKA-2061 URL: https://issues.apache.org/jira/browse/TIKA-2061 Project: Tika Issue Type: Task

[jira] [Created] (TIKA-2060) Tika 2.0 - Add Toggle to Tika Batch ClassLoaderUtil to enable OSGi loading

2016-08-27 Thread Bob Paulin (JIRA)
Bob Paulin created TIKA-2060: Summary: Tika 2.0 - Add Toggle to Tika Batch ClassLoaderUtil to enable OSGi loading Key: TIKA-2060 URL: https://issues.apache.org/jira/browse/TIKA-2060 Project: Tika

[jira] [Created] (TIKA-2059) Tika 2.0 - Merge PDF and Multimedia Modules

2016-08-27 Thread Bob Paulin (JIRA)
Bob Paulin created TIKA-2059: Summary: Tika 2.0 - Merge PDF and Multimedia Modules Key: TIKA-2059 URL: https://issues.apache.org/jira/browse/TIKA-2059 Project: Tika Issue Type: Task Affects
