[jira] [Updated] (TIKA-1724) Create parser for .obo file format.

2015-08-28 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1724?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated TIKA-1724: --- Description: This parser implementation caters for files of the [OBO Flat File Format

[jira] [Commented] (TIKA-1599) Switch from TagSoup to JSoup

2015-12-13 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15055191#comment-15055191 ] Lewis John McGibbney commented on TIKA-1599: Hi [~talli...@mitre.org] we actually use [Neko

[jira] [Commented] (TIKA-1599) Switch from TagSoup to JSoup

2015-12-13 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15055193#comment-15055193 ] Lewis John McGibbney commented on TIKA-1599: Hi [~talli...@mitre.org] we actually use [Neko

Re: [VOTE] Moving SCM to Git

2016-01-04 Thread Lewis John Mcgibbney
[X] +1 Move the Apache Tika source control to Writeable Git repos at the ASF On Sat, Jan 2, 2016 at 7:40 PM, wrote: > > > DISCUSS thread here: http://s.apache.org/wVE > > Time to officially VOTE on moving Tika to Git. I’ve made a wiki > page for our SCM

[jira] [Commented] (TIKA-1820) Upgrade rome to 1.5.1

2016-01-04 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1820?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15081015#comment-15081015 ] Lewis John McGibbney commented on TIKA-1820: Would like to commit by EoB today unless

[jira] [Commented] (TIKA-1820) Upgrade rome to 1.5.1

2016-01-05 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1820?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15083110#comment-15083110 ] Lewis John McGibbney commented on TIKA-1820: ACK yes, is there a roadmap for 2.X? > Upgr

[jira] [Resolved] (TIKA-1825) Add 2.x branch to Hudson

2016-01-06 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved TIKA-1825. Resolution: Fixed Assignee: Lewis John McGibbney Done https

[jira] [Updated] (TIKA-1820) Upgrade rome to 1.5.1

2015-12-29 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated TIKA-1820: --- Flags: Patch > Upgrade rome to 1.5.1 > - > >

[jira] [Updated] (TIKA-1820) Upgrade rome to 1.5.1

2015-12-29 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated TIKA-1820: --- Issue Type: Improvement (was: Bug) > Upgrade rome to 1.

[jira] [Updated] (TIKA-1820) Upgrade rome to 1.5.1

2015-12-29 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated TIKA-1820: --- Attachment: TIKA-1820.patch Patch for trunk folks. > Upgrade rome to 1.

[jira] [Commented] (TIKA-1818) Tika 2.0 - Decouple Parser Test Documents and files from tika-parsers project

2015-12-29 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1818?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15074272#comment-15074272 ] Lewis John McGibbney commented on TIKA-1818: hi [~bobpaulin] we've done this well over in Any23

[jira] [Created] (TIKA-1820) Upgrade rome to 1.5.1

2015-12-29 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created TIKA-1820: -- Summary: Upgrade rome to 1.5.1 Key: TIKA-1820 URL: https://issues.apache.org/jira/browse/TIKA-1820 Project: Tika Issue Type: Bug

[jira] [Resolved] (TIKA-1516) Downgrade Rome dependency to 0.9 to avoid nasty NPE

2016-01-04 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1516?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved TIKA-1516. Resolution: Fixed Committed revision 1722935 in trunk > Downgrade Rome depende

[jira] [Resolved] (TIKA-1820) Upgrade rome to 1.5.1

2016-01-04 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved TIKA-1820. Resolution: Fixed Committed revision 1722935 in trunk > Upgrade rome to 1.

[jira] [Comment Edited] (TIKA-1516) Downgrade Rome dependency to 0.9 to avoid nasty NPE

2016-01-04 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15081480#comment-15081480 ] Lewis John McGibbney edited comment on TIKA-1516 at 1/4/16 5:58 PM

Re: NER wiki page up

2015-11-20 Thread Lewis John Mcgibbney
Hi Chris, Nice work generally. I love this kind of thing hence why I love GATE which uses both Stanford toolkits and Tika as well :) On Fri, Nov 20, 2015 at 8:37 AM, wrote: > > Thamme and I added a wiki page for Tika/Stanford NER and Apache > OpenNLP

[jira] [Resolved] (TIKA-1978) Invocation of java.net.URL.equals(Object), which blocks to do domain name resolution, in org.apache.tika.parser.geo.topic.GeoParser.initialize(URL)

2016-05-25 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved TIKA-1978. Resolution: Fixed Done and dusted > Invocation of java.net.URL.equals(Obj

[jira] [Commented] (TIKA-1996) Upgrade to PDFBox 2.0.2 when available

2016-06-13 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1996?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15327730#comment-15327730 ] Lewis John McGibbney commented on TIKA-1996: ACK, I'll head over to builds@ as the stack trace

Build step 'Execute shell' marked build as failure in tika-2.x-windows Jenkins build

2016-06-13 Thread lewis john mcgibbney
Hi Builds@, We established the tika-2.x-windows Jenkins build [0] a while ago and it has always failed with the below error. An example can also be seen at [1]. Can someone please help me to resolve the issue on the Windows slave(s). Thanks in advance for any help. Lewis FATAL: command execution

[jira] [Updated] (TIKA-1978) Invocation of java.net.URL.equals(Object), which blocks to do domain name resolution, in org.apache.tika.parser.geo.topic.GeoParser.initialize(URL)

2016-05-26 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated TIKA-1978: --- Fix Version/s: 2.0 > Invocation of java.net.URL.equals(Object), which blocks to

[jira] [Commented] (TIKA-1978) Invocation of java.net.URL.equals(Object), which blocks to do domain name resolution, in org.apache.tika.parser.geo.topic.GeoParser.initialize(URL)

2016-05-26 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1978?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15302473#comment-15302473 ] Lewis John McGibbney commented on TIKA-1978: No problem I will make it right now [~talli

[jira] [Commented] (TIKA-1978) Invocation of java.net.URL.equals(Object), which blocks to do domain name resolution, in org.apache.tika.parser.geo.topic.GeoParser.initialize(URL)

2016-05-26 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1978?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15302611#comment-15302611 ] Lewis John McGibbney commented on TIKA-1978: [~talli...@mitre.org] please see https

Fwd: [jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-17 Thread Lewis John Mcgibbney
Hi Folks, Pretty cool news. It seems like Tim and I managed to win the hearts and minds of Uwe and Solr devs. Solr 6.2 will run with Tika 1.13. Hopefully this sets a precedent for us breaking down the barriers which have meant that until now Tika has been upgraded sparingly in Solr. Nice work Tim.

[jira] [Commented] (TIKA-1820) Upgrade rome to 1.5.1

2016-01-10 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1820?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15091130#comment-15091130 ] Lewis John McGibbney commented on TIKA-1820: Committed revision 1723942 in 2.X > Upgrade r

[jira] [Updated] (TIKA-1820) Upgrade rome to 1.5.1

2016-01-10 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated TIKA-1820: --- Attachment: TIKA-1820.patch Patch for 2.X > Upgrade rome to 1.

[jira] [Updated] (TIKA-1724) Create parser for .obo file format.

2016-01-10 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1724?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated TIKA-1724: --- Attachment: TIKA-1724.patch Patch for trunk folks. I have a major problem

[jira] [Commented] (TIKA-1851) Tika 2.0 - Move test resources from core to test-resources

2016-02-10 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15141649#comment-15141649 ] Lewis John McGibbney commented on TIKA-1851: [~talli...@apache.org] bq. Lewis John McGibbney

Re: [VOTE] Apache Tika 1.12 Release Candidate #1

2016-02-04 Thread Lewis John Mcgibbney
Hi Chris, +1 to release this release candidate Thanks Lewis On Tue, Feb 2, 2016 at 4:24 PM, Lewis John Mcgibbney < lewis.mcgibb...@gmail.com> wrote: > Hi Chris, > > Signatures all good. Verified using the scripts apachestuff. > mvn install and all tests pass fine on MacOSX 10.9

[jira] [Commented] (TIKA-1851) Tika 2.0 - Move test resources from core to test-resources

2016-02-04 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15132561#comment-15132561 ] Lewis John McGibbney commented on TIKA-1851: Are we using the most recent osgi/Felix

[jira] [Commented] (TIKA-1851) Tika 2.0 - Move test resources from core to test-resources

2016-02-05 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135483#comment-15135483 ] Lewis John McGibbney commented on TIKA-1851: If you can build locally then can you try a manual

[jira] [Commented] (TIKA-1851) Tika 2.0 - Move test resources from core to test-resources

2016-02-05 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135521#comment-15135521 ] Lewis John McGibbney commented on TIKA-1851: Ack -- *Lewis* > Tika 2.0 - Move t

[jira] [Commented] (TIKA-1851) Tika 2.0 - Move test resources from core to test-resources

2016-02-05 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135475#comment-15135475 ] Lewis John McGibbney commented on TIKA-1851: All dependencies should always come from first

[jira] [Commented] (TIKA-1851) Tika 2.0 - Move test resources from core to test-resources

2016-02-05 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135496#comment-15135496 ] Lewis John McGibbney commented on TIKA-1851: Can you check which modules failed. If something

[jira] [Commented] (TIKA-1851) Tika 2.0 - Move test resources from core to test-resources

2016-02-05 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135348#comment-15135348 ] Lewis John McGibbney commented on TIKA-1851: I'm the same Ken. We have been using cTAKES

[jira] [Commented] (TIKA-1851) Tika 2.0 - Move test resources from core to test-resources

2016-02-05 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135559#comment-15135559 ] Lewis John McGibbney commented on TIKA-1851: In all honesty, if one takes a step back

[jira] [Commented] (TIKA-1851) Tika 2.0 - Move test resources from core to test-resources

2016-02-05 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135594#comment-15135594 ] Lewis John McGibbney commented on TIKA-1851: Regression in Tika advanced module and also

[jira] [Updated] (TIKA-1848) Address issues with Tika 1.12rc#1

2016-02-02 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated TIKA-1848: --- Priority: Major (was: Blocker) > Address issues with Tika 1.12r

[jira] [Updated] (TIKA-1846) Set up Hudson (or similar?) with new Git repo

2016-02-02 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1846?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated TIKA-1846: --- Fix Version/s: 1.13 2.0 > Set up Hudson (or similar?) with new

[jira] [Resolved] (TIKA-1846) Set up Hudson (or similar?) with new Git repo

2016-02-02 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1846?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved TIKA-1846. Resolution: Fixed Assignee: Lewis John McGibbney DONE Thanks > Set up Hud

[jira] [Commented] (TIKA-1848) Address issues with Tika 1.12rc#1

2016-02-03 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15130667#comment-15130667 ] Lewis John McGibbney commented on TIKA-1848: Ack Ken -- *Lewis* > Address iss

[jira] [Commented] (TIKA-1848) Address issues with Tika 1.12rc#1

2016-02-03 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15130582#comment-15130582 ] Lewis John McGibbney commented on TIKA-1848: Hi Folks, I am +1 to this being closed

[jira] [Created] (TIKA-1848) Address issues with Tika 1.12rc#1

2016-02-02 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created TIKA-1848: -- Summary: Address issues with Tika 1.12rc#1 Key: TIKA-1848 URL: https://issues.apache.org/jira/browse/TIKA-1848 Project: Tika Issue Type: Bug

Re: [VOTE] Apache Tika 1.12 Release Candidate #1

2016-02-02 Thread Lewis John Mcgibbney
Hi Chris, Signatures all good. Verified using the scripts apachestuff. mvn install and all tests pass fine on MacOSX 10.9.5 Ran DRAT from master branch with following output Notes Binaries Archives Standards Apache Generated Unknown 0 2 0 868 836 0 32 Issue filed in Jira to address and resolve

[jira] [Commented] (TIKA-1848) Address issues with Tika 1.12rc#1

2016-02-02 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15129484#comment-15129484 ] Lewis John McGibbney commented on TIKA-1848: [~talli...@mitre.org] ACK I've not VOTE'd so

[jira] [Updated] (TIKA-1848) Address issues with Tika 1.12rc#1

2016-02-02 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated TIKA-1848: --- Issue Type: Task (was: Bug) > Address issues with Tika 1.12r

Re: Who's going to Apache: Big Data in May?

2016-04-07 Thread Lewis John Mcgibbney
Hi Ken, I'll be there and will look forward to seeing you again. Best On Tue, Mar 29, 2016 at 1:18 PM, wrote: > > From: Ken Krugler > To: "dev@tika.apache.org" > Cc: > Date: Tue, 29 Mar 2016 11:53:38 -0700 >

Re: pre-release 1.13 regression testing

2016-04-26 Thread Lewis John Mcgibbney
Hi Tim, What does this consist of? Are the tests hosted and executed on the Infra hosted VM? It would be great to see what the outcome of integration tests are... I've never seen this before and it would be very helpful for making a positive case for upgrading Tika in projects such as Solr cf.

[jira] [Created] (TIKA-1978) Invocation of java.net.URL.equals(Object), which blocks to do domain name resolution, in org.apache.tika.parser.geo.topic.GeoParser.initialize(URL)

2016-05-19 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created TIKA-1978: -- Summary: Invocation of java.net.URL.equals(Object), which blocks to do domain name resolution, in org.apache.tika.parser.geo.topic.GeoParser.initialize(URL) Key: TIKA-1978

[jira] [Commented] (TIKA-1978) Invocation of java.net.URL.equals(Object), which blocks to do domain name resolution, in org.apache.tika.parser.geo.topic.GeoParser.initialize(URL)

2016-05-19 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1978?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15292295#comment-15292295 ] Lewis John McGibbney commented on TIKA-1978: https://github.com/apache/tika/blob/master/tika

Re: [VOTE] Release Apache Tika 1.13 Candidate #1

2016-05-11 Thread Lewis John Mcgibbney
Hi David, Good job on the RC The .zip artifact contains 2015 in NOTICE Everything else looks great All Signatures good. Tests pass on MacOSX, Java 1.7 [X] +1 Release this package as Apache Tika 1.13 On Wed, May 11, 2016 at 6:50 AM, wrote: > > From: David Meikle

Re: Tika 1.14?

2016-08-12 Thread lewis john mcgibbney
Good thread Tim, Regarding open issues and low hanging fruit to make it into 1.14, I will also work on finishing https://github.com/apache/tika/pull/112. I think Bob has an excellent point. The 2.X work is major and would be a big step in the right direction. Having both branches longer and longer

Fwd: Google Summer of Code 2017 is coming

2017-02-03 Thread lewis john mcgibbney
Hi Folks, Please see above. If anyone is interested in participating in or mentoring a GSoC project then please respond to this thread. Usually, from there you can open a Jira ticket in which ever project it is you are interested and we take it from there. Have a great weekend. Lewis --

Re: Rest API Documentation

2017-01-24 Thread lewis john mcgibbney
I'm on it. On Tue, Jan 24, 2017 at 9:56 AM, wrote: > > From: "Allison, Timothy B." > To: "u...@tika.apache.org" > Cc: "dev@tika.apache.org" > Date: Mon, 23 Jan 2017 12:21:20 + > Subject: RE:

Fwd: Reminder - Action recommended: Migrate Microsoft Translator API to Azure—limited subscription access in Azure DataMarket through April 30, 2017

2017-01-24 Thread lewis john mcgibbney
Hi Folks, May be of use to anyone currently using the Microsoft Translator implementation in Tika. Lewis -- Forwarded message -- From: Azure Team Date: Tue, Jan 24, 2017 at 9:51 AM Subject: Reminder - Action recommended: Migrate Microsoft

Re: Rest API Documentation

2017-01-24 Thread lewis john mcgibbney
. Lewis On Tue, Jan 24, 2017 at 10:05 AM, lewis john mcgibbney <lewi...@apache.org> wrote: > I'm on it. > > On Tue, Jan 24, 2017 at 9:56 AM, <dev-digest-h...@tika.apache.org> wrote: > >> >> From: "Allison, Timothy B." <talli...@mitre.org> >&g

[jira] [Created] (TIKA-2253) Obtain new Miredot license key and upgrade plugin version in tika-server

2017-01-26 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created TIKA-2253: -- Summary: Obtain new Miredot license key and upgrade plugin version in tika-server Key: TIKA-2253 URL: https://issues.apache.org/jira/browse/TIKA-2253

[jira] [Resolved] (TIKA-2253) Obtain new Miredot license key and upgrade plugin version in tika-server

2017-01-27 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2253?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved TIKA-2253. Resolution: Fixed > Obtain new Miredot license key and upgrade plugin vers

Tika on apache.org

2016-09-10 Thread lewis john mcgibbney
Yaldi Tika is featured today on apache.org :) :) :) -- http://home.apache.org/~lewismc/ @hectorMcSpector http://www.linkedin.com/in/lmcgibbney

https://issues.apache.org/jira/browse/INFRA-12186

2016-09-20 Thread lewis john mcgibbney
Hi Folks, Check out https://issues.apache.org/jira/browse/INFRA-12186, it will help us to reduce major bugs in Tika over time. Thanks Lewis -- http://home.apache.org/~lewismc/ @hectorMcSpector http://www.linkedin.com/in/lmcgibbney

Re: https://issues.apache.org/jira/browse/INFRA-12186

2016-09-21 Thread lewis john mcgibbney
Hi Ken, Good question. Answer below On Wed, Sep 21, 2016 at 2:16 PM, wrote: > > From: Ken Krugler > To: dev@tika.apache.org > Cc: > Date: Tue, 20 Sep 2016 09:22:46 -0700 > Subject: Re:

MicrosoftTranslator moving to Azure

2016-10-25 Thread lewis john mcgibbney
I suspect this will have an impact on our code https://translatorbusiness.uservoice.com/knowledgebase/articles/1078534-microsoft-translator-on-azure?WT.mc_id=azurebg_email_Trans_1228_Microsoft_Translator_Azure_Portal Lewis -- http://home.apache.org/~lewismc/ @hectorMcSpector

[jira] [Resolved] (TIKA-1343) Create a Tika Translator implementation that uses JoshuaDecoder

2016-10-25 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved TIKA-1343. Resolution: Fixed Assignee: Lewis John McGibbney (was: Chris A. Mattmann

Re: FW: tika-2.x-windows - Build # 94 - Still Failing

2017-01-13 Thread lewis john mcgibbney
Hi Tim, What do you want to change the polling to? We can make it nightly or something. What do you want? Thanks On Thu, Jan 5, 2017 at 4:35 AM, Allison, Timothy B. wrote: > Lewis, > Looks like our 2.x windows build is still failing. The new behavior, > though, is that

Fwd: Action recommended: Migrate Microsoft Translator API to Azure—limited access via Azure DataMarket starting January 1, 2017

2016-12-16 Thread lewis john mcgibbney
-- Forwarded message - From: Azure Team Date: Thu, Dec 15, 2016 at 2:50 PM Subject: Action recommended: Migrate Microsoft Translator API to Azure—limited access via Azure DataMarket starting January 1, 2017 To:

[jira] [Resolved] (TIKA-2291) REST API documentation is down

2017-03-08 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved TIKA-2291. Resolution: Fixed > REST API documentation is d

[jira] [Updated] (TIKA-2291) REST API documentation is down

2017-03-08 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated TIKA-2291: --- Fix Version/s: 1.15 > REST API documentation is d

[jira] [Assigned] (TIKA-2291) REST API documentation is down

2017-03-08 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned TIKA-2291: -- Assignee: Lewis John McGibbney > REST API documentation is d

[jira] [Commented] (TIKA-2291) REST API documentation is down

2017-03-08 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2291?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15901710#comment-15901710 ] Lewis John McGibbney commented on TIKA-2291: Hi [~mikecasd] you can build the documentation

[jira] [Updated] (TIKA-2636) ENVI Header metadata fields can span more than one line

2018-05-01 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated TIKA-2636: --- Fix Version/s: (was: 2.0.0) 1.19 > ENVI Header metadata fie

[jira] [Updated] (TIKA-2636) ENVI Header metadata fields can span more than one line

2018-05-01 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated TIKA-2636: --- Affects Version/s: (was: 1.17) 1.18 > ENVI Header metad

[jira] [Resolved] (TIKA-2636) ENVI Header metadata fields can span more than one line

2018-05-01 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved TIKA-2636. Resolution: Fixed > ENVI Header metadata fields can span more than one l

[jira] [Created] (TIKA-2639) Update freedesktop.org shared-mime-info-spec hyperlink in MimeTypesReader.java

2018-05-01 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created TIKA-2639: -- Summary: Update freedesktop.org shared-mime-info-spec hyperlink in MimeTypesReader.java Key: TIKA-2639 URL: https://issues.apache.org/jira/browse/TIKA-2639

[jira] [Created] (TIKA-2565) Upgrade edu.ucar dependencies to 4.6.11

2018-02-02 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created TIKA-2565: -- Summary: Upgrade edu.ucar dependencies to 4.6.11 Key: TIKA-2565 URL: https://issues.apache.org/jira/browse/TIKA-2565 Project: Tika Issue Type

Re: relying on a non-Maven central repo?

2018-02-08 Thread lewis john mcgibbney
Hi Folks, Thanks for the input so far... some more comments below. In the past I've uploaded the 'problem' artifacts (namely those classified as 'scientific' e.g. netcdf and dependencies) to OSSRH. There is no working around the fact that is a PITA... and that's not me being obstinate... it

Re: Unnecessary WARNING Logging?

2018-03-01 Thread lewis john mcgibbney
8:03:53 + (GMT) > Subject: Re: Unnecessary WARNING Logging? > On Tue, 27 Feb 2018, lewis john mcgibbney wrote: > >> I don't know when it was introduced, by I see the following, rather >> annoying WARNING messages in many logs now. >> > > IIRC we're changing those to

[jira] [Created] (TIKA-2636) ENVI Header metadata fields can span more than one line

2018-04-23 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created TIKA-2636: -- Summary: ENVI Header metadata fields can span more than one line Key: TIKA-2636 URL: https://issues.apache.org/jira/browse/TIKA-2636 Project: Tika

Unnecessary WARNING Logging?

2018-02-27 Thread lewis john mcgibbney
Hi Folks, I don't know when it was introduced, by I see the following, rather annoying WARNING messages in many logs now. I feel that these should not be WARNING e.g. what if I don't want any of them to be initiated... which is exactly the case over when we use Tika in Apache Any23. Does anyone

[jira] [Created] (TIKA-2762) Capture short fields (<150 chars) in EnviParserHeader Metadata

2018-10-19 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created TIKA-2762: -- Summary: Capture short fields (<150 chars) in EnviParserHeader Metadata Key: TIKA-2762 URL: https://issues.apache.org/jira/browse/TIKA-2762 Project: T

[jira] [Created] (TIKA-2763) PDFParser - java.io.IOException: Missing root object specification in trailer

2018-10-19 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created TIKA-2763: -- Summary: PDFParser - java.io.IOException: Missing root object specification in trailer Key: TIKA-2763 URL: https://issues.apache.org/jira/browse/TIKA-2763

[jira] [Assigned] (TIKA-2770) Convert EnviHeader "map info" from UTM to LatLon

2018-11-02 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2770?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned TIKA-2770: -- Assignee: Lewis John McGibbney > Convert EnviHeader "map info&q

[jira] [Reopened] (TIKA-2770) Convert EnviHeader "map info" from UTM to LatLon

2018-11-02 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2770?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reopened TIKA-2770: > Convert EnviHeader "map info" from

[jira] [Updated] (TIKA-2770) Convert EnviHeader "map info" from UTM to LatLon

2018-11-02 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2770?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated TIKA-2770: --- Fix Version/s: 1.20 > Convert EnviHeader "map info" from

[jira] [Resolved] (TIKA-2763) PDFParser - java.io.IOException: Missing root object specification in trailer

2018-10-24 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2763?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved TIKA-2763. Resolution: Not A Problem I fixed this if I used the correct command curl -X PUT

Resource Sharing Tika Corpus with Any23

2018-11-30 Thread Lewis John Mcgibbney
Hi dev@tika, Over at Any23 we have been discussing the prospect of running large scale jobs over a significant, challenging dataset, same as is done with Tika via Tika batch on the VM. Is there any possibility, a very small number of us from the Any23 team could access VM and the dataset(s)? If

Re: Resource Sharing Tika Corpus with Any23

2018-11-30 Thread Lewis John McGibbney
Hi Tim, Thanks for the reply... answer inline On 2018/11/30 19:22:23, Tim Allison wrote: > I think that'd be great. Some questions: > > 1) Would you use the same input docs that we're using or would you > need/want a new TB drive for your input/output? The same docs I suspect. We *could*

Re: 1.20?

2018-11-28 Thread Lewis John McGibbney
+1 would be nice to get the recent ENVI work released as well folks. On 2018/11/20 23:04:29, Tim Allison wrote: > All, >POI 4.0.1 will be out shortly with some important bug fixes. What would > you all think of targeting 1st/2nd week of December for 1.20? > > Cheers, > Tim

[jira] [Created] (TIKA-2796) Update GoogleTranslator to use google-cloud-translate Java API

2018-12-06 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created TIKA-2796: -- Summary: Update GoogleTranslator to use google-cloud-translate Java API Key: TIKA-2796 URL: https://issues.apache.org/jira/browse/TIKA-2796 Project: Tika

Re: Tika master branch not building

2020-04-07 Thread Lewis John McGibbney
I suspected this was the case folks :) I actually really like this idea. I'll take the action item to address this seeing as I pulled it up... seeing as I am also working on tika-server right now I'll also take the action item to address the vulnerable CXF deps. Thanks, Lewis On 2020/04/06

[jira] [Commented] (TIKA-3082) Consider adding an OpenAPI for tika-server

2020-04-01 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3082?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17072998#comment-17072998 ] Lewis John McGibbney commented on TIKA-3082: [~grossws] absolutely. # Firstly, OpenAPI

[jira] [Commented] (TIKA-3082) Consider adding an OpenAPI for tika-server

2020-04-01 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3082?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17073000#comment-17073000 ] Lewis John McGibbney commented on TIKA-3082: I'm going to start working on an OpenAPI and I

[jira] [Assigned] (TIKA-3082) Consider adding an OpenAPI for tika-server

2020-04-01 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3082?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned TIKA-3082: -- Assignee: Lewis John McGibbney > Consider adding an OpenAPI for tika-ser

Re: Tika master branch not building

2020-04-06 Thread lewis john mcgibbney
I'm also seeing a depreciation notice for the ossindex-maven-plugin as well https://github.com/OSSIndex/ossindex-maven-plugin#deprecated-please-upgrade-to-ossindex-maven Any info please folks? Thanks On Sun, Apr 5, 2020 at 11:14 PM lewis john mcgibbney wrote: > Hi dev@, > Working on TIK

Tika master branch not building

2020-04-06 Thread lewis john mcgibbney
Hi dev@, Working on TIKA-3082, I just tried to build master branch Downgrading my Java version to 1.8 java -version java version "1.8.0_221" Java(TM) SE Runtime Environment (build 1.8.0_221-b11) Java HotSpot(TM) 64-Bit Server VM (build 25.221-b11, mixed mode) [INFO] ---

[jira] [Updated] (TIKA-3082) OpenAPI for tika-server

2020-04-03 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3082?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated TIKA-3082: --- Summary: OpenAPI for tika-server (was: Consider adding an OpenAPI for tika-server

[jira] [Commented] (TIKA-3093) Enable tika-server to forward parse results to another endpoint

2020-04-24 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17091926#comment-17091926 ] Lewis John McGibbney commented on TIKA-3093: [~tallison] bq. ...will converting tika-server

[jira] [Commented] (TIKA-2253) Obtain new Miredot license key and upgrade plugin version in tika-server

2020-03-16 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/TIKA-2253?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17060561#comment-17060561 ] Lewis John McGibbney commented on TIKA-2253: I was planning on putting together an OpenAPI

[jira] [Commented] (TIKA-3113) Currently Tika is detecting a .aux file as text/html

2020-06-26 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3113?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17146370#comment-17146370 ] Lewis John McGibbney commented on TIKA-3113: After a wee bit of research I understand

[jira] [Commented] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0

2021-01-06 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17260026#comment-17260026 ] Lewis John McGibbney commented on TIKA-3258: # Excellent # 10K sounds adventurous

Re: [PMCs] Ramping up for Google Summer of Code 2021: invitation to participate

2020-11-05 Thread lewis john mcgibbney
Hi dev@, Is anyone interested in co-mentoring https://issues.apache.org/jira/browse/TIKA-94 ? Lewis On Mon, Nov 2, 2020 at 7:52 PM Sally Khudairi wrote: > Hello PMCs --I hope you are all well. > > ASF Community Development (ComDev) oversees our participation in Google > Summer of Code, for

Re: [VOTE] Release Apache Tika 2.0.0-ALPHA Candidate #1

2021-01-20 Thread Lewis John McGibbney
Hi Tim, FWIW here's my review SIGS BOTH LOOK GOOD gpg --verify tika-2.0.0-ALPHA-src.zip.asc tika-2.0.0-ALPHA-src.zip gpg: Signature made Wed Jan 13 15:26:10 2021 PST gpg:using RSA key 184454FAD8697760F3E00D2E4A51A45B944FFD51 gpg: Good signature from "Tim Allison (ASF signing key)

<    1   2   3   4   5   >