[jira] [Commented] (TIKA-1334) Add presentation layer for results of each run

2017-05-03 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15994941#comment-15994941 ] Tyler Palsulich commented on TIKA-1334: --- The format should probably be in the

Re: Squashing GitHub pull requests while merging

2016-05-07 Thread Tyler Palsulich
A contributor should be able to squash the commits in the pull request before we merge into Tika. So, we don't need to mess up Tika's history. Right? Tyler On May 6, 2016 8:41 PM, "Mattmann, Chris A (3980)" < chris.a.mattm...@jpl.nasa.gov> wrote: > Squashing messes up history and atm requires

Re: JIRA issue?

2016-04-21 Thread Tyler Palsulich
Hi Ben, Sorry for the inconvenience. The infrastructure team had to disable the create and comment features of JIRA for many projects to mitigate spam. Hopefully everything will be back up and running again soon. Thanks for emailing. Tyler Hi, I'd like to create an issue on the JIRA. When I vis

Re: [VOTE] Apache Tika 1.12 Release Candidate #1

2016-02-13 Thread Tyler Palsulich
A bit late to the party, but +1 from me. Tyler On Thu, Feb 4, 2016 at 1:44 PM, Lewis John Mcgibbney < lewis.mcgibb...@gmail.com> wrote: > Hi Chris, > +1 to release this release candidate > Thanks > Lewis > > On Tue, Feb 2, 2016 at 4:24 PM, Lewis John Mcgibbney < > lewis.mcgibb...@gmail.com> wrot

Re: [VOTE] Moving SCM to Git

2016-01-02 Thread Tyler Palsulich
r SCM explaining how to use Git at Apache, and how to > >>> use it with Github, and how to use it even in a traditional SVN > >>> sense. The page is here: > >>> > >>> https://wiki.apache.org/tika/UsingGit > >>> > >>> > >>

RE: NER Parser tests behind proxy?

2015-11-23 Thread Tyler Palsulich
Apologies if I missed a discussion about this earlier, but should we be downloading a model by default? Tyler On Nov 23, 2015 8:03 AM, "Allison, Timothy B." wrote: > The problem comes down to: ModelGetter.groovy which is trying to grab: > ${basedir}/src/test/resources/org/apache/tika/parser/ner/

Re: Named Entity Recognition support in trunk

2015-11-18 Thread Tyler Palsulich
That's awesome! Great work. Have we tried running any benchmarks? Tyler On Nov 18, 2015 6:42 AM, "Mattmann, Chris A (3980)" < chris.a.mattm...@jpl.nasa.gov> wrote: > Hey Folks, > > With the commit of TIKA-1787/GH-61 in trunk we now have full integration > of Named Entity Recognition with Stanfor

Re: [DISCUSS] Moving to Git

2015-11-18 Thread Tyler Palsulich
+1 from me. Tyler On Nov 18, 2015 6:46 AM, "Mattmann, Chris A (3980)" < chris.a.mattm...@jpl.nasa.gov> wrote: > Hey Team, > > I propose we move to writeable git repos for Tika for our repository. > I mostly interact with Git & Github nowadays even with Tika using the > mirroring and PR interactio

Re: [VOTE] Apache Tika 1.11 Release Candidate #1

2015-10-22 Thread Tyler Palsulich
+1 from me -- builds, tests pass, sanity check files parse, and sums look good. But, I get a warning that the signature is not certified with a trusted signature. Tyler On Wed, Oct 21, 2015 at 6:43 AM Allison, Timothy B. wrote: > +0 (some regressions in ppt content) > > I just finished the batc

Re: Tika Tesseract configuration

2015-10-17 Thread Tyler Palsulich
Hi Aditya, The wiki (https://wiki.apache.org/tika/TikaOCR) also had some good information about setting up and configuring Tesseract. Let me know if you have any questions. Thanks, Tyler On Wed, Oct 14, 2015, 6:59 AM Aditya Dhulipala wrote: > Hi Tika devs, > > Scratch that previous email. > >

Re: svn commit: r1706077 - /tika/trunk/tika-parsers/src/test/java/org/apache/tika/parser/gdal/TestGDALParser.java

2015-10-01 Thread Tyler Palsulich
+++ > Adjunct Associate Professor, Computer Science Department > University of Southern California, Los Angeles, CA 90089 USA > ++++++ > > > > > > -Original Message- > From: Tyler Palsulich > Reply

Re: svn commit: r1706077 - /tika/trunk/tika-parsers/src/test/java/org/apache/tika/parser/gdal/TestGDALParser.java

2015-10-01 Thread Tyler Palsulich
Hi Chris, It looks like these two lines are equivalent (assert not null versus assert true not null). Right? Tyler On Wed, Sep 30, 2015, 9:45 AM wrote: > Author: mattmann > Date: Wed Sep 30 16:45:32 2015 > New Revision: 1706077 > > URL: http://svn.apache.org/viewvc?rev=1706077&view=rev > Log:

[jira] [Commented] (TIKA-1743) NetworkParser can create Unbounded Number of Threads

2015-09-22 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1743?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14903878#comment-14903878 ] Tyler Palsulich commented on TIKA-1743: --- [Copied from the list] This sounds li

Re: [jira] [Created] (TIKA-1743) NetworkParser can create Unbounded Number of Threads

2015-09-22 Thread Tyler Palsulich
This sounds like a great idea! We should make the size of the pool configurable with TikaConfig. On Tue, Sep 22, 2015, 3:04 PM Bob Paulin (JIRA) wrote: > Bob Paulin created TIKA-1743: > > > Summary: NetworkParser can create Unbounded Number of Thread
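The bounded-pool idea suggested above can be sketched as follows. This is only an illustration using the standard java.util.concurrent API, not the NetworkParser code or a TikaConfig setting: the class name, the method name, and the poolSize parameter are hypothetical stand-ins for whatever value a configuration would supply.

```java
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical sketch: a fixed-size pool caps the number of worker threads,
// no matter how many tasks are submitted.
public class BoundedPoolSketch {

    // Runs `tasks` short jobs on a pool capped at `poolSize` threads and
    // returns the highest number of jobs observed running at the same time.
    static int peakConcurrency(int poolSize, int tasks) {
        ExecutorService pool = Executors.newFixedThreadPool(poolSize);
        AtomicInteger running = new AtomicInteger();
        AtomicInteger peak = new AtomicInteger();
        for (int i = 0; i < tasks; i++) {
            pool.execute(() -> {
                int now = running.incrementAndGet();
                peak.accumulateAndGet(now, Math::max);
                try {
                    Thread.sleep(5); // stand-in for real work
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                } finally {
                    running.decrementAndGet();
                }
            });
        }
        pool.shutdown();
        try {
            pool.awaitTermination(10, TimeUnit.SECONDS);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("interrupted while waiting for pool", e);
        }
        return peak.get();
    }

    public static void main(String[] args) {
        // poolSize = 4 is an arbitrary example value.
        System.out.println("peak concurrency: " + peakConcurrency(4, 50));
    }
}
```

With a fixed pool, extra tasks wait in the queue instead of each getting a new thread, so the observed peak never exceeds the configured size.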

Re: Sorry for excessive tika-example update

2015-09-22 Thread Tyler Palsulich
Sounds great! Thank you for taking the initiative on this. Tyler On Mon, Sep 21, 2015, 10:04 AM Konstantin Gribov wrote: > I was about to add simple example of interruptable parsing with Tika and > found that tika-example module have inconsistent formatting (tabs and 4 > spaces), ALv2 license h

Re: [ANNOUNCE] Welcome Bob Paulin as Tika Committer + PMC Member

2015-09-16 Thread Tyler Palsulich
Welcome! On Wed, Sep 16, 2015, 6:37 PM Allison, Timothy B. wrote: > Welcome! Great to have you on board! > > Cheers, > > Tim > > -Original Message- > From: Bob Paulin [mailto:b...@bobpaulin.com] > Sent: Wednesday, September 16, 2015 9:16 PM > To: dev@tika.apache.org > Subjec

[jira] [Commented] (TIKA-1672) Integrate tika-java7 component

2015-08-30 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14722705#comment-14722705 ] Tyler Palsulich commented on TIKA-1672: --- Hmm. Maybe we should rename the mo

Re: [ANNOUNCE] Apache Tika 1.10 release

2015-08-08 Thread Tyler Palsulich
Thanks, Dave! On Sat, Aug 8, 2015, 7:01 AM David Meikle wrote: > The Apache Tika project is pleased to announce the release of Apache Tika > 1.10. The release contents have been pushed out to the main Apache release > site and to the Central sync, so the releases should be available as soon > as

Re: [VOTE] Apache Tika 1.10 Release Candidate #1

2015-08-04 Thread Tyler Palsulich
Everything looks good to me! +1 Thanks, Dave! Tyler On Tue, Aug 4, 2015, 6:48 AM Ken Krugler wrote: > +1 > > Built on Mac, tested with Bixo. > > -- Ken > > > From: David Meikle > > Sent: August 2, 2015 12:15:24am PDT > > To: dev@tika.apache.org; u...@tika.apache.org > > Subject: [VOTE] Apache

[jira] [Commented] (TIKA-1362) Add GoogleTranslate implementation of Translation API

2015-07-10 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14623246#comment-14623246 ] Tyler Palsulich commented on TIKA-1362: --- If you have a pressing need for be

[jira] [Created] (TIKA-1672) Integrate tika-java7 component

2015-07-02 Thread Tyler Palsulich (JIRA)
Tyler Palsulich created TIKA-1672: - Summary: Integrate tika-java7 component Key: TIKA-1672 URL: https://issues.apache.org/jira/browse/TIKA-1672 Project: Tika Issue Type: Improvement

[jira] [Resolved] (TIKA-1536) Upgrade compiler definition in pom's to Java 7

2015-07-02 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich resolved TIKA-1536. --- Resolution: Fixed Upgraded in r1688779. Thanks, all. Will open a new issue regarding

[jira] [Commented] (TIKA-1536) Upgrade compiler definition in pom's to Java 7

2015-06-29 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605772#comment-14605772 ] Tyler Palsulich commented on TIKA-1536: --- Yep, see http://apache.markmail.org/th

[jira] [Closed] (TIKA-1439) PDF embeded with document can not parse.

2015-06-29 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich closed TIKA-1439. - Resolution: Duplicate > PDF embeded with document can not pa

[jira] [Closed] (TIKA-1452) parser.parse() throws exception after which the procesed file is not getting renamed/moved/deleted

2015-06-29 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich closed TIKA-1452. - Resolution: Not A Problem I'm closing this as not a problem. But, please feel free to reop

[jira] [Closed] (TIKA-1481) TikaJAXRS get metadata calls give different results

2015-06-29 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich closed TIKA-1481. - Resolution: Not A Problem Hi [~arbuzovada]. Sorry for the trouble! Did you make sure to respond to

[jira] [Commented] (TIKA-1536) Upgrade compiler definition in pom's to Java 7

2015-06-29 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605300#comment-14605300 ] Tyler Palsulich commented on TIKA-1536: --- Now that 1.9 is released, are there

[jira] [Closed] (TIKA-1552) Pdf document parser

2015-06-29 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich closed TIKA-1552. - Resolution: Not A Problem Marking this as not a problem, since Adobe Reader also adds white space

[jira] [Commented] (TIKA-1493) Update for JAXRS page with details on passing password

2015-06-29 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605292#comment-14605292 ] Tyler Palsulich commented on TIKA-1493: --- Can someone familiar with the lates

[jira] [Closed] (TIKA-1429) Unable to View a 9mb file even after setting a large Heap Size of 3GB while TIKA GUI

2015-06-29 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1429?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich closed TIKA-1429. - Resolution: Not A Problem Closing this as not a problem. The file needs to be kept in memory for

[jira] [Updated] (TIKA-1233) PDFBox can throw StringIndexOutOfBoundsException on some dates

2015-06-29 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich updated TIKA-1233: -- Fix Version/s: (was: 1.6) 1.10 > PDFBox can th

[jira] [Updated] (TIKA-1022) DWG Custom properties not extracted

2015-06-29 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1022?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich updated TIKA-1022: -- Fix Version/s: (was: 1.3) > DWG Custom properties not extrac

[jira] [Resolved] (TIKA-756) XMP output from Tika CLI

2015-06-29 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich resolved TIKA-756. -- Resolution: Fixed Marking this as Fixed, since there are a few more references to tika-parser

[jira] [Resolved] (TIKA-1585) Create Example Website with Form Submission

2015-06-29 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich resolved TIKA-1585. --- Resolution: Fixed Good idea, [~lewismc]. I added it to http://people.apache.org/~tpalsulich

Re: Troubleshooting guide

2015-06-24 Thread Tyler Palsulich
Looks good! Thanks, Nick. Tyler On Wed, Jun 24, 2015 at 2:42 PM Nick Burch wrote: > Hi All > > I've had a go at writing up a troubleshooting guide on the wiki, hopefully > covering the main problems people face (content detected wrong, parser > missing etc). It's linked from the front page and

Re: Configuring parsers and translators

2015-06-13 Thread Tyler Palsulich
++ > Adjunct Associate Professor, Computer Science Department > University of Southern California, Los Angeles, CA 90089 USA > ++ > > > > >

[jira] [Closed] (TIKA-1199) Tika extracts weird signs instead of text

2015-06-09 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich closed TIKA-1199. - Resolution: Not A Problem > Tika extracts weird signs instead of t

[jira] [Resolved] (TIKA-1630) Mention APK support in List of Supported Formats

2015-06-09 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich resolved TIKA-1630. --- Resolution: Fixed Fix Version/s: 1.9 Assignee: Tyler Palsulich Bolded the

Re: [VOTE] Release Apache Tika 1.9 Candidate #2

2015-06-09 Thread Tyler Palsulich
+1 from me. Thanks for running this, Chris! Tyler On Mon, Jun 8, 2015 at 1:11 PM Allison, Timothy B. wrote: > +1 > > Built in Windows and Linux. Works on problems (that I caused!) in rc1. > > Let's make sure to include "last Java 1.6" version in the release notes, > if that's what we've decid

Re: Configuring parsers and translators

2015-06-06 Thread Tyler Palsulich
nt > University of Southern California, Los Angeles, CA 90089 USA > ++++++ > > > > > -Original Message- > From: Tyler Palsulich > Reply-To: "dev@tika.apache.org" > Date: Saturday, June 6, 2015 at 3:45 PM > To: "dev@tika.apache.or

[jira] [Commented] (TIKA-1652) Tika Server should allow config file override from the command line like Tika App

2015-06-06 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1652?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14575986#comment-14575986 ] Tyler Palsulich commented on TIKA-1652: --- I think this is a duplicate of TIKA-

Re: Configuring parsers and translators

2015-06-06 Thread Tyler Palsulich
Hi Nick, I've been mulling this over since you sent the first message. But, I'm afraid I don't have a good solution or developed ideas. I agree, it would be very nice to consolidate all configuration for all parsers in the server and app. Is it feasible to put everything into tika-config? Then P

Re: [DISCUSS] Thinking about completely refactoring the ExternalParser and using commons-exec

2015-05-25 Thread Tyler Palsulich
On Mon, May 25, 2015 at 4:05 PM, Nick Burch wrote: > On Mon, 25 May 2015, Mattmann, Chris A (3980) wrote: > >> ExternalParser is way broke. I have some patches that somewhat fix it, >> but in doing so, I realized, why not just use commons-exec? I realize that >> this is another dependency into co

[jira] [Resolved] (TIKA-1619) SHA1 and MD5 verification hashes for v1.8 still show old v1.7 hashes

2015-05-20 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1619?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich resolved TIKA-1619. --- Resolution: Fixed Fix Version/s: (was: 1.9) Assignee: Tyler Palsulich

[jira] [Commented] (TIKA-1624) Syntax error in DOAP file release section

2015-05-20 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1624?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553281#comment-14553281 ] Tyler Palsulich commented on TIKA-1624: --- Thanks, Ken. I published the file a

[jira] [Commented] (TIKA-1630) Mention APK support in List of Supported Formats

2015-05-20 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553272#comment-14553272 ] Tyler Palsulich commented on TIKA-1630: --- That is a very good point. There

Re: Any reason we removed the links to other downstream Tika APIs off the main web site?

2015-05-20 Thread Tyler Palsulich
on. I believe the versioning issues from the other thread have been resolved. Tyler On Wed, May 20, 2015 at 5:54 PM, Tyler Palsulich wrote: > Hi Chris, > > I may have botched the version of the index on the site (see the other > thread with Nick's comments.) I'll investigate m

Re: Any reason we removed the links to other downstream Tika APIs off the main web site?

2015-05-20 Thread Tyler Palsulich
Hi Chris, I may have botched the version of the index on the site (see the other thread with Nick's comments.) I'll investigate more tonight or tomorrow, if you don't beat me to it. Tyler On May 20, 2015 4:39 PM, "Mattmann, Chris A (3980)" < chris.a.mattm...@jpl.nasa.gov> wrote: > Hey Folks, > >

[jira] [Commented] (TIKA-1624) Syntax error in DOAP file release section

2015-05-14 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1624?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14544150#comment-14544150 ] Tyler Palsulich commented on TIKA-1624: --- [~kkrugler], yes. I just updated

Published Site Changes

2015-05-14 Thread Tyler Palsulich
Hi Everyone, I was about to update the site for TIKA-1619 (checksums wrong on the site), but found unpublished changes in the site. This is the status after checking out the repo and running `mvn install`: ➜ site svn status M publish/1.7/examples.html M publish/1.8/examples.html M

[jira] [Commented] (TIKA-1630) Mention APK support in List of Supported Formats

2015-05-14 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14544104#comment-14544104 ] Tyler Palsulich commented on TIKA-1630: --- Hi. Thanks for reporting this! Can you

Re: Translation API question

2015-05-05 Thread Tyler Palsulich
Hi Sergey, Unfortunately, not yet. See TIKA-1328. Tyler On Tue, May 5, 2015 at 4:51 PM, Sergey Beryozkin wrote: > Hi All > > Is it possible to submit a document to the Translation API and get the > translated words as a sequence of events ? For example, with a regular Tika > API it is possible

Re: Java 1.6 support for Tika 1.9?

2015-04-27 Thread Tyler Palsulich
I should have included the fact this is the last release planned to support Java 1.6 in the announcement (as we talked about a while back). But, since that has passed, should we just update the announcement on the website, wait another release, or just drop Java 1.6 support when we release 1.9? I

Re: comparing Tika's file detect with other tools?

2015-04-22 Thread Tyler Palsulich
Hi Tim, I do not know whether there would be licensing concerns. But, we do have TIKA-289 to track merging magic bytes from `file` into Tika. Tyler On Wed, Apr 22, 2015 at 10:40 AM, Ken Krugler wrote: > Hi Tim, > > I don't believe there's any issue with comparing results. > > If you were look

[jira] [Commented] (TIKA-1585) Create Example Website with Form Submission

2015-04-22 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14507259#comment-14507259 ] Tyler Palsulich commented on TIKA-1585: --- Is there an Apache hosted location

Re: NUTCH-1994 and UCAR Dependencies

2015-04-21 Thread Tyler Palsulich
Hi Lewis, I also tried upgrading Tika in Nutch. But, ran into the same issue (but, udunits is found, as expected): [ivy:retrieve] :: [ivy:retrieve] :: UNRESOLVED DEPENDENCIES :: [ivy:retrieve] :::

[jira] [Commented] (TIKA-1607) Introduce new HashMap data structure for persitsence of Tika Metadata

2015-04-20 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14503778#comment-14503778 ] Tyler Palsulich commented on TIKA-1607: --- Good idea! What if you created a subc

[ANNOUNCE] Apache Tika 1.8 Released

2015-04-20 Thread Tyler Palsulich
available on all mirrors. When downloading from a mirror site, please remember to verify the downloads using signatures found on the Apache site: https://people.apache.org/keys/group/tika.asc For more information on Apache Tika, visit the project home page: http://tika.apache.org/ -- Tyler

[RESULT] [VOTE] Apache Tika 1.8 Release Candidate #2

2015-04-20 Thread Tyler Palsulich
Hi Everyone, The VOTE to release Tika 1.8 RC #2 has passed with the following tally: +1: Chris Mattmann Hong-Thai Nguyen Konstantin Gribov Lewis John Mcgibbney Oleg Tikhonov Tim Allison Tyler Palsulich ±0: None -1: None I'll move forward with the release process now. Thank you all for

Re: [VOTE] Apache Tika 1.8 Release Candidate #2

2015-04-20 Thread Tyler Palsulich
Thank you, Everyone! I'll move forward now. Lewis, KEYS are here: https://people.apache.org/keys/group/tika.asc. Of course, I'm also +1. Tyler On Mon, Apr 20, 2015 at 3:47 PM, Lewis John Mcgibbney < lewis.mcgibb...@gmail.com> wrote: > Hi Folks, > > On Thu, Apr 16, 2015 at 2:42 PM, wrote: > >

Re: [VOTE] Apache Tika 1.8 Release Candidate #2

2015-04-19 Thread Tyler Palsulich
ix for > https://issues.apache.org/jira/browse/TIKA-1606? > > It's a simple change (bumping the Guava version), but as seen this can > have unexpected consequences. > > I'm fine either way. > > -- Ken > > > From: Tyler Palsulich > > Sent: April 18, 2015 8:29:22pm PDT

RE: [VOTE] Apache Tika 1.8 Release Candidate #2

2015-04-18 Thread Tyler Palsulich
years ago. So it seems like we should upgrade to at least 11.0.2 But I don't know if this is enough of an issue to require another RC. -- Ken PS - I've created https://issues.apache.org/jira/browse/TIKA-1606 to track this. > From: Tyler Palsulich > Sent: April 13, 20

[jira] [Closed] (TIKA-1266) Tika OSGI Bundle needs Bundle-ClassPath to work in Equinox

2015-04-16 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich closed TIKA-1266. - Resolution: Not A Problem Thanks, [~bobpaulin]! > Tika OSGI Bundle needs Bundle-ClassPath to w

[VOTE] Apache Tika 1.8 Release Candidate #2

2015-04-13 Thread Tyler Palsulich
Hi Folks, A candidate for the Tika 1.8 release is available at: https://dist.apache.org/repos/dist/dev/tika/ The release candidate is a zip archive of the sources in: http://svn.apache.org/repos/asf/tika/tags/1.8-rc2/ The SHA1 checksum of the archive is 5e22fee9079370398472e59082d171ae2d7f
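For readers checking a release candidate, comparing a published SHA1 checksum against a downloaded archive might look like the sketch below. It uses only the standard java.security.MessageDigest API; the class and method names are hypothetical, and the file path and expected checksum are taken from the command line rather than from this message. A checksum match only shows the download is intact; verifying the signature against the project KEYS is a separate step.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

// Hypothetical sketch: compute a SHA-1 digest and compare it with a published value.
public class Sha1CheckSketch {

    // Returns the SHA-1 digest of `data` as a lowercase hex string.
    static String sha1Hex(byte[] data) {
        MessageDigest md;
        try {
            md = MessageDigest.getInstance("SHA-1");
        } catch (NoSuchAlgorithmException e) {
            // SHA-1 is required on every Java platform, so this is unexpected.
            throw new IllegalStateException(e);
        }
        StringBuilder sb = new StringBuilder();
        for (byte b : md.digest(data)) {
            sb.append(String.format("%02x", b));
        }
        return sb.toString();
    }

    // True when the digest of `data` equals `expectedHex`, ignoring case and surrounding whitespace.
    static boolean matches(byte[] data, String expectedHex) {
        return sha1Hex(data).equalsIgnoreCase(expectedHex.trim());
    }

    public static void main(String[] args) throws IOException {
        // Usage: java Sha1CheckSketch <archive> <expected-sha1>
        byte[] archive = Files.readAllBytes(Paths.get(args[0]));
        System.out.println(matches(archive, args[1]) ? "checksum OK" : "checksum MISMATCH");
    }
}
```

Reading the whole archive into memory keeps the sketch short; a streaming update loop would be the better choice for very large files.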

[jira] [Comment Edited] (TIKA-1593) Doco: Broken link to "Parser Quick Start Guide"

2015-04-13 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14492662#comment-14492662 ] Tyler Palsulich edited comment on TIKA-1593 at 4/13/15 5:0

[jira] [Resolved] (TIKA-1593) Doco: Broken link to "Parser Quick Start Guide"

2015-04-13 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich resolved TIKA-1593. --- Resolution: Fixed Assignee: Tyler Palsulich Fixed in r1673240. Thank you [~bhamail

[jira] [Commented] (TIKA-1593) Doco: Broken link to "Parser Quick Start Guide"

2015-04-13 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14492638#comment-14492638 ] Tyler Palsulich commented on TIKA-1593: --- See https://svn.apache.org/repos/asf/

Re: [VOTE] Release Apache Tika 1.8 Candidate #1

2015-04-13 Thread Tyler Palsulich
> I don't think we've solved TIKA-1600, yet, or have we? > > -Original Message- > From: Tyler Palsulich [mailto:tpalsul...@gmail.com] > Sent: Sunday, April 12, 2015 12:12 AM > To: dev@tika.apache.org > Subject: Re: [VOTE] Release Apache Tika 1.8 Candid

[jira] [Updated] (TIKA-1600) Unable to parse ODT files because of failed to close temporary resources

2015-04-13 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich updated TIKA-1600: -- Priority: Blocker (was: Major) > Unable to parse ODT files because of failed to close tempor

[jira] [Resolved] (TIKA-1600) Unable to parse ODT files because of failed to close temporary resources

2015-04-13 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich resolved TIKA-1600. --- Resolution: Fixed Assignee: Hong-Thai Nguyen Thanks, [~thaichat04]! I just updated it

Re: [VOTE] Release Apache Tika 1.8 Candidate #1

2015-04-11 Thread Tyler Palsulich
Are we ready for another RC? I'd like to make sure the above issues are (believed to be) settled before the next cut. Thanks, Tyler On Apr 10, 2015 4:55 PM, "David Meikle" wrote: > > > On 10 Apr 2015, at 11:38, Allison, Timothy B. > wrote: > > > > I agree that the ODT issue might require a res

Re: [VOTE] Release Apache Tika 1.8 Candidate #1

2015-04-07 Thread Tyler Palsulich
CC'ing user@tika for visibility. Tyler On Tue, Apr 7, 2015 at 4:54 PM, Tyler Palsulich wrote: > Hi Folks, > > A candidate for the Tika 1.8 release is available at: > https://dist.apache.org/repos/dist/dev/tika/ > > The release candidate is a zip archive of t

[VOTE] Release Apache Tika 1.8 Candidate #1

2015-04-07 Thread Tyler Palsulich
Hi Folks, A candidate for the Tika 1.8 release is available at: https://dist.apache.org/repos/dist/dev/tika/ The release candidate is a zip archive of the sources in: http://svn.apache.org/repos/asf/tika/tags/1.8-rc1/ The SHA1 checksum of the archive is ddeb3b43ca1c1ef346658a7005434019507e

[jira] [Closed] (TIKA-1592) It seems dbus and x11 server are invoked, and fails for some reason too

2015-04-03 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich closed TIKA-1592. - Resolution: Invalid Closing as Invalid. Feel free to create additional issues if you run into

[jira] [Commented] (TIKA-1592) It seems dbus and x11 server are invoked, and fails for some reason too

2015-04-02 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1592?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14393246#comment-14393246 ] Tyler Palsulich commented on TIKA-1592: --- I tried building ikube on a Mac, but I

[jira] [Comment Edited] (TIKA-1592) It seems dbus and x11 server are invoked, and fails for some reason too

2015-04-02 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1592?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14393184#comment-14393184 ] Tyler Palsulich edited comment on TIKA-1592 at 4/2/15 7:0

[jira] [Commented] (TIKA-1592) It seems dbus and x11 server are invoked, and fails for some reason too

2015-04-02 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1592?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14393184#comment-14393184 ] Tyler Palsulich commented on TIKA-1592: --- Thanks for reporting this, [~michaelc

Re: [jira] [Commented] (TIKA-1330) Add robust tika-batch code

2015-04-01 Thread Tyler Palsulich
All tests are passing. Only issue I see is excessive logging. The Hudson failure does just look like a hiccup. Tyler On Wed, Apr 1, 2015 at 2:55 PM, Allison, Timothy B. wrote: > This looks like a Hudson hiccup. > > Tyler is seeing excessive logging: > Running org.apache.tika.cli.TikaCLIBatchInt

[jira] [Comment Edited] (TIKA-1585) Create Example Website with Form Submission

2015-04-01 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14390841#comment-14390841 ] Tyler Palsulich edited comment on TIKA-1585 at 4/1/15 3:5

[jira] [Commented] (TIKA-1585) Create Example Website with Form Submission

2015-04-01 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14390841#comment-14390841 ] Tyler Palsulich commented on TIKA-1585: --- Done. It works. I'll see if I can

Re: Access Control Allow Origin

2015-04-01 Thread Tyler Palsulich
tion to -C and use "-c" for the tika > config file? > > -----Original Message- > From: Tyler Palsulich [mailto:tpalsul...@gmail.com] > Sent: Wednesday, April 01, 2015 11:13 AM > To: dev@tika.apache.org > Subject: Re: Access Control Allow Origin > > Thank you fo

[jira] [Updated] (TIKA-1426) Let's allow users to specify a tika config file on the commandline for tika-app and tika-server

2015-04-01 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1426?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich updated TIKA-1426: -- Component/s: server cli > Let's allow users to specify a tika config

Re: Access Control Allow Origin

2015-04-01 Thread Tyler Palsulich
roach works too, might be tricky to extend it to support > more properties but great for a start > > Thanks, Sergey > > > > > > On 27/03/15 18:56, Tyler Palsulich wrote: > >> Thank you, Sergey! I didn't know about that feature. I am going to try to >> w

[jira] [Commented] (TIKA-1558) Create a Parser Blacklist

2015-03-31 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1558?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14389470#comment-14389470 ] Tyler Palsulich commented on TIKA-1558: --- Thank you, [~gagravarr]! > Create a

[jira] [Comment Edited] (TIKA-1558) Create a Parser Blacklist

2015-03-31 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1558?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1432#comment-1432 ] Tyler Palsulich edited comment on TIKA-1558 at 3/31/15 9:4

[jira] [Updated] (TIKA-1558) Create a Parser Blacklist

2015-03-31 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1558?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich updated TIKA-1558: -- Description: As talked about in TIKA-1555 and TIKA-1557, it would be nice to be able to disable

Re: [DISCUSS] Tika 1.8 or 1.7.1

2015-03-30 Thread Tyler Palsulich
y last build of Tika app (a few days ago) ballooned to ~43MB, and that's > before I add ~3MB for xerial. Tika server is now ~48MB. As of my last > build, we are still including ~4MB of pdfs (README.NLDAS1.pdf and > README.NLDAS2.pdf) from the GRIB(?) parser in the tika-app and tika-ser

[jira] [Comment Edited] (TIKA-1584) Tika 1.7 possible regression (nested attachment files not getting parsed)

2015-03-30 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1584?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14386906#comment-14386906 ] Tyler Palsulich edited comment on TIKA-1584 at 3/30/15 4:0

[jira] [Commented] (TIKA-1584) Tika 1.7 possible regression (nested attachment files not getting parsed)

2015-03-30 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1584?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14386906#comment-14386906 ] Tyler Palsulich commented on TIKA-1584: --- Yup! The 1.7 release process should s

RE: including refactored docs from govdocs1 in test suite

2015-03-30 Thread Tyler Palsulich
ovdocs1 before or after the RC? > > > > My last build of Tika app (a few days ago) ballooned to ~43MB, and that's > > before I add ~3MB for xerial. Tika server is now ~48MB. As of my last > > build, we are still including ~4MB of pdfs (README.NLDAS1.pdf and > > READM

Re: including refactored docs from govdocs1 in test suite

2015-03-30 Thread Tyler Palsulich
add ~3MB for xerial. Tika server is now ~48MB. As of my last > build, we are still including ~4MB of pdfs (README.NLDAS1.pdf and > README.NLDAS2.pdf) from the GRIB(?) parser in the tika-app and tika-server > jars. > > Best, > > Tim > > > > -Orig

[jira] [Commented] (TIKA-1587) ForkParser::setJavaCommand should take List

2015-03-30 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14386685#comment-14386685 ] Tyler Palsulich commented on TIKA-1587: --- Thank you for reporting this! It seems

[jira] [Resolved] (TIKA-1579) Add file type to NetCDFParser

2015-03-29 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1579?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich resolved TIKA-1579. --- Resolution: Fixed > Add file type to NetCDFPar

[jira] [Resolved] (TIKA-1578) Add file type description to HDFParsers

2015-03-29 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich resolved TIKA-1578. --- Resolution: Fixed > Add file type description to HDFPars

[jira] [Updated] (TIKA-1575) Upgrade to PDFBox 1.8.9 when available

2015-03-29 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich updated TIKA-1575: -- Fix Version/s: 1.8 > Upgrade to PDFBox 1.8.9 when availa

Re: [DISCUSS] Tika 1.8 or 1.7.1

2015-03-29 Thread Tyler Palsulich
Once TIKA-1584 and TIKA-1575 are resolved, I'll work up an RC (unless something else pops up). Thank you everyone. Tyler On Mar 29, 2015 4:43 AM, "Hong-Thai Nguyen" wrote: > +1 for 1.8 > > Hong-Thai > > > On 28 Mar 2015, at 16:01, Tyler Palsulich wrote: > &

Re: [DISCUSS] Tika 1.8 or 1.7.1

2015-03-28 Thread Tyler Palsulich
+ > Adjunct Associate Professor, Computer Science Department > University of Southern California, Los Angeles, CA 90089 USA > ++++++ > > > > > > > -Original Message- > From: T

[jira] [Commented] (TIKA-1584) Tika 1.7 possible regression (nested attachment files not getting parsed)

2015-03-28 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1584?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14385483#comment-14385483 ] Tyler Palsulich commented on TIKA-1584: --- We now have two major issues which ne

[jira] [Commented] (TIKA-1585) Create Example Website with Form Submission

2015-03-28 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14385411#comment-14385411 ] Tyler Palsulich commented on TIKA-1585: --- CORS work is now integrated. [~t

[jira] [Resolved] (TIKA-1586) Enable CORS on Tika Server

2015-03-28 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich resolved TIKA-1586. --- Resolution: Fixed Fixed in r1669799. > Enable CORS on Tika Ser
