Re: [VOTE] Moving SCM to Git

2016-01-13 Thread Julien Nioche
+1 On 2 January 2016 at 04:30, Mattmann, Chris A (3980) < chris.a.mattm...@jpl.nasa.gov> wrote: > Hi Everyone, > > DISCUSS thread here: http://s.apache.org/wVE > > Time to officially VOTE on moving Tika to Git. I’ve made a wiki > page for our SCM explaining how to use Git at Apache, and how to >

Tika questions on StackOverflow

2016-01-13 Thread Nick Burch
Hi All This may be old news for some of you, in which case you can skip the email, but for others... StackOverflow is a programming-focused question and answer site, with excellent google-foo, quite wide use, and growing use. At the moment I'd say there's something like a new Tika question a

Re: [VOTE] Moving SCM to Git

2016-01-13 Thread Konstantin Gribov
Hi. [x] +1 Move the Apache Tika source control to Writeable Git repos at the ASF [ ] +0 Indifferent. [ ] -1 Don’t move the Apache Tika source control to Writeable Git repos at the ASF because.. For me git is more convenient (and I actually use git-svn for svn repos). пн, 11 янв. 2016 г. в

[jira] [Updated] (TIKA-1830) Upgrade to PDFBox 1.8.11 when available

2016-01-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1830?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-1830: -- Issue Type: Improvement (was: Bug) > Upgrade to PDFBox 1.8.11 when available >

[jira] [Updated] (TIKA-1830) Upgrade to PDFBox 1.8.11 when available

2016-01-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1830?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-1830: -- Priority: Minor (was: Major) > Upgrade to PDFBox 1.8.11 when available >

[jira] [Updated] (TIKA-1830) Upgrade to PDFBox 1.8.11 when available

2016-01-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1830?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-1830: -- Attachment: reports_pdfbox_1_8_11-rc1.zip Reports on 1.8.11-rc1 Caveats: # I haven't reviewed these

[jira] [Commented] (TIKA-1830) Upgrade to PDFBox 1.8.11 when available

2016-01-13 Thread Tilman Hausherr (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1830?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15096866#comment-15096866 ] Tilman Hausherr commented on TIKA-1830: --- I can't reproduce the difference for the file 074531.pdf.

[jira] [Commented] (TIKA-1436) improvement to PDFParser

2016-01-13 Thread Stefano Fornari (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15097245#comment-15097245 ] Stefano Fornari commented on TIKA-1436: --- Thanks for the feedback Tim. I'll work the trunk code and

[jira] [Comment Edited] (TIKA-1830) Upgrade to PDFBox 1.8.11 when available

2016-01-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1830?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15096229#comment-15096229 ] Tim Allison edited comment on TIKA-1830 at 1/13/16 3:40 PM: Reports on

[jira] [Updated] (TIKA-1829) org.apache.tika.parser.ocr.TesseractOCRParser.getSupportedTypes(TesseractOCRParser.java:92) NPE

2016-01-13 Thread frank (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1829?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] frank updated TIKA-1829: Attachment: TesseractOCRParser.java Patch File Updated >

[jira] [Updated] (TIKA-1829) org.apache.tika.parser.ocr.TesseractOCRParser.getSupportedTypes(TesseractOCRParser.java:92) NPE

2016-01-13 Thread frank (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1829?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] frank updated TIKA-1829: Attachment: (was: TesseractOCRParser.java) >

[jira] [Commented] (TIKA-1830) Upgrade to PDFBox 1.8.11 when available

2016-01-13 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1830?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15096384#comment-15096384 ] Uwe Schindler commented on TIKA-1830: - It would be good to update to 1.8.11 as soon as it is out,

Re: Tika questions on StackOverflow

2016-01-13 Thread Mattmann, Chris A (3980)
Great post Nick ++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 168-519, Mailstop: 168-527 Email:

RE: Tika questions on StackOverflow

2016-01-13 Thread Allison, Timothy B.
Y, thank you, Nick! I've been monitoring the Solr user's list when I have time. Are there other consumer lists we should be following? Elastic Search? -Original Message- From: Mattmann, Chris A (3980) [mailto:chris.a.mattm...@jpl.nasa.gov] Sent: Wednesday, January 13, 2016 9:53 AM To:

[jira] [Commented] (TIKA-1830) Upgrade to PDFBox 1.8.11 when available

2016-01-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1830?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15096506#comment-15096506 ] Tim Allison commented on TIKA-1830: --- [~thetaphi], good to know. Thank you! Speaking of integration with

[jira] [Updated] (TIKA-1830) Upgrade to PDFBox 1.8.11 when available

2016-01-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1830?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-1830: -- Priority: Major (was: Minor) > Upgrade to PDFBox 1.8.11 when available >

[jira] [Commented] (TIKA-1830) Upgrade to PDFBox 1.8.11 when available

2016-01-13 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1830?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15096663#comment-15096663 ] Uwe Schindler commented on TIKA-1830: - bq. Speaking of integration with Solr, would you have a

[jira] [Commented] (TIKA-1824) Tika 2.0 - Create Initial Parser Modules

2016-01-13 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15096668#comment-15096668 ] Uwe Schindler commented on TIKA-1824: - Hi, as invited on TIKA-1830, here some comments from Apache