Tika-Python: parsing PDFs and showing analytics

2016-06-30 Thread Mattmann, Chris A (3980)
Great Blog post by Clinton Brownley today: If you haven’t had a chance to check out tika-python [1], I recommend doing so! Would also appreciate any feedback or stars! Cheers, Chris [1] http://github.com/chrisma

[jira] [Commented] (TIKA-1978) Invocation of java.net.URL.equals(Object), which blocks to do domain name resolution, in org.apache.tika.parser.geo.topic.GeoParser.initialize(URL)

2016-06-30 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1978?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15357807#comment-15357807 ] Hudson commented on TIKA-1978: -- SUCCESS: Integrated in tika-2.x #120 (See [https://builds.apa

[jira] [Commented] (TIKA-1978) Invocation of java.net.URL.equals(Object), which blocks to do domain name resolution, in org.apache.tika.parser.geo.topic.GeoParser.initialize(URL)

2016-06-30 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1978?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1535#comment-1535 ] Hudson commented on TIKA-1978: -- FAILURE: Integrated in tika-2.x-windows #24 (See [https://bui

tika-2.x-windows - Build # 24 - Still Failing

2016-06-30 Thread Apache Jenkins Server
The Apache Jenkins build system has built tika-2.x-windows (build #24) Status: Still Failing Check console output at https://builds.apache.org/job/tika-2.x-windows/24/ to view the results.

[jira] [Commented] (TIKA-1978) Invocation of java.net.URL.equals(Object), which blocks to do domain name resolution, in org.apache.tika.parser.geo.topic.GeoParser.initialize(URL)

2016-06-30 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1978?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15357710#comment-15357710 ] ASF GitHub Bot commented on TIKA-1978: -- Github user asfgit closed the pull request at:

[GitHub] tika pull request #124: TIKA-1978 Invocation of java.net.URL.equals(Object),...

2016-06-30 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/tika/pull/124 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enable

[jira] [Commented] (TIKA-2025) Extraction of long sequences of digits from Excel spreadsheets using Tika 1.13 doesn’t yield the expected results

2016-06-30 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15357188#comment-15357188 ] Tim Allison commented on TIKA-2025: --- Y, I'm wondering about a narrower fix...perhaps go b

[jira] [Commented] (TIKA-2025) Extraction of long sequences of digits from Excel spreadsheets using Tika 1.13 doesn’t yield the expected results

2016-06-30 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15357125#comment-15357125 ] Nick Burch commented on TIKA-2025: -- We could always test the formatted value for {{E+}} (o

[jira] [Comment Edited] (TIKA-2018) Attempt to get Title from Full text if not present in MetaData ( Application/Pdf )

2016-06-30 Thread Joeran (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15356792#comment-15356792 ] Joeran edited comment on TIKA-2018 at 6/30/16 9:16 AM: --- hey there, i

[jira] [Commented] (TIKA-2018) Attempt to get Title from Full text if not present in MetaData ( Application/Pdf )

2016-06-30 Thread Joeran (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15356792#comment-15356792 ] Joeran commented on TIKA-2018: -- hey there, i am one of the creators of "Docear's PDF Inspecto