[jira] [Commented] (TIKA-2342) Broken words

2017-05-02 Thread Nino Skopac (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2342?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15992739#comment-15992739 ] Nino Skopac commented on TIKA-2342: --- I've traced it down to a PDFBox issue: https://issues

[jira] [Closed] (TIKA-2344) Cannot sub to mailing list

2017-05-02 Thread Nino Skopac (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2344?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nino Skopac closed TIKA-2344. - Resolution: Invalid > Cannot sub to mailing list > -- > > Key: TIKA

[jira] [Commented] (TIKA-2344) Cannot sub to mailing list

2017-05-02 Thread Nino Skopac (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15992747#comment-15992747 ] Nino Skopac commented on TIKA-2344: --- Hey Konstantin, it was my error - I was sending an e

[jira] [Closed] (TIKA-2342) Broken words

2017-05-02 Thread Nino Skopac (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2342?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nino Skopac closed TIKA-2342. - Resolution: Information Provided > Broken words > > > Key: TIKA-2342 >

RE: Tika 1.15

2017-05-02 Thread Allison, Timothy B.
Y. It is daunting at this point, and please do help! The key sheets I look at: exceptions/exceptions_compared_by_mime_type.xlsx exceptions/new_exceptions_in_B_by_mime.xlsx mimes/mime_diffs_A_to_B.xlsx attachments/attachment_diffs.xlsx metadata/metadata_value_count_diffs.xlsx I can dump json,

RE: Tika 1.15

2017-05-02 Thread Allison, Timothy B.
The other two critical files: Content/common_token_comparisons_by_mime.xlsx Content/content_diffs_ignore_exceptions.xlsx Oh, and the key part, which is less than ideal, is that there has to be a human in the loop...which makes the need for visualizations even more critical. For example: 1) We

[jira] [Created] (TIKA-2351) Getting error while parsing documents

2017-05-02 Thread VENU (JIRA)
VENU created TIKA-2351: -- Summary: Getting error while parsing documents Key: TIKA-2351 URL: https://issues.apache.org/jira/browse/TIKA-2351 Project: Tika Issue Type: Bug Components: general

[jira] [Commented] (TIKA-2351) Getting error while parsing documents

2017-05-02 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2351?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15992821#comment-15992821 ] Nick Burch commented on TIKA-2351: -- Can you attach the failing document? If not, could y

[jira] [Updated] (TIKA-2351) Getting error while parsing documents

2017-05-02 Thread VENU (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2351?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] VENU updated TIKA-2351: --- Attachment: 03 - Json_creat_code.txt 02 - Pipeline.txt 01 - Templete.txt

[jira] [Updated] (TIKA-2351) Getting error while parsing documents

2017-05-02 Thread VENU (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2351?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] VENU updated TIKA-2351: --- Attachment: 04 - stackTrace.txt > Getting error while parsing documents > - > >

[jira] [Commented] (TIKA-2351) Getting error while parsing documents

2017-05-02 Thread VENU (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2351?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15992842#comment-15992842 ] VENU commented on TIKA-2351: Attached document, stack trace, template, pipeline and JSON file gen

[jira] [Commented] (TIKA-2351) Getting error while parsing documents

2017-05-02 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2351?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15992845#comment-15992845 ] Nick Burch commented on TIKA-2351: -- I've just tried with a recent nightly build, and no er

[jira] [Commented] (TIKA-2351) Getting error while parsing documents

2017-05-02 Thread VENU (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2351?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15992852#comment-15992852 ] VENU commented on TIKA-2351: Is it possible to provide us with the code that you executed to

[jira] [Created] (TIKA-2352) Incorrect EOF error in WordPerfect parser

2017-05-02 Thread Tim Allison (JIRA)
Tim Allison created TIKA-2352: - Summary: Incorrect EOF error in WordPerfect parser Key: TIKA-2352 URL: https://issues.apache.org/jira/browse/TIKA-2352 Project: Tika Issue Type: Bug Re

[jira] [Updated] (TIKA-2352) Incorrect EOF exception in WordPerfect parser

2017-05-02 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2352?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-2352: -- Summary: Incorrect EOF exception in WordPerfect parser (was: Incorrect EOF error in WordPerfect parser)

[jira] [Updated] (TIKA-2352) Incorrect EOF exception in WordPerfect parser

2017-05-02 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2352?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-2352: -- Attachment: 462321.wp Triggering file. I think something is going wrong with {{skipUntilChar}}. I thi

[jira] [Commented] (TIKA-2351) Getting error while parsing documents

2017-05-02 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2351?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15992896#comment-15992896 ] Nick Burch commented on TIKA-2351: -- Just {{java -jar tika-app-1.15-snapshot.jar --text pro

[jira] [Commented] (TIKA-2322) Video labeling using existing ObjectRecognition

2017-05-02 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15992931#comment-15992931 ] Chris A. Mattmann commented on TIKA-2322: - [~ThejanWijesinghe] what is your wiki us

[jira] [Resolved] (TIKA-2322) Video labeling using existing ObjectRecognition

2017-05-02 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved TIKA-2322. - Resolution: Fixed PR #168 closed. Thanks [~msha...@usc.edu], [~ThejanWijesinghe] and [~tgow

Re: Tika 1.15

2017-05-02 Thread Chris Mattmann
Team, check out Polar Insights, which my USC IRDS student Nithin did: http://polar.usc.edu/html/polar-deep-insights/index.html#/config Click Download, then Download (the 2 download buttons), then Save, then click the Query Interface. Something like this? All code is OSS on http://github.com/USCD

[jira] [Commented] (TIKA-2351) Getting error while parsing documents

2017-05-02 Thread VENU (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2351?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15992960#comment-15992960 ] VENU commented on TIKA-2351: Dear Nick, Thank you very much for the info. Can you let me know,

[jira] [Commented] (TIKA-2322) Video labeling using existing ObjectRecognition

2017-05-02 Thread Thejan Wijesinghe (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15992998#comment-15992998 ] Thejan Wijesinghe commented on TIKA-2322: - Will do, thanks. [~chrismattmann], it's

[jira] [Commented] (TIKA-2322) Video labeling using existing ObjectRecognition

2017-05-02 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15993110#comment-15993110 ] Chris A. Mattmann commented on TIKA-2322: - permission granted [~ThejanWijesinghe]

[jira] [Commented] (TIKA-2322) Video labeling using existing ObjectRecognition

2017-05-02 Thread Thejan Wijesinghe (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15993236#comment-15993236 ] Thejan Wijesinghe commented on TIKA-2322: - I can now edit the Wiki. Thank you [~chr

[jira] [Commented] (TIKA-2322) Video labeling using existing ObjectRecognition

2017-05-02 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15993447#comment-15993447 ] ASF GitHub Bot commented on TIKA-2322: -- smadha commented on issue #168: fix for TIKA-2

[jira] [Commented] (TIKA-2322) Video labeling using existing ObjectRecognition

2017-05-02 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15993449#comment-15993449 ] ASF GitHub Bot commented on TIKA-2322: -- smadha commented on issue #168: fix for TIKA-2

[jira] [Commented] (TIKA-2322) Video labeling using existing ObjectRecognition

2017-05-02 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15993453#comment-15993453 ] ASF GitHub Bot commented on TIKA-2322: -- chrismattmann commented on issue #168: fix for

[jira] [Commented] (TIKA-2322) Video labeling using existing ObjectRecognition

2017-05-02 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15993466#comment-15993466 ] ASF GitHub Bot commented on TIKA-2322: -- smadha commented on issue #168: fix for TIKA-2

[jira] [Commented] (TIKA-2322) Video labeling using existing ObjectRecognition

2017-05-02 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15993468#comment-15993468 ] ASF GitHub Bot commented on TIKA-2322: -- smadha commented on issue #168: fix for TIKA-2

[jira] [Commented] (TIKA-2322) Video labeling using existing ObjectRecognition

2017-05-02 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15993470#comment-15993470 ] ASF GitHub Bot commented on TIKA-2322: -- smadha commented on issue #168: fix for TIKA-2

[jira] [Commented] (TIKA-2352) Incorrect EOF exception in WordPerfect parser

2017-05-02 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15993599#comment-15993599 ] Tim Allison commented on TIKA-2352: --- {noformat}2FF0 0A 0C D0 08 0A 00 00

[jira] [Comment Edited] (TIKA-2352) Incorrect EOF exception in WordPerfect parser

2017-05-02 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15993599#comment-15993599 ] Tim Allison edited comment on TIKA-2352 at 5/2/17 7:51 PM: --- {nofo

[jira] [Commented] (TIKA-2016) A parser that combines Apache OpenNLP and Apache Tika and provides facilities for automatically deriving sentiment from text.

2017-05-02 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15993633#comment-15993633 ] ASF GitHub Bot commented on TIKA-2016: -- chrismattmann commented on issue #169: TIKA-20

Re: Tika 1.15

2017-05-02 Thread Tyler Bui-Palsulich
Thanks for the link. It looks like the UI is written with Angular and uses Elastic + static JSON. See https://github.com/USCDataScience/polar-deep-insights/wiki/Architecture. I also like d3. In general, I think we are on the same page: the best option is a web-based UI. I see a few options to get