David Eric Pugh created TIKA-3497:
-
Summary: Update README for installing Tika Server as a service for
2.0 release
Key: TIKA-3497
URL: https://issues.apache.org/jira/browse/TIKA-3497
Project: Tika
[
https://issues.apache.org/jira/browse/TIKA-3495?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17386334#comment-17386334
]
David Eric Pugh commented on TIKA-3495:
---
Looking at that json file you linked to, nest_parent is of
[
https://issues.apache.org/jira/browse/TIKA-3495?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17386315#comment-17386315
]
David Eric Pugh edited comment on TIKA-3495 at 7/23/21, 3:44 PM:
-
This
[
https://issues.apache.org/jira/browse/TIKA-3495?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17386315#comment-17386315
]
David Eric Pugh commented on TIKA-3495:
---
This area of Solr has been changing a bit. According to
[
https://issues.apache.org/jira/browse/TIKA-1570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17343965#comment-17343965
]
David Eric Pugh commented on TIKA-1570:
---
The associated pr seems reasonable, would be nice to have
[
https://issues.apache.org/jira/browse/TIKA-1570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17343963#comment-17343963
]
David Eric Pugh commented on TIKA-1570:
---
I might suggest trying to go down the docker on windows
[
https://issues.apache.org/jira/browse/TIKA-1570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17343962#comment-17343962
]
David Eric Pugh commented on TIKA-1570:
---
Unfortunately they are Linux only. However I have used
[
https://issues.apache.org/jira/browse/TIKA-3258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17259809#comment-17259809
]
David Eric Pugh commented on TIKA-3258:
---
I'm thinking that this is a pointer towards two general
[
https://issues.apache.org/jira/browse/TIKA-3166?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17181262#comment-17181262
]
David Eric Pugh commented on TIKA-3166:
---
I did a diff, and while I can't say that I read through it
[
https://issues.apache.org/jira/browse/TIKA-3093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17091703#comment-17091703
]
David Eric Pugh commented on TIKA-3093:
---
Out of curiosity, is this type of behavior, the "Let me
[
https://issues.apache.org/jira/browse/TIKA-2368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17076673#comment-17076673
]
David Eric Pugh commented on TIKA-2368:
---
I'm actually not sure I touched {{SentimentParser}}, as
[
https://issues.apache.org/jira/browse/TIKA-2368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17076501#comment-17076501
]
David Eric Pugh commented on TIKA-2368:
---
In [https://github.com/apache/tika/pull/316] I messed with
[
https://issues.apache.org/jira/browse/TIKA-3075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17062619#comment-17062619
]
David Eric Pugh commented on TIKA-3075:
---
Not sure I understand what this issue is about? As in be
[
https://issues.apache.org/jira/browse/TIKA-3035?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17044533#comment-17044533
]
David Eric Pugh commented on TIKA-3035:
---
Tried it with tika-app-1.23.jar and worked great.
It
[
https://issues.apache.org/jira/browse/TIKA-3035?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17044406#comment-17044406
]
David Eric Pugh commented on TIKA-3035:
---
Here is my command:
java -cp tika-app-1.23-SNAPSHOT.jar
[
https://issues.apache.org/jira/browse/TIKA-3037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17043796#comment-17043796
]
David Eric Pugh commented on TIKA-3037:
---
[~tallison] did you see the gettingstarted.apt patch file?
[
https://issues.apache.org/jira/browse/TIKA-3038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17030925#comment-17030925
]
David Eric Pugh commented on TIKA-3038:
---
Also, the url for the plugin has changed from https to just
David Eric Pugh created TIKA-3038:
-
Summary: Miredot license key expired
Key: TIKA-3038
URL: https://issues.apache.org/jira/browse/TIKA-3038
Project: Tika
Issue Type: Task
[
https://issues.apache.org/jira/browse/TIKA-2253?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17030904#comment-17030904
]
David Eric Pugh commented on TIKA-2253:
---
Hi all... The license has expired ;-)
> Obtain new
[
https://issues.apache.org/jira/browse/TIKA-3037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17030900#comment-17030900
]
David Eric Pugh commented on TIKA-3037:
---
Okay, I've attached a SVN DIFF patch file to the
[
https://issues.apache.org/jira/browse/TIKA-3037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
David Eric Pugh updated TIKA-3037:
--
Attachment: gettingstarted.apt.patch
> Tika Docs should highlight Tika-Server
>
[
https://issues.apache.org/jira/browse/TIKA-3037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17030862#comment-17030862
]
David Eric Pugh commented on TIKA-3037:
---
Okay, in
[
https://issues.apache.org/jira/browse/TIKA-3037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17030806#comment-17030806
]
David Eric Pugh commented on TIKA-3037:
---
I put some edits into the wiki at
[
https://issues.apache.org/jira/browse/TIKA-3037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17030766#comment-17030766
]
David Eric Pugh commented on TIKA-3037:
---
Thanks [~nick]
> Tika Docs should highlight Tika-Server
>
[
https://issues.apache.org/jira/browse/TIKA-3037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17030760#comment-17030760
]
David Eric Pugh commented on TIKA-3037:
---
Another comment, so the page
[
https://issues.apache.org/jira/browse/TIKA-3037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17030748#comment-17030748
]
David Eric Pugh commented on TIKA-3037:
---
So... Where does the HTML for the website live? What is
David Eric Pugh created TIKA-3037:
-
Summary: Tika Docs should highlight Tika-Server
Key: TIKA-3037
URL: https://issues.apache.org/jira/browse/TIKA-3037
Project: Tika
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/TIKA-3010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16995174#comment-16995174
]
David Eric Pugh commented on TIKA-3010:
---
Made more progress. Now, when you run the `package` goal
[
https://issues.apache.org/jira/browse/TIKA-3010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
David Eric Pugh updated TIKA-3010:
--
Flags: Patch,Important (was: Important)
> Tika needs service installation script
>
David Eric Pugh created TIKA-3010:
-
Summary: Tika needs service installation script
Key: TIKA-3010
URL: https://issues.apache.org/jira/browse/TIKA-3010
Project: Tika
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/TIKA-2968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16957801#comment-16957801
]
David Eric Pugh commented on TIKA-2968:
---
And on a related aspect, maybe, if we want the Verbose mode
[
https://issues.apache.org/jira/browse/TIKA-2968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16957799#comment-16957799
]
David Eric Pugh commented on TIKA-2968:
---
Hey community, any chance of this being added for 1.23, or
David Eric Pugh created TIKA-2971:
-
Summary: Link to download OpenNLP models needs to be http not https
Key: TIKA-2971
URL: https://issues.apache.org/jira/browse/TIKA-2971
Project: Tika
[
https://issues.apache.org/jira/browse/TIKA-2624?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16957204#comment-16957204
]
David Eric Pugh commented on TIKA-2624:
---
I am rereading this thread via JIRA versus the github PR,
[
https://issues.apache.org/jira/browse/TIKA-2970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16955612#comment-16955612
]
David Eric Pugh commented on TIKA-2970:
---
It's a work in progress, however here is a unit test:
David Eric Pugh created TIKA-2970:
-
Summary: Configuring Tesseract for OCR of PDF via Tika Config is
not working
Key: TIKA-2970
URL: https://issues.apache.org/jira/browse/TIKA-2970
Project: Tika
[
https://issues.apache.org/jira/browse/TIKA-2705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16955515#comment-16955515
]
David Eric Pugh commented on TIKA-2705:
---
I know this is marked as resolved, but I'm definitely not
[
https://issues.apache.org/jira/browse/TIKA-2969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16955498#comment-16955498
]
David Eric Pugh commented on TIKA-2969:
---
I noticed that when I run `mvn test` the output is:
```
David Eric Pugh created TIKA-2969:
-
Summary: Unit test for TesseractOCRParserTest.java has confusing
behavior when Tesseract not on path
Key: TIKA-2969
URL: https://issues.apache.org/jira/browse/TIKA-2969
David Eric Pugh created TIKA-2968:
-
Summary: Display specific command for Tesseract if you are running
in Verbose mode
Key: TIKA-2968
URL: https://issues.apache.org/jira/browse/TIKA-2968
Project:
[
https://issues.apache.org/jira/browse/TIKA-2931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16918723#comment-16918723
]
Eric Pugh commented on TIKA-2931:
-
Okay, I've made a PR that fixes this problem, with a test.
[
https://issues.apache.org/jira/browse/TIKA-2931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16918137#comment-16918137
]
Eric Pugh commented on TIKA-2931:
-
Looks like the TikaCLI test does rely on this behavior...
Eric Pugh created TIKA-2931:
---
Summary: Tika CLI shouldn't log with System.out.println
Key: TIKA-2931
URL: https://issues.apache.org/jira/browse/TIKA-2931
Project: Tika
Issue Type: Improvement
Eric Pugh created TIKA-2106:
---
Summary: "hocr" case on Linux fails, but works on OSX. Related to
TIKA-2093
Key: TIKA-2106
URL: https://issues.apache.org/jira/browse/TIKA-2106
Project: Tika
Issue
[
https://issues.apache.org/jira/browse/TIKA-2093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15534613#comment-15534613
]
Eric Pugh edited comment on TIKA-2093 at 9/30/16 12:52 AM:
---
BTW, just got to
[
https://issues.apache.org/jira/browse/TIKA-2093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15534613#comment-15534613
]
Eric Pugh commented on TIKA-2093:
-
BTW, just got to updating my project with the latest 1.14-SNAPSHOT, and
[
https://issues.apache.org/jira/browse/TIKA-2093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15516177#comment-15516177
]
Eric Pugh commented on TIKA-2093:
-
Thanks for this, and the addition of the HOCRPassthroughHandler, I'll
Eric Pugh created TIKA-2093:
---
Summary: Add hOCR output type to the TesseractOCRParser
Key: TIKA-2093
URL: https://issues.apache.org/jira/browse/TIKA-2093
Project: Tika
Issue Type: Improvement