[
https://issues.apache.org/jira/browse/TIKA-2093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15515169#comment-15515169
]
Hudson commented on TIKA-2093:
--
SUCCESS: Integrated in Jenkins build tika-2.x #148 (See
[http
[
https://issues.apache.org/jira/browse/TIKA-2093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15515128#comment-15515128
]
Hudson commented on TIKA-2093:
--
FAILURE: Integrated in Jenkins build tika-2.x-windows #52 (See
The Apache Jenkins build system has built tika-2.x-windows (build #52)
Status: Still Failing
Check console output at https://builds.apache.org/job/tika-2.x-windows/52/ to
view the results.
[
https://issues.apache.org/jira/browse/TIKA-2093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15515082#comment-15515082
]
Hudson commented on TIKA-2093:
--
SUCCESS: Integrated in Jenkins build Tika-trunk #1106 (See
[h
[
https://issues.apache.org/jira/browse/TIKA-1627?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison resolved TIKA-1627.
---
Resolution: Won't Fix
> Authentication for fileUrl
> --
>
> Key
[
https://issues.apache.org/jira/browse/TIKA-1627?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15515073#comment-15515073
]
Tim Allison commented on TIKA-1627:
---
We removed fileUrl in Tika 1.10 because it was a [se
[
https://issues.apache.org/jira/browse/TIKA-2093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15515035#comment-15515035
]
Tim Allison edited comment on TIKA-2093 at 9/23/16 1:26 AM:
[~e
[
https://issues.apache.org/jira/browse/TIKA-2093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15515035#comment-15515035
]
Tim Allison commented on TIKA-2093:
---
[~epugh], I made a few modifications. The biggest w
[
https://issues.apache.org/jira/browse/TIKA-2093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15515007#comment-15515007
]
ASF GitHub Bot commented on TIKA-2093:
--
Github user asfgit closed the pull request at:
Github user asfgit closed the pull request at:
https://github.com/apache/tika/pull/133
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enable
[
https://issues.apache.org/jira/browse/TIKA-2093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison reassigned TIKA-2093:
-
Assignee: Tim Allison
> Add hOCR output type to the TesseractOCRParser
> -
[
https://issues.apache.org/jira/browse/TIKA-2093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-2093:
--
Description: I've tweaked the TesseractOCRParser and TesseractOCRConfig to
add the "txt" or "hocr" parame
[
https://issues.apache.org/jira/browse/TIKA-2093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-2093:
--
Description: FI've tweaked the TesseractOCRParser and TesseractOCRConfig to
add the "txt" or "hocr" param
[
https://issues.apache.org/jira/browse/TIKA-2093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15514642#comment-15514642
]
Tim Allison commented on TIKA-2093:
---
On mobile, can't do full review. If hocr output is x
[
https://issues.apache.org/jira/browse/TIKA-2093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15514215#comment-15514215
]
ASF GitHub Bot commented on TIKA-2093:
--
GitHub user epugh opened a pull request:
GitHub user epugh opened a pull request:
https://github.com/apache/tika/pull/133
add hOCR output format to TesseractParser TIKA-2093
Small change to Tesseract OCR code to add the hOCR outputType. In the
future we can add `pdf` and `tsv` as output types as well.
First pa
[
https://issues.apache.org/jira/browse/TIKA-2091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15514157#comment-15514157
]
Tim Allison edited comment on TIKA-2091 at 9/22/16 7:07 PM:
Thi
[
https://issues.apache.org/jira/browse/TIKA-2091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15514163#comment-15514163
]
Rodrigo Rosenfeld Rosas commented on TIKA-2091:
---
Thanks for your investigatio
[
https://issues.apache.org/jira/browse/TIKA-2091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison resolved TIKA-2091.
---
Resolution: Not A Problem
Fix Version/s: (was: 1.7)
This particular exception is caused by S
[
https://issues.apache.org/jira/browse/TIKA-2091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15514142#comment-15514142
]
Rodrigo Rosenfeld Rosas commented on TIKA-2091:
---
Great, good job :) Anyway, t
[
https://issues.apache.org/jira/browse/TIKA-2091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15514131#comment-15514131
]
Tim Allison commented on TIKA-2091:
---
Y, I'm able to reproduce it in Solr trunk. The issu
[
https://issues.apache.org/jira/browse/TIKA-2091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15514103#comment-15514103
]
Rodrigo Rosenfeld Rosas commented on TIKA-2091:
---
I just tried running on Solr
[
https://issues.apache.org/jira/browse/TIKA-2091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15513977#comment-15513977
]
Rodrigo Rosenfeld Rosas edited comment on TIKA-2091 at 9/22/16 5:51 PM:
-
[
https://issues.apache.org/jira/browse/TIKA-2091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15513977#comment-15513977
]
Rodrigo Rosenfeld Rosas commented on TIKA-2091:
---
I just confirmed it happens
Eric Pugh created TIKA-2093:
---
Summary: Add hOCR output type to the TesseractOCRParser
Key: TIKA-2093
URL: https://issues.apache.org/jira/browse/TIKA-2093
Project: Tika
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/TIKA-2092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Craig Pfeifer updated TIKA-2092:
Description:
"A general-purpose, deep learning-based system to decompile an image into
presentationa
Craig Pfeifer created TIKA-2092:
---
Summary: Integrate Math equation image extraction
Key: TIKA-2092
URL: https://issues.apache.org/jira/browse/TIKA-2092
Project: Tika
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/TIKA-2091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15513795#comment-15513795
]
Rodrigo Rosenfeld Rosas commented on TIKA-2091:
---
Hmm, I'll try to get more de
[
https://issues.apache.org/jira/browse/TIKA-2091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15513790#comment-15513790
]
Tim Allison commented on TIKA-2091:
---
Y, this is the place. Thank you.
I'm not able to r
Rodrigo Rosenfeld Rosas created TIKA-2091:
-
Summary: regression: Zip bomb detected! for HTML file
Key: TIKA-2091
URL: https://issues.apache.org/jira/browse/TIKA-2091
Project: Tika
Iss
Thank you, Chris!
-Original Message-
From: Chris Mattmann [mailto:mattm...@apache.org]
Sent: Thursday, September 22, 2016 12:25 PM
To: dev@tika.apache.org
Subject: Re: Tika 1.14?
Sounds great to me Tim. If you tell me when the tests are done, I’d be happy to
RC a release!
On 9/21/1
Sounds great to me Tim. If you tell me when the tests are done, I’d be happy to
RC a release!
On 9/21/16, 11:31 AM, "Allison, Timothy B." wrote:
All,
PDFBox 2.0.3 is now integrated, I'm about to push the integration with
POI-3.15. I have a few cleanup things I'd like to take car
[
https://issues.apache.org/jira/browse/TIKA-2090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15513632#comment-15513632
]
Tim Allison commented on TIKA-2090:
---
How hard could it be? :)
http://stackoverflow.com/q
[
https://issues.apache.org/jira/browse/TIKA-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15513354#comment-15513354
]
Hudson commented on TIKA-2069:
--
SUCCESS: Integrated in Jenkins build Tika-trunk #1105 (See
[h
[
https://issues.apache.org/jira/browse/TIKA-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15513351#comment-15513351
]
Hudson commented on TIKA-2069:
--
SUCCESS: Integrated in Jenkins build tika-2.x #147 (See
[http
[
https://issues.apache.org/jira/browse/TIKA-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15513279#comment-15513279
]
Hudson commented on TIKA-2069:
--
FAILURE: Integrated in Jenkins build tika-2.x-windows #51 (See
The Apache Jenkins build system has built tika-2.x-windows (build #51)
Status: Still Failing
Check console output at https://builds.apache.org/job/tika-2.x-windows/51/ to
view the results.
[
https://issues.apache.org/jira/browse/TIKA-2088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Christian Weber closed TIKA-2088.
-
Resolution: Not A Problem
Somehow I messed up picking the correct Project.
I'm sorry for the incon
38 matches
Mail list logo