[
https://issues.apache.org/jira/browse/TIKA-2107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15549934#comment-15549934
]
Tim Allison commented on TIKA-2107:
---
One recommendation from Twitter was Libre Office commandline
Great writeup, Tim, thanks for taking the time to tell people about things
that we do!
Dominik.
On Wed, Oct 5, 2016 at 7:56 PM, Allison, Timothy B.
wrote:
> All,
>
> I recently blogged about some of the work we're doing with a large scale
> regression corpus to make Tika,
All,
I recently blogged about some of the work we're doing with a large scale
regression corpus to make Tika, POI and PDFBox more robust and to identify
regressions before release. If you'd like to chip in with recommendations,
requests or Hadoop/Spark clusters (why not shoot for the stars),
Tim this is GREAT!
Please link it from the wiki that mentions web resource document links. I think:
http://wiki.apache.org/tika/TikaResources
I fell behind on spinning the release. Will try and make progress today.
Chris
++
Chris
[
https://issues.apache.org/jira/browse/TIKA-2107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15548782#comment-15548782
]
Gaurav commented on TIKA-2107:
--
Please suggest a workaround to parse these files.
> Old MS Word files give
On Wed, 5 Oct 2016, Apache Jenkins Server wrote:
The Apache Jenkins build system has built tika-2.x (build #156)
Check console output at https://builds.apache.org/job/tika-2.x/156/ to view the
results.
Another one for our Jenkins experts. Looks like it needs a bit more memory
for the job,
The Apache Jenkins build system has built tika-2.x (build #156)
Status: Failure
Check console output at https://builds.apache.org/job/tika-2.x/156/ to view the
results.
On Wed, 5 Oct 2016, Apache Jenkins Server wrote:
The Apache Jenkins build system has built tika-2.x-windows (build #60)
Check console output at https://builds.apache.org/job/tika-2.x-windows/60/ to
view the results.
Anyone with Jenkins-foo able to fix our Windows Jenkin builds? This failed
[
https://issues.apache.org/jira/browse/TIKA-2107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15548427#comment-15548427
]
Nick Burch commented on TIKA-2107:
--
The attached file is an old Word 2 file, not supported by POI and
[
https://issues.apache.org/jira/browse/TIKA-2109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julian updated TIKA-2109:
-
Attachment: zafar-bug-9.docx
> OutOfMemory when parsing 5MB word document
>
Julian created TIKA-2109:
Summary: OutOfMemory when parsing 5MB word document
Key: TIKA-2109
URL: https://issues.apache.org/jira/browse/TIKA-2109
Project: Tika
Issue Type: Bug
Affects Versions:
11 matches
Mail list logo