[
https://issues.apache.org/jira/browse/OAK-5519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16247218#comment-16247218
]
Thomas Mueller commented on OAK-5519:
-
[~jsedding] This only works if text extraction is reading, but in
[
https://issues.apache.org/jira/browse/OAK-5519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16247189#comment-16247189
]
Julian Sedding commented on OAK-5519:
-
[~tmueller] could the processing thread be terminated by closing
[
https://issues.apache.org/jira/browse/OAK-5519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16246063#comment-16246063
]
Thomas Mueller commented on OAK-5519:
-
http://svn.apache.org/r1814745
[~chetanm] I have incorporated
[
https://issues.apache.org/jira/browse/OAK-5519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16244308#comment-16244308
]
Chetan Mehrotra commented on OAK-5519:
--
bq. However, after a restart, Oak will not try to extract the
[
https://issues.apache.org/jira/browse/OAK-5519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16244076#comment-16244076
]
Thomas Mueller commented on OAK-5519:
-
> Going forward we can probably store some hidden property to
[
https://issues.apache.org/jira/browse/OAK-5519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16243754#comment-16243754
]
Chetan Mehrotra commented on OAK-5519:
--
bq. the text extraction cache only puts results in the cache
[
https://issues.apache.org/jira/browse/OAK-5519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16243702#comment-16243702
]
Thomas Mueller commented on OAK-5519:
-
I found out why there are two threads consuming 100% each, and
[
https://issues.apache.org/jira/browse/OAK-5519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16243573#comment-16243573
]
Thomas Mueller commented on OAK-5519:
-
My current approach is: extract larger binaries using a separate
[
https://issues.apache.org/jira/browse/OAK-5519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16104727#comment-16104727
]
Chetan Mehrotra commented on OAK-5519:
--
bq. it does nothing except throw an exception / error / out of
[
https://issues.apache.org/jira/browse/OAK-5519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16101666#comment-16101666
]
Thomas Mueller commented on OAK-5519:
-
[~catholicon] and [~chetanm] I think we should try the "Memory of
[
https://issues.apache.org/jira/browse/OAK-5519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15996500#comment-15996500
]
Thomas Mueller commented on OAK-5519:
-
Do we have a test case (for example a PDF file that runs out of
[
https://issues.apache.org/jira/browse/OAK-5519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15976491#comment-15976491
]
Chetan Mehrotra commented on OAK-5519:
--
*Problematic Binary Handling*
h3. A - Out of process
Best
[
https://issues.apache.org/jira/browse/OAK-5519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15976472#comment-15976472
]
Chetan Mehrotra commented on OAK-5519:
--
bq. It probably makes sense to deal with OOME as well (at
[
https://issues.apache.org/jira/browse/OAK-5519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15976382#comment-15976382
]
Thomas Mueller commented on OAK-5519:
-
I recently saw OutOfMemory error during the index update; I'm not
[
https://issues.apache.org/jira/browse/OAK-5519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15840316#comment-15840316
]
Alexander Klimetschek commented on OAK-5519:
Related issues:
* OAK-4939 addresses this in 1.5