Luca Della Toffola created TIKA-1149:
Summary: 12% performance improvement by caching in CompositeParser
Key: TIKA-1149
URL: https://issues.apache.org/jira/browse/TIKA-1149
Project: Tika
[
https://issues.apache.org/jira/browse/TIKA-1149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Luca Della Toffola updated TIKA-1149:
-
Attachment: CompositeParser.patch
ParseContext.patch
12% performance
[
https://issues.apache.org/jira/browse/TIKA-1149?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13715180#comment-13715180
]
Jukka Zitting commented on TIKA-1149:
-
Note that for example
Tim Allison created TIKA-1150:
-
Summary: Extract text from textbox in XLSX
Key: TIKA-1150
URL: https://issues.apache.org/jira/browse/TIKA-1150
Project: Tika
Issue Type: New Feature
[
https://issues.apache.org/jira/browse/TIKA-1150?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1150:
--
Attachment: testEXCEL_textbox.xlsx
Simple file that shows issue.
Extract text from
Hi Ken,
Yes, by other tika projects I meant tika-app, tika-bundle, tika-xmp, etc., and
yes, each sub-project would end up with its own test-jar.
It probably makes more sense to just add the plugin to each project
individually.
Since there's been no opposition to the concept in general I'll
Ray Gauss II created TIKA-1151:
--
Summary: Maven Build Should Automatically Produce test-jar Artifacts
Key: TIKA-1151
URL: https://issues.apache.org/jira/browse/TIKA-1151
Project: Tika
Issue