Manish created TIKA-2544:
Summary: Docx Numbering Issue
Key: TIKA-2544
URL: https://issues.apache.org/jira/browse/TIKA-2544
Project: Tika
Issue Type: Bug
Components: parser
Affects Vers
[
https://issues.apache.org/jira/browse/TIKA-642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13021913#comment-13021913
]
Manish commented on TIKA-642:
-
Do we have alternate to this? There are many files that throws si
[
https://issues.apache.org/jira/browse/TIKA-642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Manish updated TIKA-642:
Description:
Few of the RTF files dont get extracted properly.
This is the stack trace:
org.apache.tika.exception.
[
https://issues.apache.org/jira/browse/TIKA-642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Manish updated TIKA-642:
Attachment: FIRM GAS GTC B RED.DOC
This is the file that is throwing the exception
> Few of RTF files not extracting
Few of RTF files not extracting properly
Key: TIKA-642
URL: https://issues.apache.org/jira/browse/TIKA-642
Project: Tika
Issue Type: Bug
Components: parser
Affects Versions: 0.9, 1.0
Need API to get list of embedded documents
--
Key: TIKA-637
URL: https://issues.apache.org/jira/browse/TIKA-637
Project: Tika
Issue Type: New Feature
Components: parser
Affects Versions:
[
https://issues.apache.org/jira/browse/TIKA-489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Manish updated TIKA-489:
Attachment: doc1.doc
I am trying to parse the attached file doc1.doc.
I has doc2 embedded within it.
Anyway to get
Embedded Documents within documents
---
Key: TIKA-489
URL: https://issues.apache.org/jira/browse/TIKA-489
Project: Tika
Issue Type: Improvement
Components: parser
Affects Versions: 0.7
E